Beitrag in einem Tagungsband
Not Enough Data to Be Fair? Evaluating Fairness Implications of Data Scarcity Solutions
Details zur Publikation
Autor(inn)en: | Karst, F.; Li, M.; Reinhard, P.; Leimeister, J. |
Herausgeber: | Bui, Tung X. |
Verlag: | Department of IT Management, Shidler College of Business, University of Hawaii |
Verlagsort / Veröffentlichungsort: | Honolulu |
Publikationsjahr: | 2025 |
Zeitschrift: | Proceedings of the 58th Hawaii International Conference on System Sciences |
Seitenbereich: | 6886-6895 |
Buchtitel: | Proceedings of the 58th Hawaii International Conference on System Sciences : Hilton Waikoloa Village, January 7-10, 2025 |
eISBN: | 978-0-9981331-8-8 |
URN / URL: |
Sprachen: | Englisch |
This study explores the implications of the use of data scarcity solutions on fairness in machine learning, specifically in consumer credit interest rate prediction. We develop a comprehensive taxonomy of Data Scarcity Solutions (DSS) by analyzing academic literature, data science competitions, and practical implementations. We identify six distinct DSS clusters: Data Extension, Pre-Training, Public Data Inclusion, Data Sharing, Federated Learning, and Active Learning. Our evaluation shows that most DSS enhance both performance and fairness, with minimal negative correlation between the two. Notably, approaches incorporating external or synthetic data significantly improve fairness. This research contributes to understanding DSS beyond algorithmic performance, providing a framework for evaluating their societal impact. Furthermore, it offers practitioners a taxonomy to select the right method for tackling data scarcity and addresses fairness concerns in real-world scenarios.
Schlagwörter
Consumer Credit, Data Scarcity, Machine Learning, Taxonomy