Conference proceedings article

Not Enough Data to Be Fair? Evaluating Fairness Implications of Data Scarcity Solutions



Publication Details
Authors:
Karst, F.; Li, M.; Reinhard, P.; Leimeister, J.
Editor:
Bui, Tung X.
Publisher:
Department of IT Management, Shidler College of Business, University of Hawaii
Place:
Honolulu

Publication year:
2025
Journal:
Proceedings of the 58th Hawaii International Conference on System Sciences
Pages range :
6886-6895
Book title:
Proceedings of the 58th Hawaii International Conference on System Sciences : Hilton Waikoloa Village, January 7-10, 2025
eISBN:
978-0-9981331-8-8
Languages:
English


Abstract

This study explores the implications of the use of data scarcity solutions on fairness in machine learning, specifically in consumer credit interest rate prediction. We develop a comprehensive taxonomy of Data Scarcity Solutions (DSS) by analyzing academic literature, data science competitions, and practical implementations. We identify six distinct DSS clusters: Data Extension, Pre-Training, Public Data Inclusion, Data Sharing, Federated Learning, and Active Learning. Our evaluation shows that most DSS enhance both performance and fairness, with minimal negative correlation between the two. Notably, approaches incorporating external or synthetic data significantly improve fairness. This research contributes to understanding DSS beyond algorithmic performance, providing a framework for evaluating their societal impact. Furthermore, it offers practitioners a taxonomy to select the right method for tackling data scarcity and addresses fairness concerns in real-world scenarios.



Keywords
Consumer Credit, Data Scarcity, Machine Learning, Taxonomy


Authors/Editors

Last updated on 2025-20-06 at 09:02