
This paper compares traditional credit scoring methods, deep learning models, and large language models (LLMs), using synthetic data to protect privacy and ensure consistency. Credit scoring has traditionally used methods like logistic regression and new AI models which may improve prediction accuracy. In this paper it was tested and evaluated these baseline methods (logistic regression), deep learning (Gradient Boosting Machine and Neural Networks), and LLM-based models for feature extraction and prediction looking at performance in areas like accuracy, precision, and recall. The results show that deep learning and LLM-based models perform better with complex data, while traditional models still work well with lower computational demands. This paper provides valuable insights into balancing accuracy, interpretability, and computational efficiency when developing credit scoring models.
© 2025 Andreea-Mădălina Bozagiu, Georgian-Dănuț Mihai, Andrei Costin Neacşu, George Alexandru Neacşu, published by Bucharest University of Economic Studies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.