Sector-specific financial forecasting with machine learning algorithm and SHAP interaction values

Ergenç, Cansu; Aktaş, Rafet

Sector-specific financial forecasting with machine learning algorithm and SHAP interaction values

Financial Internet Quarterly

Volume 21 (2025): Issue 1 (March 2025)

By:

Cansu Ergenç

and Rafet Aktaş

Open Access

|Mar 2025

References

Abbas, A.T., Helmy, M.O., Al-Abduljabbar, A.A., Soliman, M.S., Hasan, A.S. & Elkaseer, A. (2023). Precision face milling of maraging steel 350: An experimental investigation and optimization using different machine learning techniques. Machines, 11(11), 1-20, https://doi.org/10.3390/machines11111001.
Search in Google Scholar Back to article
Abdulrazzak, A.Y., Mohammed, S.L., Al-Naji, A. & Chahl, J. (2024). Real-time jaundice detection in neonates based on machine learning models. BioMedInformatics, 4(1), 623-637, https://doi.org/10.3390/biomedinformatics4010034.
Search in Google Scholar Back to article
Afreen, M. (2020). Review paper on composite leading index creation for forecasting the Bangladeshi financial sector. International Journal of Finance & Banking Studies, 9(4), 23-32, https://doi.org/10.20525/ijfbs.v9i4.791.
Search in Google Scholar Back to article
Akinrinola, O., Addy, W.A., Ajayi-Nifise, A.O., Odeyemi, O. & Falaiye, T. (2024). Application of machine learning in tax prediction: A review with practical approaches. Global Journal of Engineering and Technology Advances, 18(2), 102-117, https://doi.org/10.30574/gjeta.2024.18.2.0028.
Search in Google Scholar Back to article
Avelar, E.A. & Jordão, R.V.D. (2024). The role of artificial intelligence in the decision-making process: A study on the financial analysis and movement forecasting of the world’s largest stock exchanges. Management Decision, Pre-print, 1-19, https://doi.org/10.1108/MD-09-2023-1625.
Search in Google Scholar Back to article
Azad, M., Chikalov, I., Hussain, S., Moshkov, M. & Zielosko, B. (2022). Greedy algorithms for decision trees with hypotheses. arXiv Preprint, https://arxiv.org/abs/2203.08848.
Search in Google Scholar Back to article
Ballings, M., Van den Poel, D., Hespeels, N. & Gryp, R. (2015). Evaluating multiple classifiers for stock price direction prediction. Expert Systems with Applications, 42(20), 7046-7056. https://doi.org/10.1016/j.eswa.2015.04.013.
Search in Google Scholar Back to article
Baptista, M.L., Goebel, K. & Henriques, E.M. (2022). Relation between prognostics predictor evaluation metrics and local interpretability SHAP values. Artificial Intelligence, 306, 1-22. https://doi.org/10.1016/j.artint.2021.103667.
Search in Google Scholar Back to article
Barauskaite, G. & Streimikiene, D. (2021). Corporate social responsibility and financial performance of companies: The puzzle of concepts, definitions and assessment methods. Corporate Social Responsibility and Environmental Management, 28(1), 278-287, https://doi.org/10.1002/csr.2033.
Search in Google Scholar Back to article
Barboza, F., Kimura, H. & Altman, E. (2017). Machine learning models and bankruptcy prediction. Expert Systems with Applications, 83, 405-417, https://doi.org/10.1016/j.eswa.2015.05.013.
Search in Google Scholar Back to article
Barnhizer, D. & Barnhizer, D. (2019). The Artificial Intelligence Contagion: Can Democracy Withstand the Imminent Transformation of Work, Wealth and the Social Order? SCB Distributors, West Rancho Dominguez.
Search in Google Scholar Back to article
Bhattacharya, A. (2022). Applied Machine Learning Explainability Techniques: Make ML models explainable and trustworthy for practical applications using LIME, SHAP, and more. Packt Publishing, Birmingham.
Search in Google Scholar Back to article
Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123-140.
Search in Google Scholar Back to article
Celen, B., Ozcelik, M.B., Turgut, F.M., Aras, C., Sivaraman, T., Kotak, Y., Geisbauer, C. & Schweiger, H.G. (2022). Calendar ageing modelling using machine learning: An experimental investigation on lithium-ion battery chemistries. Open Research Europe, 2(96), 1-24, https://doi.org/10.12688/openreseurope.14745.2.
Search in Google Scholar Back to article
Chen, C.P. & Zhang, C.Y. (2014). Data-intensive applications, challenges, techniques and technologies: A survey on big data. Information Sciences, 275, 314-347, https://doi.org/10.1016/j.ins.2014.01.015.
Search in Google Scholar Back to article
Chen, T. & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785-794), New York.
Search in Google Scholar Back to article
Claveria, O., Monte, E. & Torra, S. (2016). Combination forecasts of tourism demand with machine learning models. Applied Economics Letters, 23(6), 428-431, https://doi.org/10.1080/13504851.2015.1078441.
Search in Google Scholar Back to article
Davis, J., Devos, L., Reyners, S. & Schoutens, W. (2020). Gradient boosting for quantitative finance. Journal of Computational Finance, 24(4), 1-30.
Search in Google Scholar Back to article
Deng, Y.H., Luo, X.Q., Yan, P., Zhang, N.Y., Liu, Y. & Duan, S.B. (2022). Outcome prediction for acute kidney injury among hospitalized children via eXtreme Gradient Boosting Algorithm. Scientific Reports, 12(1), 1-11.
Search in Google Scholar Back to article
Dong, Z., Wang, Q., Ke, Y., Zhang, W., Hong, Q., Liu, C., Liu, X., Yang, J., Xi, Y., Shi, J., Zhang, L., Zheng, Y., Lv, Q., Wang, Y., Wu, J., Sun, X., Cai, G., Qiao, S., Yin, C., Su, S. & Chen, X. (2022). Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records. Journal of Translational Medicine, 20(1), 1-10, https://doi.org/10.1186/s12967-022-03339-1.
Search in Google Scholar Back to article
Edafetanure-Ibeh, F.T. (2024). Evaluating machine learning algorithms for cervical cancer prediction: A comparative analysis, Preprint, doi: 10.1109/ACCESS.2024.3469869.
Search in Google Scholar Back to article
Ekanayake, I.U., Meddage, D.P.P. & Rathnayake, U. (2022). A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Studies in Construction Materials, 16, 1-20, https://doi.org/10.1016/j.cscm.2022.e01059.
Search in Google Scholar Back to article
El Bouchefry, K. & de Souza, R.S. (2020). Learning in big data: Introduction to machine learning. In: Knowledge discovery in big data from astronomy and earth observation (pp. 225-249). Elsevier, Amsterdam.
Search in Google Scholar Back to article
Fan, J. (2023). Predicting credit default by SVM and decision tree model based on credit card data. BCP Business & Management, 38, 28-33.
Search in Google Scholar Back to article
Fraz, N. (2024). A study on comparison of various machine learning models for the best prediction of 305 days first lactation milk yield, Research Square, 1-16, https://doi.org/10.21203/rs.3.rs-4484720/v1.
Search in Google Scholar Back to article
Friedman, J., Hastie, T. & Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1), 1-22, https://pmc.ncbi.nlm.nih.gov/articles/PMC2929880/.
Search in Google Scholar Back to article
Geng, R., Bose, I. & Chen, X. (2015). Prediction of financial distress: An empirical study of listed Chinese companies using data mining. European Journal of Operational Research, 241(1), 236-247.
Search in Google Scholar Back to article
Genuer, R. (2012). Variance reduction in purely random forests. Journal of Nonparametric Statistics, 24(3), 543-562.
Search in Google Scholar Back to article
George, A.S. (2024). Finance 4.0: The transformation of financial services in the digital age, Partners Universal Innovative Research Publication, 2(3), 104-125.
Search in Google Scholar Back to article
Gianola, D., Weigel, K.A., Krämer, N., Stella, A. & Schön, C.C. (2014). Enhancing genome-enabled prediction by bagging genomic BLUP. PLoS One, 9(4), 1-18, https://doi.org/10.1371/journal.pone.0091693.
Search in Google Scholar Back to article
Grissa, D., Nytoft Rasmussen, D., Krag, A., Brunak, S. & Juhl Jensen, L. (2020). Alcoholic liver disease: A registry view on comorbidities and disease prediction. PLoS Computational Biology, 16(9), 1-19.
Search in Google Scholar Back to article
Gupta, A., Sharma, A. & Goel, A. (2017). Review of regression analysis models. International Journal of Engineering Research & Technology, 6(8), 58-61.
Search in Google Scholar Back to article
Gzar, D.A., Mahmood, A.M. & Abbas, M.K. (2022). A comparative study of regression machine learning algorithms: Tradeoff between accuracy and computational complexity. Mathematical Modelling of Engineering Problems, 9(5), 1-8, https://doi.org/10.18280/mmep.090508.
Search in Google Scholar Back to article
Hashemi, S.K., Mirtaheri, S.L. & Greco, S. (2022). Fraud detection in banking data by machine learning techniques. IEEE Access, 11, 3034-3043, https://doi.org/10.1109/ACCESS.2022.3232287.
Search in Google Scholar Back to article
Ionescu, S.A. & Diaconita, V. (2023). Transforming financial decision-making: The interplay of AI, cloud computing, and advanced data management technologies. International Journal of Computers Communications & Control, 18(6), 1-9, https://doi.org/10.15837/ijccc.2023.6.5735.
Search in Google Scholar Back to article
Jalal Uddin, M., Li, Y., Abdus Sattar, M. & Mistry, S. (2022). Climatic water balance forecasting with machine learning and deep learning models over Bangladesh. International Journal of Climatology, 42(16), 10083-10106.
Search in Google Scholar Back to article
Jiang, X., Zhou, R., Jiang, F., Yan, Y., Zhang, Z. & Wang, J. (2024). Construction of diagnostic models for the progression of hepatocellular carcinoma using machine learning. Frontiers in Oncology, 14, 1-11.
Search in Google Scholar Back to article
Johnson, R. & Zhang, T. (2013). Learning nonlinear functions using regularized greedy forest. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(5), 942-954, https://doi.org/10.1109/TPAMI.2013.159.
Search in Google Scholar Back to article
Kadiyala, A. & Kumar, A. (2018). Applications of Python to evaluate the performance of decision tree-based boosting algorithms. Environmental Progress & Sustainable Energy, 37(2), 618-623.
Search in Google Scholar Back to article
Kareem, M.K., Aborisade, O.D., Onashoga, S.A., Sutikno, T. & Olayiwola, O.M. (2023). Efficient model for detecting application layer distributed denial of service attacks. Bulletin of Electrical Engineering and Informatics, 12(1), 441-450, https://doi.org/10.11591/eei.v12i1.3871.
Search in Google Scholar Back to article
Katal, A., Wazid, M. & Goudar, R.H. (2013). Big data: Issues, challenges, tools, and good practices. In 2013 Sixth International Conference on Contemporary Computing (IC3) (pp. 404-409). IEEE, New Jersey.
Search in Google Scholar Back to article
Khalaf, G. (2012). A proposed ridge parameter to improve the least square estimator. Journal of Modern Applied Statistical Methods, 11(2), 443-449, https://doi.org/10.22237/jmasm/1351743240.
Search in Google Scholar Back to article
Kibria, B.G. & Saleh, A.M.E. (2004). Performance of positive rule estimator in the ill-conditioned Gaussian regression model. Calcutta Statistical Association Bulletin, 55(4), 209-240.
Search in Google Scholar Back to article
Ko, P.C., Lin, P.C., Do, H.T. & Huang, Y.F. (2022). P2P lending default prediction based on AI and statistical models. Entropy, 24(6), 1-23, https://doi.org/10.3390/e24060801.
Search in Google Scholar Back to article
Kourtellis, N., Morales, G.D.F., Bifet, A. & Murdopo, A. (2016, December). VHT: Vertical Hoeffding Tree. In 2016 IEEE International Conference on Big Data (Big Data) (pp. 915-922). IEEE, New Jersey.
Search in Google Scholar Back to article
Kulkarni, V.Y. & Sinha, P. K. (2012, July). Pruning of random forest classifiers: A survey and future directions. In 2012 International Conference on Data Science & Engineering (ICDSE) (pp. 64-68). IEEE, New Jersey.
Search in Google Scholar Back to article
Kumar, B., Sharma, M., Bhat, A. & Kumar, P. (2021). An analysis of Indian agricultural workers: A ridge regression approach. Agricultural Economics Research Review, 34(1), 121-127.
Search in Google Scholar Back to article
Kumar, R. (2017). Machine learning and cognition in enterprises: Business intelligence transformed. Apress, New York, https://doi.org/10.5958/0974-0279.2021.00010.0.
Search in Google Scholar Back to article
Li, R., Shinde, A., Liu, A., Glaser, S., Lyou, Y., Yuh, B., Wong, J. & Amini, A. (2020). Machine learning–based interpretation and visualization of nonlinear interactions in prostate cancer survival. JCO Clinical Cancer Informatics, 4, 637-646, https://doi.org/10.1200/CCI.20.00002.
Search in Google Scholar Back to article
Li, S., Qin, J., He, M. & Paoli, R. (2020). Fast evaluation of aircraft icing severity using machine learning based on XGBoost. Aerospace, 7(4), 1-18, https://doi.org/10.3390/aerospace7040036.
Search in Google Scholar Back to article
Li, Z. (2022). Extracting spatial effects from machine learning models using local interpretation methods: An example of SHAP and XGBoost, Computers, Environment and Urban Systems, 96, 1-18.
Search in Google Scholar Back to article
Li, Z. (2024). Evaluation of sailing boat performance based on ridge regression and mathematical model optimization. Highlights in Science, Engineering and Technology, 85, 1275-1283.
Search in Google Scholar Back to article
Lin, T.C. (2019). Artificial intelligence, finance, and the law. Fordham Law Review, 88, 531-560.
Search in Google Scholar Back to article
Lo, W.T., Chang, Y.S., Sheu, R.K., Chiu, C.C. & Yuan, S.M. (2014). CUDT: A CUDA-based decision tree algorithm. The Scientific World Journal, 2014(1), 1-12, https://doi.org/10.1155/2014/745640.
Search in Google Scholar Back to article
Long, X., Kampouridis, M. & Jarchi, D. (2022). An in-depth investigation of genetic programming and nine other machine learning algorithms in a financial forecasting problem. In: 2022 IEEE Congress on Evolutionary Computation (CEC) (pp. 01-08). IEEE, New Jersey.
Search in Google Scholar Back to article
Lundberg, S.M., Erion, G.G. & Lee, S.I. (2018). Consistent individualized feature attribution for tree ensembles. arXiv Preprint 3, https://doi.org/10.48550/arXiv.1802.03888.
Search in Google Scholar Back to article
Lundberg, S.M., Erion, G., Chen, H., DeGrave, A., Prutkin, J.M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N. & Lee, S.I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56-67, https://doi.org/10.1038/s42256-019-0138-9.
Search in Google Scholar Back to article
Magnan, M., Menini, A. & Parbonetti, A. (2015). Fair value accounting: Information or confusion for financial markets? Review of Accounting Studies, 20, 559-591, https://doi.org/10.1007/s11142-014-9306-7.
Search in Google Scholar Back to article
Mahalakshmi, V., Kulkarni, N., Kumar, K.P., Kumar, K.S., Sree, D.N. & Durga, S. (2022). The role of implementing artificial intelligence and machine learning technologies in the financial services industry for creating competitive intelligence. Materials Today: Proceedings, 56, 2252-2255.
Search in Google Scholar Back to article
Malthouse, E.C. (1999). Ridge regression and direct marketing scoring models. Journal of Interactive Marketing, 13(4), 10-23, https://doi.org/10.1002/(SICI)1520-6653(199923)13:4%3C10::AID-DIR2%3E3.0.CO;2-3.
Search in Google Scholar Back to article
Martini, M.L., Neifert, S.N., Shuman, W.H., Chapman, E.K., Schüpper, A.J., Oermann, E.K., Mocco, J., Todd, M., Torner, C.J., Molyeux, A., Mayer, S., Le Roux, P., Vergouwen, M.D.I., Rinkel, G.J.E., Wong, G.K.C., Kirkpatrick, P., Quinn, A., Hänggi, D., Etminan, N., van der Bergh, W.M., Jaja, B.N.R., Cusimano, M., Shweizer, A.T., Suarez, J.I., Fukuda, H., Yamagata, S., Lo, B., Airton, L.O.M., Boogarts, H.D. & Macdonald, R.L. (2021). Rescue therapy for vasospasm following aneurysmal subarachnoid hemorrhage: A propensity score-matched analysis with machine learning. Journal of Neurosurgery, 136(1), 134-147.
Search in Google Scholar Back to article
Massei, G. (2023). Algorithmic trading: An overview and evaluation of its impact on financial markets. Finance Research Letters, 47, 1-101.
Search in Google Scholar Back to article
Mayr, A. & Schmid, M. (2014). Boosting the concordance index for survival data–A unified framework to derive and evaluate biomarker combinations. PLoS One, 9(1), 1-10, https://doi.org/10.1371/journal.pone.0084483.
Search in Google Scholar Back to article
Meir, R. & Rätsch, G. (2003). An introduction to boosting and leveraging. In: Advanced Lectures on Machine Learning: Machine Learning Summer School (pp. 118-183). Springer, Berlin.
Search in Google Scholar Back to article
Mienye, I.D. & Sun, Y. (2022). A survey of ensemble learning: Concepts, algorithms, applications, and prospects. IEEE Access, 10, 99129-99149, https://doi.org/10.1371/10.1109/ACCESS.2022.3207287.
Search in Google Scholar Back to article
Mishina, Y., Murata, R., Yamauchi, Y., Yamashita, T. & Fujiyoshi, H. (2015). Boosted random forest. IEICE Transactions on Information and Systems, 98(9), 1630-1636, https://doi.org/10.1587/transinf.2014OPP0004.
Search in Google Scholar Back to article
Mitchell, R., Frank, E. & Holmes, G. (2022). GPUTreeShap: Massively parallel exact calculation of SHAP scores for tree ensembles. PeerJ Computer Science, 8, 1-25, https://doi.org/10.7717/peerj-cs.880.
Search in Google Scholar Back to article
Mohammad, O.K.J., Seno, M.E. & Dhannoon, B.N. (2024). Detailed Cloud Linear Regression Services in Cloud Computing Environment. Informatica, 48(12), 1-10.
Search in Google Scholar Back to article
Moshkov, M. (1997). Algorithms for constructing of decision trees. In: Principles of Data Mining and Knowledge Discovery: First European Symposium (pp. 335-342). Springer, Berlin.
Search in Google Scholar Back to article
Najafabadi, M.M., Villanustre, F., Khoshgoftaar, T.M., Seliya, N., Wald, R. & Muharemagic, E. (2015). Deep learning applications and challenges in big data analytics. Journal of Big Data, 2(1), 1-21.
Search in Google Scholar Back to article
Natekin, A. & Knoll, A. (2013). Gradient boosting machines, a tutorial. Frontiers in Neurorobotics, 7, 21.
Search in Google Scholar Back to article
Nguyen, D.K., Sermpinis, G. & Stasinakis, C. (2023). Big data, artificial intelligence and machine learning: A transformative symbiosis in favor of financial technology. European Financial Management, 29(2), 517-548.
Search in Google Scholar Back to article
Oliva, R. & Watson, N. (2009). Managing functional biases in organizational forecasts: A case study of consensus forecasting in supply chain planning. Production and Operations Management, 18(2), 138-151.
Search in Google Scholar Back to article
Onoja, M., Jegede, A., Blamah, N., Abimbola, O.V. & Omotehinwa, T.O. (2022). EEMDS: Efficient and effective mal-ware detection system with hybrid model based on XceptionCNN and LightGBM algorithm. Journal of Computing and Social Informatics, 1(2), 42-57, https://doi.org/10.33736/jcsi.4739.2022.
Search in Google Scholar Back to article
Orsini, N., Moore, A. & Wolk, A. (2022). Interaction analysis based on Shapley values and extreme gradient boosting: A realistic simulation and application to a large epidemiological prospective study. Frontiers in Nutrition, 9, 1-8.
Search in Google Scholar Back to article
Pan, B. (2018, February). Application of XGBoost algorithm in hourly PM2.5 concentration prediction. In: IOP Conference Series: Earth and Environmental Science (Vol. 113, p. 012127). IOP Publishing, Bristol.
Search in Google Scholar Back to article
Pandey, M.K. & Sergeeva, I. (2022). Artificial intelligence impact evaluation: Transforming paradigms in financial institutions. Mir Èkonomiki I Upravleniâ, 22(1), 147-164, https://doi.org/10.25205/2542-0429-2022-22-1-147-164.
Search in Google Scholar Back to article
Park, S., Son, S., Bae, J., Lee, D., Kim, J.J. & Kim, J. (2021). Robust spatiotemporal estimation of PM concentrations using boosting-based ensemble models. Sustainability, 13(24), 1-15.
Search in Google Scholar Back to article
Paul, B., Athithan, G. & Murty, M.N. (2009). Speeding up AdaBoost classifier with random projection. In: 2009 Seventh International Conference on Advances in Pattern Recognition (pp. 251-254), IEEE, New Jersey.
Search in Google Scholar Back to article
Penman, S. H. (2002). The quality of financial statements: Perspectives from the recent stock market bubble. Papers SSRN 319262, 1-44, http://dx.doi.org/10.2139/ssrn.319262.
Search in Google Scholar Back to article
Permana, S., Rosadi, R. & Nikki, N. (2022). Application of classification algorithm for sales prediction. TEKNOKOM, 5(2), 119-124, https://doi.org/10.31943/teknokom.v5i2.77.
Search in Google Scholar Back to article
Perrini, F., Russo, A., Tencati, A. & Vurro, C. (2011). Deconstructing the relationship between corporate social and financial performance. Journal of Business Ethics, 102, 59-76, https://doi.org/10.1007/s10551-011-1194-1.
Search in Google Scholar Back to article
Poojithaa, M. & Malathib, K. (2022). Decision tree over support vector machine for better accuracy in identifying the problem based on the Iris flower. Advances in Parallel Computing Algorithms, Tools and Paradigms, 41, 209-217.
Search in Google Scholar Back to article
Prasad, A. & Bakhshi, P. (2022). Forecasting the direction of daily changes in the India VIX index using machine learning. Journal of Risk and Financial Management, 15(12), 1-16, https://doi.org/10.3390/jrfm15120552.
Search in Google Scholar Back to article
Provost, F. & Fawcett, T. (2013). Data science and its relationship to big data and data-driven decision making. Big Data, 1(1), 51-59, https://doi.org/10.1089/big.2013.1508.
Search in Google Scholar Back to article
Rajaratnam, B., Roberts, S., Sparks, D. & Dalal, O. (2015). Lasso regression: Estimation and shrinkage via the limit of Gibbs sampling. Journal of the Royal Statistical Society: Series B Statistical Methodology, 78(1), 153-174.
Search in Google Scholar Back to article
Ramnath, S., Rock, S. & Shane, P. (2008). The financial analyst forecasting literature: A taxonomy with suggestions for further research. International Journal of Forecasting, 24(1), 34-75.
Search in Google Scholar Back to article
Rohatgi, S., Singh, K.K. & Jasuja, D. (2021). Comparative analysis of machine learning algorithm to forecast Indian stock market. In 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) (pp. 278-283). IEEE, New Jersey, https://doi.org/10.1109/ICACITE51222.2021.9404642.
Search in Google Scholar Back to article
Rosenbusch, H., Soldner, F., Evans, A.M. & Zeelenberg, M. (2021). Supervised machine learning methods in psychology: A practical introduction with annotated R code. Social and Personality Psychology Compass, 15(2), 1-25.
Search in Google Scholar Back to article
Rufo, D., Debelee, T., Ibenthal, A. & Negera, W. (2021). Diagnosis of diabetes mellitus using gradient boosting machine (LightGBM). Diagnostics, 11(9), 1-14, https://doi.org/10.3390/diagnostics11091714
Search in Google Scholar Back to article
Ryll, L. & Seidens, S. (2019). Evaluating the performance of machine learning algorithms in financial market forecasting: A comprehensive survey, arXiv preprint, 1906, https://doi.org/10.48550/arXiv.1906.07786.
Search in Google Scholar Back to article
Sairam, S., Srinivasan, S., Marafioti, G., Subathra, B., Mathisen, G. & Bekiroglu, K. (2020). Explainable Incipient Fault Detection Systems for Photovoltaic Panels. arXiv preprint, 2011, https://doi.org/10.48550/arXiv.2011.09843.
Search in Google Scholar Back to article
Samonas, M. (2015). Financial forecasting, analysis, and modelling: A framework for long-term forecasting. John Wiley & Sons, Hoboken.
Search in Google Scholar Back to article
Sandhya, V. & Padyana, A. (2021). Machine learning based crop yield prediction on geographical and climatic data. In: 2021 Sixth International Conference on Image Information Processing (ICIIP) (pp. 186-191). IEEE, New Jersey.
Search in Google Scholar Back to article
Sastry, V.V.L.N. (2020). Artificial intelligence in financial services and banking industry. Idea Publishing, London.
Search in Google Scholar Back to article
Shi, F., Lu, S., Gu, J., Lin, J., Zhao, C., You, X. & Lin, X. (2022). Modeling and evaluation of the permeate flux in forward osmosis process with machine learning. Industrial & Engineering Chemistry Research, 61(49), 18045-18056.
Search in Google Scholar Back to article
Si, Z., Niu, H. & Wang, W. (2022). Credit Risk Assessment by a Comparison Application of Two Boosting Algorithms. In: Fuzzy Systems and Data Mining VIII (pp. 34-40). IOS Press, Amsterdam.
Search in Google Scholar Back to article
Signorino, C. & Kirchner, A. (2018). Using lasso to model interactions and nonlinearities in survey data. Survey Practice, 11(1), 1-10, https://doi.org/10.29115/SP-2018-0005.
Search in Google Scholar Back to article
Siringoringo, R., Perangin, R. & Jamaluddin, J. (2021). Model hibrid genetic-XGBoost dan principal component analysis pada segmentasi dan peramalan pasar. Methomika Jurnal Manajemen Informatika Dan Komputerisasi Akuntansi, 5(2), 97-103, https://doi.org/10.46880/jmika.Vol5No2.pp97-103.
Search in Google Scholar Back to article
Soloff, J. A., Barber, R.F. & Willett, R. (2024). Bagging provides assumption-free stability. Journal of Machine Learning Research, 25(131), 1-35.
Search in Google Scholar Back to article
Sonkavde, G. (2023). Forecasting stock market prices using machine learning and deep learning models: A systematic review, performance analysis and discussion of implications. International Journal of Financial Studies, 11(3), 1-22, https://doi.org/10.3390/ijfs11030094.
Search in Google Scholar Back to article
Strobl, C., Boulesteix, A., Kneib, T., Augustin, T. & Zeileis, A. (2008). Conditional variable importance for random forests. BMC Bioinformatics, 9(1), 1-11.
Search in Google Scholar Back to article
Su, H., Lu, X., Chen, Z., Zhang, H., Lu, W. & Wu, W. (2021). Estimating coastal chlorophyll-a concentration from time-series OLCI data based on machine learning. Remote Sensing, 13(4), 1-21.
Search in Google Scholar Back to article
Suacana, I. (2024). Optimizing the 2024 governor election quick count with extreme gradient boosting (XGBoost) to increase voting prediction accuracy. International Journal of Software Engineering and Computer Science, 4(1), 91-106, https://doi.org/10.35870/ijsecs.v4i1.2286.
Search in Google Scholar Back to article
Tang, M., Zhao, Q., Ding, S., Wu, H., Li, L., Wen, L. & Huang, B. (2020). An improved LightGBM algorithm for online fault detection of wind turbine gearboxes. Energies, 13(4), 807-823.
Search in Google Scholar Back to article
Uddin, M.N., Li, L.Z., Deng, B.Y. & Ye, J. (2023). Interpretable XGBoost–SHAP machine learning technique to predict the compressive strength of environment-friendly rice husk ash concrete. Innovative Infrastructure Solutions, 8(5), 147-168, https://doi.org/10.1007/s41062-023-01122-9.
Search in Google Scholar Back to article
Ünal, A. F., Kaleli, A. Y., Ummak, E. & Albayrak, Ö. (2021, August). A Comparison of State-of-the-Art Machine Learning Algorithms on Fault Indication and Remaining Useful Life Determination by Telemetry Data. In: 8th International Conference on Future Internet of Things and Cloud (pp. 79-85). IEEE, New Jersey.
Search in Google Scholar Back to article
Wang, L., Kern, R., Yu, E., Choi, S. & Pan, J. (2023). IntelliSleepScorer, a software package with a graphic user interface for automated sleep stage scoring in mice based on a light gradient boosting machine algorithm. Scientific Reports, 13(1), 1-11, https://doi.org/10.1038/s41598-023-31288-2.
Search in Google Scholar Back to article
Wang, M. (2024). Identification of mine water source based on TPE-LightGBM. Scientific Reports, 14(1), 1-11.
Search in Google Scholar Back to article
Wang, P., Xie, M., Wang, X., Yu, J., Chen, E., Zhou, Z., Niu, Y., Song, W., Ni, Q. & Zhu, J. (2022). Comparison of nomo-gram with machine learning techniques for prediction of overall survival in patients with retroperitoneal liposarcoma, Research Square, 1, 1-20.
Search in Google Scholar Back to article
Wang, S., Pengfei, D. & Tian, Y. (2017). A novel method of statistical line loss estimation for distribution feeders based on feeder cluster and modified XGBoost. Energies, 10(12), 1-17.
Search in Google Scholar Back to article
Wei, C. (2024). Comparison of different machine learning classification models for predicting deep vein thrombosis in lower extremity fractures. Scientific Reports, 14(1), 1-8.
Search in Google Scholar Back to article
Wei, L., Zhang, Y., Wang, Z., Zhao, L., Zhang, Y., Lu, X. & Cao, L. (2020). Hyperspectral inversion of soil organic matter content based on a combined spectral index model. Sensors, 20(10), 1-17.
Search in Google Scholar Back to article
Wu, Z., Lei, T., Shen, C., Wang, Z., Cao, D. & Hou, T. (2019). ADMET evaluation in drug discovery. 19. Reliable prediction of human cytochrome P450 inhibition using artificial intelligence approaches. Journal of Chemical Information and Modeling, 59(11), 4587-4601, https://doi.org/10.1021/acs.jcim.9b00801.
Search in Google Scholar Back to article
Xiang, Y. (2024). Enhancing non-invasive colorectal cancer screening with stool DNA methylation markers and LightGBM machine learning, Research Square, 1, 1-19, https://doi.org/10.21203/rs.3.rs-3857174/v1.
Search in Google Scholar Back to article
Xiao, D., Chen, J., Zhang, K. & Qian, H. (2020). Privacy-preserving locally weighted linear regression over encrypted millions of data. IEEE Access, 8, 2247-2257, https://doi.org/10.1109/ACCESS.2019.2962700.
Search in Google Scholar Back to article
Xin, S. & Khalid, K. (2018). Modelling house price using ridge regression and lasso regression. International Journal of Engineering & Technology, 7(4), 498-501.
Search in Google Scholar Back to article
Yao, S., Wu, Q., Kang, Q., Chen, Y. & Yi, L. (2023). An interpretable XGBoost-based approach for Arctic navigation risk assessment. Risk Analysis, 44(2), 459-476, https://doi.org/10.1111/risa.14175.
Search in Google Scholar Back to article
Yin, S., Ouyang, P., Xu, D., Liu, L. & Wei, S. (2017). An AdaBoost-based face detection system using parallel configurable architecture with optimized computation. IEEE Systems Journal, 11(1), 260-271.
Search in Google Scholar Back to article
Yoo, H., Lee, K., Woo, J., Park, S., Lee, S., Joo, J., Bae, J.-S., Hwong, H.-J. & Park, B. (2022). A genome-wide association study and machine-learning algorithm analysis on the prediction of facial phenotypes by genotypes in Korean women. Clinical, Cosmetic and Investigational Dermatology, 15, 433-445.
Search in Google Scholar Back to article
Zern, A., Broelemann, K. & Kasneci, G. (2023). Interventional SHAP values and interaction values for piecewise linear regression trees. Proceedings of the AAAI Conference on Artificial Intelligence, 37(9), 11164-11173.
Search in Google Scholar Back to article
Zhan, C., Zheng, Y., Zhang, H. & Wen, Q. (2021). Random-Forest-Bagging Broad Learning System with applications for COVID-19 pandemic. IEEE Internet of Things Journal, 8, 15906-15918.
Search in Google Scholar Back to article
Zhang, B., Sethy, A., Sainath, T.N. & Ramabhadran, B. (2011). Application specific loss minimization using gradient boosting. In: IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 4880-4883). IEEE, New Jersey, https://doi.org/10.1109/ICASSP.2011.5947449.
Search in Google Scholar Back to article
Zhang, J. (2024). Optimization and application of XGBoost logging prediction model for porosity and permeability based on k-means method. Applied Sciences, 14(10), 1-18, https://doi.org/10.3390/app14103956.
Search in Google Scholar Back to article
Zhang, J. (2024). Prediction of compressive strength of geopolymer concrete landscape design: Application of the novel hybrid RF-GWO-XGBoost algorithm. Buildings, 14(3), 1-32, https://doi.org/10.3390/buildings14030591.
Search in Google Scholar Back to article
Zhang, J., Mucs, D., Norinder, U. & Svensson, F. (2019). LightGBM: An effective and scalable algorithm for prediction of chemical toxicity-application to the Tox21 and mutagenicity data sets. Journal of Chemical Information and Modeling, 59(10), 4150-4158, https://doi.org/10.1021/acs.jcim.9b00633.
Search in Google Scholar Back to article
Zhang, P., Jia, Y. & Shang, Y. (2022). Research and application of XGBoost in imbalanced data. International Journal of Distributed Sensor Networks, 18(6), 1-10, https://doi.org/10.1177/15501329221106935.
Search in Google Scholar Back to article
Zhao, G., Wang, Y. & Wang, J. (2023). Intrusion detection model of Internet of Things based on LightGBM. IEICE Transactions on Communications, 106(8), 622-634, https://doi.org/10.1587/transcom.2022EBP3169.
Search in Google Scholar Back to article
Zhou, L., Pan, S., Wang, J. & Vasilakos, A.V. (2017). Machine learning on big data: Opportunities and challenges. Neurocomputing, 237, 350-361, https://doi.org/10.1016/j.neucom.2017.01.026.
Search in Google Scholar Back to article

DOI: https://doi.org/10.2478/fiqf-2025-0004 | Journal eISSN: 2719-3454

Journal RSS Feed

Language: English

Page range: 42 - 66

Submitted on: Jul 31, 2024

Accepted on: Nov 15, 2024

Published on: Mar 26, 2025

Published by: University of Information Technology and Management in Rzeszow

In partnership with: Paradigm Publishing Services

Publication frequency: 4 times per year

Keywords:

Machine Learning Models,

SHAP,

Financial Forecasting

Related subjects:

Business and economics,

Political economics,

Economic theory, systems and structures,

Microeconomics,

Macroeconomics,

Public finance and fiscal theory

© 2025 Cansu Ergenç, Rafet Aktaş, published by University of Information Technology and Management in Rzeszow
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Previous article Volume 21 (2025): Issue 1 (March 2025)Next article