Have a personal or library account? Click to login
Fault Tolerance of Cloud Infrastructure with Machine Learning Cover

Fault Tolerance of Cloud Infrastructure with Machine Learning

Open Access
|Nov 2023

Abstract

Enhancing the fault tolerance of cloud systems and accurately forecasting cloud performance are pivotal concerns in cloud computing research. This research addresses critical concerns in cloud computing by enhancing fault tolerance and forecasting cloud performance using machine learning models. Leveraging the Google trace dataset with 10000 cloud environment records encompassing diverse metrics, we systematically have employed machine learning algorithms, including linear regression, decision trees, and gradient boosting, to construct predictive models. These models have outperformed baseline methods, with C5.0 and XGBoost showing exceptional accuracy, precision, and reliability in forecasting cloud behavior. Feature importance analysis has identified the ten most influential factors affecting cloud system performance. This work significantly advances cloud optimization and reliability, enabling proactive monitoring, early performance issue detection, and improved fault tolerance. Future research can further refine these predictive models, enhancing cloud resource management and ultimately improving service delivery in cloud computing.

DOI: https://doi.org/10.2478/cait-2023-0034 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702
Language: English
Page range: 26 - 50
Submitted on: Jul 13, 2023
|
Accepted on: Nov 7, 2023
|
Published on: Nov 30, 2023
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2023 Chetankumar Kalaskar, S. Thangam, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.