Have a personal or library account? Click to login
Advanced hybrid machine learning models for estimating chloride penetration resistance of concrete structures for durability assessment: optimization and hyperparameter tuning Cover

Advanced hybrid machine learning models for estimating chloride penetration resistance of concrete structures for durability assessment: optimization and hyperparameter tuning

Open Access
|Dec 2025

Full Article

1
Introduction

Steel-reinforced concrete structures in coastal environments often suffer deterioration due to exposure to aggressive chemical agents, including water [1], 2], carbon dioxide [2], [3], [4], [5], [6], [7], sulphates [7], [8], [9], and especially chloride ions [10], [11], [12]. Chlorides are the primary cause of corrosion in steel reinforcement, resulting in reduced durability, cracking, spalling, and loss of strength in concrete structures [13], 14]. When chloride ions penetrate the concrete’s pore system, they disrupt the protective oxide layer on steel reinforcement, initiating corrosion that can cause severe structural damage and significant economic costs [15]. Corrosion begins once the chloride concentration at the steel surface exceeds a critical threshold, with the time to onset influenced by the concrete’s diffusion properties and the environmental exposure to chloride. High chloride levels, common in marine areas, accelerate deterioration and reduce concrete’s load-bearing capacity. To protect structures, it is essential to use low-chloride materials, corrosion inhibitors, protective coatings, and properly designed concrete mixes with sufficient reinforcement cover. Regular monitoring and maintenance are also necessary to ensure long-term durability [15], 16].

Marine aerosols, generated by sea surface bubbles and transported inland by wind, are the primary source of chloride ions that cause corrosion in coastal concrete structures [17]. Their concentration increases with wind speed [18], while deicing salts add to chloride exposure even far from roads and at high elevations globally [19]. Assessing chloride accumulation in offshore concrete is vital for durability evaluation, with field data informing accurate chloride ingress models [20]. Chloride penetration involves multiple transport mechanisms, mainly diffusion, modeled using Fick’s second law, where the diffusion coefficient measures ion spread over time [21]. Rapid chloride migration tests offer quick assessment but may lack field accuracy, whereas wetting-drying and immersion tests provide realistic results but are time-consuming [22]. Designing durable concrete requires optimizing mixes for chloride resistance and minimal cement use, but verifying this through standard lab tests is slow and resource intensive. Therefore, fast, reliable, and comprehensive predictive methods are needed to estimate chloride penetration resistance efficiently, addressing the complexity of chloride transport in concrete.

Several research studies have explored the implementation of machine learning (ML) in estimating the properties of concrete, thereby eliminating the need for laborious and resource-intensive laboratory testing. Multi-objective optimization techniques combined with ML algorithms have been employed to enhance ingredient proportions for concrete mix design, which can be implemented at batching facilities to efficiently produce high-strength concrete before construction [23]. A hybrid backpropagation neural network (BPNN) model utilizing ReLU activation functions was created to forecast the shear strength of reinforced concrete beams, achieving an R 2 value of 0.924 and a mean squared error of 10.611 kN [24]. An optimized BPNN was also developed to estimate surface chloride infiltration in concrete exposed to marine conditions, outperforming conventional and advanced models with an R 2 value of 0.91 and offering faster, more efficient predictions [25]. To estimate surface chloride content in marine concrete, ML models such as support vector regression (SVR) and extreme gradient boosting (XGBoost) were trained on preprocessed data, with the ensemble model achieving superior accuracy (R 2 = 0.95) compared to the individual models [26]. AQ21 and WEKA, using the J48 algorithm, were employed to automate the classification of plain and CFBC fly ash-enhanced concrete, identifying materials with high chloride resistance [27]. An artificial neural network (ANN) model was also used to assess chloride diffusivity in high-performance concrete, showing high prediction accuracy when validated against experimental data [28]. Additionally, a metaheuristic algorithm was applied to enhance a BPNN model’s performance in predicting chloride levels, surpassing traditional BPNN methods [29]. Utilizing observed data, another ANN model was employed to correlate input parameters with the depth of chloride ion penetration under drying–wetting cycles [30]. ANN and XGBoost have proven effective in accurately predicting the natural frequencies of functionally graded nanobeams [31]. Physics-informed recurrent neural networks have enhanced elastoplastic constitutive modeling by improving stability and extrapolation capabilities [32]. Hybrid metaheuristic-optimized neural networks have achieved superior accuracy in predicting concrete-reinforcement bond strength, outperforming traditional analytical models [33]. Additionally, hybrid approaches combining ML and finite element analysis have accelerated material parameter calibration, resulting in improved accuracy and reduced computational time [34]. Moreover, XGB excelled in predicting remediation outcomes, with Bayesian optimization boosting model performance and SHAP analysis offering interpretability [35].

This study tackles a critical issue by minimizing the dependence on labor-intensive and resource-demanding laboratory tests for assessing chloride diffusion in concrete through advanced ML techniques. By leveraging hybrid ML models, this research aims to develop a reliable predictive framework for estimating the non-steady-state migration diffusion coefficient (D nssm) of concrete, a crucial parameter for ensuring the long-term performance of reinforced concrete structures subjected to chloride exposure. Support vector regression (SVR) was integrated with four metaheuristic optimization techniques: grey wolf optimization (GWO), gorilla troops optimization (GTO), firefly algorithm (FFA), and particle swarm optimization (PSO) to improve predictive accuracy. These algorithms were selected based on a combination of their diverse exploration-exploitation mechanisms, proven effectiveness in solving complex nonlinear problems, and their increasing adoption in civil engineering applications. GWO and PSO are well-established and widely used due to their robustness and global search capabilities, whereas FFA offers excellent local search refinement through its light-attraction mechanism. GTO, a relatively recent algorithm, was included for its innovative exploration strategy, inspired by the social behavior of gorillas, and represents a novel addition to the field. The combination of established and emerging algorithms aimed to balance innovation with performance reliability in optimizing SVR parameters. To enhance model interpretability, tools such as SHapley Additive exPlanations (SHAP), partial dependence plots (PDP), and individual conditional expectation plots (ICE) were used, offering important understanding of how individual input variables influence predictions. Furthermore, an interactive graphical interface was created to facilitate the prediction of D nssm, providing rapid results and eliminating the reliance on conventional experimental methods.

2
Methodology
2.1
Database overview

The database comprises 843 data points that capture D nssm values across various types of concrete. This database is collected from two channels: i) academic research projects [36], and ii) publications in international journals [21], 27], [37], [38], [39], [40], [41], [42]. The D nssm values serve as indicators for evaluating concrete’s ability to withstand chloride ion infiltration, as illustrated in Figure 1.

Figure 1:

Chloride penetration resistance.

The dataset comprises diverse concrete classifications, including conventional, lightweight, enhanced-strength, advanced-performance, and self-consolidating concrete. These concrete mixes are characterized by eight attributes that detail the components and their proportions: water-to-binder ratio, quantities of binders (slag, cement, silica fume, lime filler, and fly ash) measured in kg/m3, amounts of fine and coarse aggregates in kg/m3, and levels of chemical additives (air-entraining agents, superplasticizers, and plasticizers) as a percentage of the binder’s weight.

The database encompasses various cement varieties documented in diverse standards as experimental data has been gathered from numerous regions of the world. To ensure uniformity, all cement types are standardized based on the European standard EN 197-1. This standard classifies cement into 27 unique types, organized into five main categories, as depicted in Figure 2. Within the database, there are 15 cement types representing the four primary cement groups as illustrated in Figure 3. Around 57 % of the experiments conducted incorporate supplementary cementitious materials. The water-to-binder ratio spans from 0.19 to 0.65. Within the dataset, admixtures consist of numerous compounds, including superplasticizers containing polycarboxylate ether, naphthalene, melamine sulfonate, and lignosulfonate, as well as air-entraining agents composed of vinsol resin, synthetic surfactants, and fatty acid soap. The statistical distribution of the features is given in Table 1. Further, box plots of the features and D nssm are depicted in Figure 4.

Figure 2:

Cement groups (EN 197-1).

Figure 3:

Cement types utilized in the database.

Table 1:

Statistical intuitions into the features.

StatisticsW/BWaterCementSlagFly ashSFFACATASPMT age D nssm
kg/m3 kg/m3 kg/m3 kg/m3 kg/m3 kg/m3 kg/m3 kg/m3 % (by Wb)Day(×10−12 m2/s)
Mean0.42162.3330.0729.333.95.28759.06818.51,6360.4883.069.1
Median0.44164.5334.6100076894417460.47287.04
Mode0.45162360000786463.01,9600288.8
SD0.0954.68134.3285.681.819.9183.63307.0323.90.5395.479.42
Maximum0.651,0492,384.971,2847354681,574.11,2402,0974.17365133.6
Minimum0.198.4613.0200027.53054.04030.2
Skewness−0.029.748.328.235.1415.4−0.47−0.86−2.331.251.835.4

SD, standard deviation.

Figure 4:

Statistical distribution of the features and label.

Correlation heatmaps were utilized to evaluate multicollinearity issues. In the Spearman correlation matrix (Figure 5), a notable correlation exists between D nssm and the water-to-binder ratio (W/B), with a coefficient of 0.57. This correlation is followed by water (0.45), fine aggregate (FA, 0.12), and coarse aggregate (CA, 0.07). Additionally, there are negative correlations of −0.36, −0.32, −0.21, −0.14, −0.13, and −0.12 with silica fume (SF), migration test age (MT Age), slag, total aggregate (TA), superplasticizer (SP), and fine aggregate (FA), respectively. Likewise, in the Pearson correlation matrix (Figure 6), W/B exhibits the strongest linear association, with a correlation coefficient of 0.38, followed by CA (0.10) and TA (0.03). Conversely, there are negative correlations of −0.27 and −0.22 with SP and MT Age, respectively. As all features have correlation coefficients below 0.8, multicollinearity is not present.

Figure 5:

Spearman correlation heatmap.

Figure 6:

Pearson correlation heatmap.

2.2
Data preprocessing

The dataset consisted of 843 data points collected from academic journals and research projects, covering a wide range of concrete mixes. To ensure data quality, errors and duplicates were removed, and missing values were handled using manual and naïve imputation [43], 44]. Box plots were used to detect and remove outliers. Some features, like water, cement, and slag, had highly skewed distributions. Although log transformations were considered to correct this, standardizing the data with StandardScaler was enough to handle the skewness without changing the original meaning of the features.

All input features were scaled to have zero mean and unit variance to improve model stability and performance. Correlation analysis using Pearson and Spearman methods confirmed that all variables had correlation coefficients below 0.8, indicating low multicollinearity. The dataset had a high data-to-variable ratio (76.6), exceeding the recommended minimum and reducing the risk of overfitting [45], 46]. Additionally, 30 % of the data was reserved for testing and validation. For the hybrid models, parameter values for the metaheuristic algorithms (GWO, GTO, FFA, and PSO) were initially based on literature. A preliminary sensitivity analysis was also performed to fine-tune the parameters and ensure good optimization performance for the SVR models.

2.3
Model development

The optimization techniques PSO, GWO, FFA, and GTO were utilized to enhance the predictive performance of the SVR model in determining the D nssm. An overview of the model development process is presented in Figure 7. The key SVR parameters include the radial basis function (RBF) kernel width (g) and penalty factor (C), each constrained within the range of 0.01–100. The parameter configurations for the ML models are as follows: In FFA, the absorption coefficient (α) is set to 0.2, the randomization parameter (rand) is 0.93, the initial attractiveness coefficient (β 0) is 2, and the light intensity absorption coefficient (γ) is 1. For GWO, the control variable (a) reduces in a linear fashion from an initial value of 2 down to 0. In PSO, the maximum velocity (V max) is defined as 6, while the minimum (w min) and maximum (w max) inertia weights are 0.2 and 0.8, respectively. Additionally, the cognitive coefficient (c 1) and social coefficient (c 2) are assigned values of 1.6 and 1.8, respectively, to regulate swarm behaviour during optimization. The GTO algorithm utilizes 20 candidate solutions per iteration, with a maximum of 50 optimization iterations. A reproducibility seed of 42 ensures consistency in results, while the perturbation range is set at ±0.1. The candidate solution adjustment follows a random uniform perturbation mechanism based on both exploitation and exploration strategies. These optimized parameter settings enhance the efficiency of the SVR model in predicting the response parameter.

Figure 7:

Overview of hybrid model development.

2.4
Assessment of model effectiveness

Every model is subjected to a comprehensive assessment procedure, which involves computing a range of performance indicators to evaluate its effectiveness (Figure 8). The mathematical formulations for these statistical indices are provided in Eqs. (16). (1) R 2 = i = 1 i = n P i P i Q i Q i 2 i = 1 i = n P i P i 2 i = 1 i = n Q i Q i 2 $${R}^{2}=\frac{{\left[\sum _{i=1}^{i=n}\left({P}_{i}-{\bar{P}}_{i}\right)\left({Q}_{i}-{\bar{Q}}_{i}\right)\right]}^{2}}{\sum _{i=1}^{i=n}{\left({P}_{i}-{\bar{P}}_{i}\right)}^{2}\sum _{i=1}^{i=n}{\left({Q}_{i}-{\bar{Q}}_{i}\right)}^{2}}$$ (2) Adj  R 2 = 1 1 R 2 N 1 N K n 1 $$\text{Adj\,}{R}^{2}=1-\frac{\left(1-{R}^{2}\right)\left(N-1\right)}{N-{K}_{{n}^{-1}}}$$ (3) RMSE = 1 N i = 1 i = n P I Q i 2 $$\text{RMSE}=\sqrt{\frac{1}{N}\sum _{i=1}^{i=n}{\left({P}_{I}-{Q}_{i}\right)}^{2}}$$ (4) MAE = 1 N i = 1 i = n P i Q i $$\text{MAE}=\frac{1}{N}\sum _{i=1}^{i=n}\left\vert {P}_{i}-{\mathrm{Q}}_{i}\right\vert $$ (5) a 10 index = R 10 N $$a10-\text{index}=\frac{R10}{N}$$ (6) a 20 index = R 20 N $$a20-\text{index}=\frac{R20}{N}$$ Where P i depicts the actual value, while Q i represents the anticipated value, where N refers to the total observations, and K n represents the number of predictors. The letter with a bar on top represents the average values. R20 denotes the anticipated number of values falling within the ratio range of 0.90–1.20 for experimental-to-predicted values, whereas R10 represents the count of values within the range of 0.80–1.10.

Figure 8:

Performance indicators for assessing model efficacy.

3
Results and discussions
3.1
Performance analysis

Regression analysis is crucial for evaluating the predictive accuracy of ML models by quantifying the alignment between predicted and observed values (Figure 9). A regression coefficient close to 1, particularly above 0.8, demonstrates the model’s effectiveness in capturing complex relationships [47], 48]. The obtained regression slopes validate the performance of the suggested SVR-based hybrid models (Figure 10). During the training phase, SVR-GTO exhibited the highest slope (0.93), followed by SVR-GWO (0.92), while SVR-PSO and SVR-FFA both achieved a slope of 0.85. In the testing phase, the regression slopes further improved, with SVR-GTO increasing to 0.96, SVR-GWO reaching 0.93, SVR-PSO attaining 0.91, and SVR-FFA maintaining a strong slope of 0.88. These results confirm the robustness of the models in estimating chloride penetration resistance in concrete, ensuring minimal deviation from experimental values and reinforcing their applicability in engineering practices. Additionally, Figure 11 presents an evaluation of the prediction accuracy by comparing actual and predicted values, along with the corresponding relative errors.

Figure 9:

Scatter plots comparing predicted and actual values.

Figure 10:

Regression analysis of the developed models.

Figure 11:

Evaluation of prediction accuracy using actual, predicted, and relative error values.

3.2
Assessment of model effectiveness

The statistical assessment of the ML models was conducted utilizing key performance indicators for both the training and testing phases (Table 2). In the training phase, SVR-GTO achieved the highest accuracy with an R 2 of 0.95 and an RMSE of 2.95, followed closely by SVR-GWO (R 2 = 0.94, RMSE = 3.12). SVR-PSO and SVR-FFA demonstrated slightly lower performance, with R 2 values of 0.92 and 0.90, and RMSE values of 3.45 and 3.56, respectively.

Table 2:

Overview of performance metrics for the proposed models.

PhaseModels R 2 Adj R 2 RMSEMAERSRa-10 indexa-20 index
TrainingSVR-GTO0.950.952.951.130.250.940.97
SVR-GWO0.940.933.122.070.270.920.95
SVR-PSO0.920.913.452.210.330.910.93
SVR-FFA0.900.903.562.340.450.900.92
TestingSVR-GTO0.970.970.930.330.110.940.98
SVR-GWO0.920.921.550.690.190.920.94
SVR-PSO0.910.901.670.870.240.900.93
SVR-FFA0.890.902.131.120.430.890.91

In the testing phase, SVR-GTO again outperformed other models, attaining the highest R 2 of 0.97 and the lowest RMSE of 0.93, indicating strong generalization. SVR-GWO maintained solid performance (R 2  = 0.92, RMSE = 1.55), while SVR-PSO and SVR-FFA yielded R 2 values of 0.91 and 0.89, and RMSE values of 1.67 and 2.13, respectively. These findings confirm the superior accuracy and reliability of the SVR-GTO model in predicting chloride penetration resistance (Figure 12). The superior performance of SVR-GTO can be attributed to the unique exploration–exploitation balance of the GTO algorithm. GTO simulates the complex social behavior and dynamic movement patterns of gorilla troops, enabling it to effectively avoid local optima and explore the solution space more thoroughly. Its adaptive strategy and strong global search capability make it particularly suitable for high-dimensional, nonlinear problems such as predicting chloride diffusion in concrete.

Figure 12:

Spider plots of statistical indicator scores.

3.3
SHAP interpretation

The relative importance of input features in predicting D nssm is illustrated in Figure 13 through SHAP analysis. The W/B exhibited the highest mean SHAP value (+3.8), underscoring its dominant influence on model predictions. MT Age followed with a mean value of +2.6, indicating its strong contribution to chloride resistance. TA and SP also demonstrated notable impacts, with SHAP values of +0.8 and slightly lower, respectively. Furthermore, fly ash and slag each contributed with a value of +0.6, while CA, cement, and SF showed smaller but meaningful effects, with SHAP values of 0.5, 0.3, and 0.2, respectively. These findings highlight the critical roles of mixture composition and curing age in determining chloride ingress behavior in concrete.

Figure 13:

Mean SHAP plot: assessing feature’s significance.

A rise in the W/B ratio is associated with a positive SHAP value of approximately +35, indicating that higher W/B ratios contribute positively to D nssm (Figure 14). Conversely, a decrease in the W/B ratio depicts a negative SHAP value (around −10), suggesting a detrimental effect on D nssm. The MT Age also plays a significant role, with higher durations showing negative SHAP values (around −8) and shorter durations exhibiting positive SHAP values (around +48). This implies that longer curing periods negatively impact D nssm, while shorter durations have a positive influence. Similarly, higher FA quantities show negative SHAP values (−5), indicating adverse effects on D nssm prediction, while lower FA amounts exhibit positive SHAP values (+3), implying beneficial impacts.

Figure 14:

Impact of variables: SHAP summary visualization.

D nssm shows contrasting trends with CA and TA content. Specifically, increasing CA content leads to a decrease in D nssm, indicating improved resistance to chloride penetration. This occurs because larger, well-graded coarse aggregates disrupt the continuity of the cement paste, increasing the tortuosity of the diffusion path and effectively reducing the permeability of the concrete [49]. In contrast, while moderate increases in TA, which includes both fine and coarse aggregates, can improve the concrete matrix, very high TA levels lead to an increase in D nssm. This rise may be attributed to the excessive volume of fine aggregates within the total aggregate fraction, which increases the interfacial transition zones (ITZs). Since ITZs are more porous than bulk paste, their expansion at very high TA contents can create easier pathways for chloride ions, thereby increasing chloride diffusivity. Hence, while coarse aggregate contributes positively to durability by blocking ion transport, an overly large total aggregate content, largely influenced by fine aggregate and the associated ITZ volume, can counteract this benefit and increase chloride permeability. This highlights the importance of optimizing both aggregate grading and overall aggregate volume to balance tortuosity effects with ITZ-induced permeability for enhanced concrete durability.

3.4
ICE and PDP interpretation

The D nssm of concrete exhibits diverse trends across different material parameters (Figure 15). Initially, D nssm increases with higher W/B ratios but only for a limited range, after which it stabilizes until approximately 0.44. Subsequently, there is a rapid increase in D nssm from 0.44 to 0.49, maintaining a constant value up to 0.55. Similarly, D nssm remains consistent up to 102 kg/m3 of cement, beyond which it experiences a slight decrease until approximately 145 kg/m3, remaining relatively constant thereafter. Additionally, D nssm remains constant up to 225 kg/m3 and then slightly decreases. Furthermore, D nssm shows a slight decrease up to 90 kg/m3, followed by a more significant decrease up to 100 kg/m3, after which it remains constant. D nssm almost remains constant with an increase in SF but decreases around 112 kg/m3. It also decreases slightly up to 18 kg/m3of FA, then shows a slight rise and decreases again. With an increase in CA content, D nssm remains almost constant and slightly decreases initially but remains constant up to 980 kg/m3, after which it increases. Moreover, D nssm initially experiences a rapid fall with a rise in MT age and continues to decrease with further increases in MT age. The PDP analysis reveals that D nssm rises sharply when the TA content exceeds approximately 1,100 kg/m3, consistent with the SHAP findings.

Figure 15:

PDP analysis.

The results of this study offer clear guidance for optimizing concrete mix designs to enhance durability against chloride ingress. Maintaining a low W/B ratio, ideally below 0.44, is a key factor in minimizing chloride diffusivity. A moderate cement content, around 102–145 kg/m3, appears sufficient for maintaining strength without increasing permeability. Incorporating SF up to 112 kg/m3 improves matrix densification, thereby reducing D nssm. Increasing CA content up to 980 kg/m3 is beneficial, as it enhances tortuosity and obstructs ion transport pathways. However, care must be taken with TA content. When TA exceeds 1,100 kg/m3, D nssm may increase due to a larger volume of ITZs, which are more porous and facilitate chloride movement. Additionally, extending the MT age shows that prolonged curing significantly improves chloride resistance. By carefully adjusting these key parameters, engineers can design concrete mixes with better long-term performance in chloride-rich environments.

4
Comparison with existing models

ML has been increasingly integrated with analytical and numerical methods to enhance the prediction of complex structural behaviors and material properties. Among boosting algorithms used to predict the homogenized stiffness of fiber-reinforced composites, XGBoost achieved the highest accuracy [50]. For structural stability problems, a hybrid PSO-XGBoost model effectively estimated the buckling loads of columns with variable cross-sections, demonstrating near-perfect R 2 values and improved interpretability via SHAP analysis [51]. Similarly, ensemble models such as light gradient boosting (LGB) outperformed other techniques in predicting the buckling behavior of functionally graded nanobeams [52]. In concrete-related applications, ANN consistently exhibited superior predictive performance over traditional methods such as linear regression and decision trees [53]. Additionally, bagging regressors (BR) yielded the most accurate predictions in estimating wear depth in fly ash concrete [54]. For high-performance concrete, multilayer perception (MLP) showed superior predictive accuracy [55]. Table 3 presents a comparative summary of the models employed in previous literature and those adopted in the current study.

Table 3:

Comparative summary of ML models utilized in existing literature and the present study.

ReferenceBest modelModel interpretation R 2
[50]XGBoost0.99
[51]PSO-XGBoostSHAP0.996
[52]LGB0.999
[53]ANNSensitivity analysis0.99
[54]BRSHAP0.99
[55]MLP0.912
Current studySVR-GTOSHAP, ICE, and PDP0.97
5
Graphical user interface (GUI)

An interactive GUI, depicted in Figure 17, was designed to streamline the prediction of the D nssm of concrete. This innovative tool eliminates reliance on traditional labor-intensive laboratory testing, enabling rapid and accurate assessment of chloride penetration resistance. By inputting key parameters, users can efficiently obtain precise predictions of the migration coefficient, a critical indicator of concrete durability in chloride-exposed environments. The GUI, driven by advanced ML models, provides a practical and easy-to-use tool for researchers and professionals to assess the durability of concrete structures. This interface represents a significant step forward in forecasting chloride resistance, supporting the development of long-lasting, resilient infrastructure in chloride-rich conditions (Figure 17).

Figure 16:

ICE analysis.

Figure 17:

GUI

6
Limitations and research directions for future

To make ML models more reliable, it is important to conduct experiments in controlled environments and collect data from a consistent, real-world source. This approach would improve dataset consistency and enhance the accuracy and reliability of ML models. The dataset, while comprehensive, may not fully represent all concrete types or environmental conditions, such as freeze-thaw cycles or varying chloride sources. This may limit the generalizability of the models to new scenarios. Additionally, future research should include additional predictors to assess the chloride resistance of concrete. For future research, deep learning techniques could be explored by expanding the dataset size to provide more comprehensive information, improving the model’s generalization and predictive performance.

7
Conclusions

This study explored advanced hybrid ML techniques to predict the D nssm of concrete, a key indicator of chloride penetration resistance. SVR was combined with four metaheuristic optimization algorithms: GTO, GWO, PSO, and FFA to enhance predictive accuracy. The primary outcomes of the study are succinctly outlined below:

  • SVR-GTO attained the highest R 2 of 0.97 and the minimum RMSE of 0.93, demonstrating superior robustness and generalization capability, while SVR-GWO also exhibited strong predictive efficacy with an R 2 of 0.92 and an RMSE of 1.55; in comparison, SVR-PSO and SVR-FFA recorded slightly lower R 2 of 0.91 and 0.89, with RMSE values of 1.67 and 2.13, respectively.

  • The analysis underscored the critical significance of the W/B ratio, MT Age, and TA in accurately estimating the chloride penetration resistance of concrete. Specifically, higher values of W/B and TA are associated with increased D nssm values, indicating reduced chloride resistance in concrete. Conversely, greater MT Age corresponds to lower D nssm values, signifying higher chloride resistance in concrete structures.

  • A GUI was created to streamline the estimation of the D nssm of concrete, eliminating the need for traditional, labor-intensive laboratory testing. The developed GUI, driven by advanced ML algorithms, provides a practical and easy-to-use tool for researchers and professionals to evaluate the durability of concrete structures.

Language: English
Submitted on: Apr 9, 2025
Accepted on: Nov 18, 2025
Published on: Dec 17, 2025
Published by: Sciendo
In partnership with: Paradigm Publishing Services

© 2025 Irfan Ullah, Muhammad Faisal Javed, Deema Mohammed Alsekait, Mohammed Jameel, Hisham Alabduljabbar, Khawaja Atif Naseem, Diaa Salama AbdElminaam, published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 License.