Oral Presentation Ninth International Symposium on Life-Cycle Civil Engineering 2025

Explainable Machine Learning Models for Corrosion degree Prediction Based onĀ Corrosion-induced Crack Width (113187)

Zhengfeng Zhang 1 , Ruilin Wang 1 , Weiping Zhang 1
  1. Tongji University, Shanghai, SHANGHAI, China

The width of corrosion-induced cracks is a useful indicator for evaluating the corrosion degree of steel reinforcements. However, the existing empirical formulas do not account for the various factors that affect the corrosion cracking process. In this study, the corrosion degree prediction model based on the width of corrosion cracks was established using machine learning (ML) techniques. A total of 3,580 samples were collected under various experimental conditions from 63 published studies. Using these data, three ensemble machine learning (EML) models, namely random forest (RF), extreme gradient boosting machine (XGBoost), and light gradient boosting machine (LGBM), were developed. Among these, the RF model achieved the highest performance on the test set with an R2 value of 0.88. In addition, SHapley Additive exPlanations (SHAP) was applied to interpret the data-driven model. A strong linear relationship between crack width and the corrosion degree was observed, while no significant correlation was found between concrete compressive strength and corrosion degree. The analysis also revealed that crack width, rebar diameter, and current density were the three most influential factors on the predict results. The SHAP interpretation results were consistent with theoretical understanding, demonstrating that the model effectively reflects the influencing mechanisms of multiple factors.