TY - JOUR
T1 - Comparison of semi-empirical models, symbolic regression, and machine learning approaches for prediction of tensile strength in steels
AU - Millner, Gerfried
AU - Kronberger, Gabriel
AU - Mücke, Manfred
AU - Romaner, Lorenz
AU - Scheiber, Daniel
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/7/1
Y1 - 2025/7/1
N2 - We employ data-driven models to predict the tensile strength of steel coils using information on their chemical composition and process parameters. The dataset contains extensive chemical analyses, diverse process parameters, and the characterized tensile strength as target property. We compare prediction quality and traceability of the predictions of pure machine learning models and physics-informed models. To introduce physical knowledge, we combine models from literature knowledge with symbolic regression and compare the physics-inspired models to machine learning models. In contrast to classic black-box models, symbolic regression provides mathematical equations for the estimation of the target value, facilitating straightforward interpretation. To analyze the predictions from classic black-box machine learning models, we use feature importance analysis with SHAP and contrast the obtained feature impacts with physics-based model parameters. We find that for the present use case, Artificial Neural Networks perform best, while the physics-infused models from symbolic regression allow for better interpretability.
AB - We employ data-driven models to predict the tensile strength of steel coils using information on their chemical composition and process parameters. The dataset contains extensive chemical analyses, diverse process parameters, and the characterized tensile strength as target property. We compare prediction quality and traceability of the predictions of pure machine learning models and physics-informed models. To introduce physical knowledge, we combine models from literature knowledge with symbolic regression and compare the physics-inspired models to machine learning models. In contrast to classic black-box models, symbolic regression provides mathematical equations for the estimation of the target value, facilitating straightforward interpretation. To analyze the predictions from classic black-box machine learning models, we use feature importance analysis with SHAP and contrast the obtained feature impacts with physics-based model parameters. We find that for the present use case, Artificial Neural Networks perform best, while the physics-infused models from symbolic regression allow for better interpretability.
KW - Feature importance
KW - Machine learning
KW - Process–structure–property (PSP) relationships
KW - Solid solution strengthening
KW - Steel coils
KW - Symbolic regression
UR - http://www.scopus.com/inward/record.url?scp=105001476197&partnerID=8YFLogxK
U2 - 10.1016/j.ijengsci.2025.104247
DO - 10.1016/j.ijengsci.2025.104247
M3 - Article
AN - SCOPUS:105001476197
SN - 0020-7225
VL - 212
JO - International Journal of Engineering Science
JF - International Journal of Engineering Science
M1 - 104247
ER -