TY - GEN
T1 - SCRBenchmark
T2 - 2025 Genetic and Evolutionary Computation Conference Companion, GECCO 2025 Companion
AU - Bachinger, Florian
AU - Werth, Bernhard
AU - Zenisek, Jan
AU - Haider, Christian
AU - Olivetti de França, Fabrício
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/8/11
Y1 - 2025/8/11
N2 - Shape-constrained regression aims to fit regression models that adhere to specific constraints regarding the expected shape of the parametric model. Commonly studied constraints include convexity, concavity, symmetry, and monotonicity. These constraints can be used to enforce the desired behavior or incorporate prior knowledge about the studied phenomena, guiding the fitting process towards a more realistic solution. These are particularly useful to ensure proper behavior when extrapolating beyond the training data in cases where only limited training data is available. Also, this helps to reduce the influence of noise, mitigating overfitting. There has been a growing interest in shape-constrained regression, however, no established benchmark currently exists. As a result, many studies lack a direct comparison between methods or focus solely on practical applications. To address this gap, we propose an adaptation of the Feynman benchmark suite for symbolic regression as a general benchmark for shape-constrained regression. The Feynman benchmark suite is an appropriate choice because it is derived from physical phenomena where shape constraints naturally occur. In this benchmark, we evaluate model accuracy, the number of violated constraints, and model behavior when extrapolating data and handling noise. Additionally, we provide a Python interface to facilitate the evaluation of any regression model.
AB - Shape-constrained regression aims to fit regression models that adhere to specific constraints regarding the expected shape of the parametric model. Commonly studied constraints include convexity, concavity, symmetry, and monotonicity. These constraints can be used to enforce the desired behavior or incorporate prior knowledge about the studied phenomena, guiding the fitting process towards a more realistic solution. These are particularly useful to ensure proper behavior when extrapolating beyond the training data in cases where only limited training data is available. Also, this helps to reduce the influence of noise, mitigating overfitting. There has been a growing interest in shape-constrained regression, however, no established benchmark currently exists. As a result, many studies lack a direct comparison between methods or focus solely on practical applications. To address this gap, we propose an adaptation of the Feynman benchmark suite for symbolic regression as a general benchmark for shape-constrained regression. The Feynman benchmark suite is an appropriate choice because it is derived from physical phenomena where shape constraints naturally occur. In this benchmark, we evaluate model accuracy, the number of violated constraints, and model behavior when extrapolating data and handling noise. Additionally, we provide a Python interface to facilitate the evaluation of any regression model.
KW - Benchmark Suite
KW - Shape Constraints
KW - Symbolic Regression
UR - https://www.scopus.com/pages/publications/105014588883
U2 - 10.1145/3712255.3734280
DO - 10.1145/3712255.3734280
M3 - Conference contribution
AN - SCOPUS:105014588883
T3 - GECCO 2025 Companion - Proceedings of the 2025 Genetic and Evolutionary Computation Conference Companion
SP - 2505
EP - 2513
BT - GECCO 2025 Companion - Proceedings of the 2025 Genetic and Evolutionary Computation Conference Companion
A2 - Ochoa, Gabriela
PB - Association for Computing Machinery, Inc
Y2 - 14 July 2025 through 18 July 2025
ER -