Symbolic Regression in Materials Science: Discovering Interatomic Potentials from Data

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review


Particle-based modeling of materials at atomic scale plays an important role in the development of new materials and the understanding of their properties. The accuracy of particle simulations is determined by interatomic potentials, which allow calculating the potential energy of an atomic system as a function of atomic coordinates and potentially other properties. First-principles-based ab initio potentials can reach arbitrary levels of accuracy, however, their applicability is limited by their high computational cost. Machine learning (ML) has recently emerged as an effective way to offset the high computational costs of ab initio atomic potentials by replacing expensive models with highly efficient surrogates trained on electronic structure data. Among a plethora of current methods, symbolic regression (SR) is gaining traction as a powerful “white-box” approach for discovering functional forms of interatomic potentials. This contribution discusses the role of symbolic regression in Materials Science (MS) and offers a comprehensive overview of current methodological challenges and state-of-the-art results. A genetic programming-based approach for modeling atomic potentials from raw data (consisting of snapshots of atomic positions and associated potential energy) is presented and empirically validated on ab initio electronic structure data.
Original languageEnglish
Title of host publicationGenetic Programming Theory and Practice XIX
EditorsLeonardo Trujillo, Stephan M. Winkler, Sara Silva, Wolfgang Banzhaf
Place of PublicationSingapore
PublisherSpringer Nature Singapore
Number of pages30
ISBN (Print)978-981-19-8460-0
Publication statusPublished - 2023


Dive into the research topics of 'Symbolic Regression in Materials Science: Discovering Interatomic Potentials from Data'. Together they form a unique fingerprint.

Cite this