Improving Genetic Programming for Symbolic Regression with Equality Graphs

Research output: Chapter in Book/Report/Conference proceedingsConference contributionpeer-review

1 Citation (Scopus)

Abstract

The search for symbolic regression models with genetic programming (GP) has a tendency of revisiting expressions in their original or equivalent forms. Repeatedly evaluating equivalent expressions is inefficient, as it does not immediately lead to better solutions. However, evolutionary algorithms require diversity and should allow the accumulation of inactive building blocks that can play an important role at a later point. The equality graph is a data structure capable of compactly storing expressions and their equivalent forms allowing an efficient verification of whether an expression has been visited in any of their stored equivalent forms. We exploit the e-graph to adapt the subtree operators to reduce the chances of revisiting expressions. Our adaptation, called eggp, stores every visited expression in the e-graph, allowing us to filter out from the available selection of subtrees all the combinations that would create already visited expressions. Results show that, for small expressions, this approach improves the performance of a simple GP algorithm to compete with PySR and Operon without increasing computational cost. As a highlight, eggp was capable of reliably delivering short and at the same time accurate models for a selected set of benchmarks from SRBench and a set of real-world datasets.
Original languageEnglish
Title of host publicationGECCO 2025 - Proceedings of the 2025 Genetic and Evolutionary Computation Conference
EditorsGabriela Ochoa
Place of PublicationNew York, NY, USA
PublisherAssociation for Computing Machinery
Pages989–998
Number of pages10
ISBN (Electronic)9798400714658
ISBN (Print)9798400714658
DOIs
Publication statusPublished - 13 Jul 2025

Publication series

NameGECCO 2025 - Proceedings of the 2025 Genetic and Evolutionary Computation Conference

Keywords

  • equality graphs
  • equality saturation
  • genetic programming
  • symbolic regression

Fingerprint

Dive into the research topics of 'Improving Genetic Programming for Symbolic Regression with Equality Graphs'. Together they form a unique fingerprint.

Cite this