Abstract
A distinguishing feature of symbolic regression using genetic programming is its ability to identify complex nonlinearwhite-boxmodels. This is especially relevant in industrial applications where models are extensively scrutinized in order to gain knowledge about underlying processes. This potential is often diluted by the ambiguity and complexity of the models produced by genetic programming. In this contribution we discuss several analysis methods with the common goal to enable better insights in the symbolic regression process and to produce models that are more understandable and show better generalization. In order to gain more information about the process we monitor and analyze the progresses of population diversity, building block information, and even more general genealogy information. Regarding the analysis of results, several aspects such as model simplification, relevance of variables, node impacts, and network analysis are presented and discussed.
Original language | English |
---|---|
Title of host publication | Genetic Programming Theory and Practice XI |
Publisher | Springer |
Pages | 175-190 |
ISBN (Print) | 978-1-4939-0374-0 |
DOIs | |
Publication status | Published - 2014 |
Keywords
- Genetic programming
- Symbolic regression
- Algorithm analysis
- Population diversity
- Building block analysis
- Genealogy
- Variable networks