TY - GEN
T1 - Solving multiclass classification problems by genetic programming
AU - Winkler, Stephan
AU - Affenzeller, Michael
AU - Wagner, Stefan
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
N2 - In this paper a multiclass classification problem solving technique based on genetic programming is presented. Classification algorithms are designed to learn a function which maps a vector of object features into one of several classes; this is done by analyzing a set of input-output examples of the function (also called "training samples"). Here we present a method based on the theory of genetic algorithms and genetic programming that interprets classification problems as optimization problems. The major aspects presented in this paper are suitable genetic operators for this problem class (mainly the creation of new hypotheses by merging already existing ones and their detailed evaluation) we have designed and implemented. We define a novel function for measuring a classificator model's quality that takes into account several different features of the model to be evaluated; an extended version of ROC curves that can be applied not only to two-class-classification but also to multiclass classification problems, is also presented. The experimental part of the paper documents the ability of this method to yield very satisfying results; selected results achieved for two classification benchmark problems are discussed.
AB - In this paper a multiclass classification problem solving technique based on genetic programming is presented. Classification algorithms are designed to learn a function which maps a vector of object features into one of several classes; this is done by analyzing a set of input-output examples of the function (also called "training samples"). Here we present a method based on the theory of genetic algorithms and genetic programming that interprets classification problems as optimization problems. The major aspects presented in this paper are suitable genetic operators for this problem class (mainly the creation of new hypotheses by merging already existing ones and their detailed evaluation) we have designed and implemented. We define a novel function for measuring a classificator model's quality that takes into account several different features of the model to be evaluated; an extended version of ROC curves that can be applied not only to two-class-classification but also to multiclass classification problems, is also presented. The experimental part of the paper documents the ability of this method to yield very satisfying results; selected results achieved for two classification benchmark problems are discussed.
KW - Classification
KW - Data mining
KW - Genetic programming
KW - Knowledge representation
UR - http://www.scopus.com/inward/record.url?scp=84867339795&partnerID=8YFLogxK
M3 - Conference contribution
SN - 9806560531
SN - 9789806560536
T3 - WMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
SP - 48
EP - 53
BT - WMSCI 2005 - The 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Proceedings
PB - International Institute of Informatics and Systemics
T2 - 9th World Multi-Conference on Systemics, Cybernetics and Informatics, WMSCI 2005
Y2 - 10 July 2005 through 13 July 2005
ER -