TY - JOUR
T1 - An evaluation of heuristics for rule ranking
AU - Dreiseitl, Stephan
AU - Osl, Melanie
AU - Baumgartner, Christian
AU - Vinterbo, Staal
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2010/11
Y1 - 2010/11
N2 - Objective: To evaluate and compare the performance of different rule-ranking algorithms for rule-based classifiers on biomedical datasets. Methodology: Empirical evaluation of five rule ranking algorithms on two biomedical datasets, with performance evaluation based on ROC analysis and 5×2 cross-validation. Results: On a lung cancer dataset, the area under the ROC curve (AUC) of, on average, 14267.1 rules was 0.862. Multi-rule ranking found 13.3 rules with an AUC of 0.852. Four single-rule ranking algorithms, using the same number of rules, achieved average AUC values of 0.830, 0.823, 0.823, and 0.822, respectively. On a prostate cancer dataset, an average of 339265.3 rules had an AUC of 0.934, while 9.4 rules obtained from multi-rule and single-rule rankings had average AUCs of 0.932, 0.926, 0.925, 0.902 and 0.902, respectively. Conclusion: Multi-variate rule ranking performs better than the single-rule ranking algorithms. Both single-rule and multi-rule methods are able to substantially reduce the number of rules while keeping classification performance at a level comparable to the full rule set.
AB - Objective: To evaluate and compare the performance of different rule-ranking algorithms for rule-based classifiers on biomedical datasets. Methodology: Empirical evaluation of five rule ranking algorithms on two biomedical datasets, with performance evaluation based on ROC analysis and 5×2 cross-validation. Results: On a lung cancer dataset, the area under the ROC curve (AUC) of, on average, 14267.1 rules was 0.862. Multi-rule ranking found 13.3 rules with an AUC of 0.852. Four single-rule ranking algorithms, using the same number of rules, achieved average AUC values of 0.830, 0.823, 0.823, and 0.822, respectively. On a prostate cancer dataset, an average of 339265.3 rules had an AUC of 0.934, while 9.4 rules obtained from multi-rule and single-rule rankings had average AUCs of 0.932, 0.926, 0.925, 0.902 and 0.902, respectively. Conclusion: Multi-variate rule ranking performs better than the single-rule ranking algorithms. Both single-rule and multi-rule methods are able to substantially reduce the number of rules while keeping classification performance at a level comparable to the full rule set.
KW - Lung cancer
KW - Prostate cancer
KW - Rule evaluation metrics
KW - Rule ranking
KW - Breast Neoplasms/pathology
KW - Area Under Curve
KW - Lung Neoplasms/pathology
KW - Artificial Intelligence
KW - Humans
KW - Male
KW - Algorithms
KW - Prostatic Neoplasms/pathology
KW - Female
UR - http://www.scopus.com/inward/record.url?scp=78149466138&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2010.03.005
DO - 10.1016/j.artmed.2010.03.005
M3 - Article
C2 - 20466526
SN - 0933-3657
VL - 50
SP - 175
EP - 180
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
IS - 3
ER -