Title
A Backward Adjusting Strategy And Optimization Of The C4.5 Parameters To Improve C4.5'S Performance
Abstract
In machine learning, decision trees are employed extensively in solving classification problems. In order to design a decision tree classifier two main phases are employed. The first phase is to grow the tree using a set of data, called training data, quite often to its maximum size. The second phase is to prune the tree. The pruning phase produces a smaller tree with better generalization (smaller error on unseen data). One of the most popular decision tree classifiers introduced in the literature is the C4.5 decision tree classifier. In this paper, we introduce an additional phase, called adjustment phase, interjected between the growing and pruning phases of the C4.5 decision tree classifier. The intent of this adjustment phase is to reduce the C4.5 error rate by making adjustments to the non-optimal splits created in the growing phase of the C4.5 classifier, thus eventually improving generalization (accuracy of the tree on unseen data). In most of the simulations conducted with the C4.5 decision tree classifier, its parameters, confidence factor, CF, and minimum number of split-off cases, MS, are chosen to be equal 25% and 2, their default values, recommended by Quinlan, the inventor of C4.5. The overall value of this work is that it provides the C4.5 user with a quantitative and qualitative assessment of the benefits of the proposed adjust phase, as well as the benefits of optimizing the C4.5 parameters, CF and MS. Copyright © 2008, Association for the Advancement of Artificial Intelligence (www.aaai.wg). All rights reserved.
Publication Date
11-17-2008
Publication Title
Proceedings of the 21th International Florida Artificial Intelligence Research Society Conference, FLAIRS-21
Number of Pages
35-40
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
Copyright Status
Unknown
Socpus ID
55849146249 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/55849146249
STARS Citation
Beck, J. R.; Garcia, M.; Zhong, M.; Georgiopoulos, M.; and Anagnostopoulos, G. C., "A Backward Adjusting Strategy And Optimization Of The C4.5 Parameters To Improve C4.5'S Performance" (2008). Scopus Export 2000s. 9711.
https://stars.library.ucf.edu/scopus2000/9711