Title

A Backward Adjusting Strategy And Optimization Of The C4.5 Parameters To Improve C4.5'S Performance

Abstract

In machine learning, decision trees are employed extensively in solving classification problems. In order to design a decision tree classifier two main phases are employed. The first phase is to grow the tree using a set of data, called training data, quite often to its maximum size. The second phase is to prune the tree. The pruning phase produces a smaller tree with better generalization (smaller error on unseen data). One of the most popular decision tree classifiers introduced in the literature is the C4.5 decision tree classifier. In this paper, we introduce an additional phase, called adjustment phase, interjected between the growing and pruning phases of the C4.5 decision tree classifier. The intent of this adjustment phase is to reduce the C4.5 error rate by making adjustments to the non-optimal splits created in the growing phase of the C4.5 classifier, thus eventually improving generalization (accuracy of the tree on unseen data). In most of the simulations conducted with the C4.5 decision tree classifier, its parameters, confidence factor, CF, and minimum number of split-off cases, MS, are chosen to be equal 25% and 2, their default values, recommended by Quinlan, the inventor of C4.5. The overall value of this work is that it provides the C4.5 user with a quantitative and qualitative assessment of the benefits of the proposed adjust phase, as well as the benefits of optimizing the C4.5 parameters, CF and MS. Copyright © 2008, Association for the Advancement of Artificial Intelligence (www.aaai.wg). All rights reserved.

Publication Date

11-17-2008

Publication Title

Proceedings of the 21th International Florida Artificial Intelligence Research Society Conference, FLAIRS-21

Number of Pages

35-40

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

Socpus ID

55849146249 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/55849146249

This document is currently not available here.

Share

COinS