Abstract
This thesis examines four supervised machine learning algorithms: Logistic Regression, Support Vector Machine, Random Forest, and Ensemble Stacking. It also explores four feature selection techniques: correlation analysis, the chi-squared test, mutual information for classification, and Recursive Feature Elimination (RFE). The models are then applied to coronary artery disease prediction, and their performance under each feature selection method is compared using accuracy, F1 score, and the confusion matrix. The results show that, among the models developed, Logistic Regression with chi-squared feature selection is a robust and reliable predictive model, achieving an accuracy of 87.65%. This research advances the understanding of machine learning algorithms and feature selection techniques, with practical implications for the reliable prediction of coronary artery disease.
Thesis Completion
2023
Semester
Fall
Thesis Chair/Advisor
Boloni, Ladislau
Degree
Bachelor of Science (B.S.)
College
College of Engineering and Computer Science
Department
Computer Science
Degree Program
Computer Science
Language
English
Access Status
Campus Access
Length of Campus-only Access
3 years
Release Date
12-15-2026
Recommended Citation
Deegutla, Sathwika, "Understanding Machine Learning Algorithms and Feature Selection Techniques for Predicting Coronary Artery Disease" (2023). Honors Undergraduate Theses. 1495.
https://stars.library.ucf.edu/honorstheses/1495