Welcome to the Data Science and Data Mining collection! This collection showcases the projects and assignments of students enrolled in the Data Science and Data Mining courses at the University of Central Florida. The collection covers topics such as data preprocessing, data visualization, statistical data analysis, data mining algorithms, machine learning, big data analytics and more. The students apply their skills and knowledge to various real-world datasets from different domains such as health, education, psychology, sports, social media and more. The collection aims to highlight the diversity and creativity of data science and data mining applications.
For more information, please contact Dr. Rui Xie.

Follow

Submissions from 2024

PDF

Advancing Cancer Classifcation through Machine Learning Analysis of RNA-Seq Gene Expression Data, Emil Agbemade, Amina Issoufou Anaroua, and Dimitri Bamba

PDF

Combating Cyberbullying on Social Media: A Machine Learning Approach with Text Analysis on Twitter, Amir Alipour Yengejeh

PDF

Optimizing AI with Advanced Data Structuring: A Comparative Analysis of K-means and GMM Clustering Techniques, Amir Alipour Yengejeh

PDF

XGBoost Hyperberd Model Using Steam Platform, Yuh-Haur Chen

PDF

Machine Learning Approaches for Cyberbullying Detection, Roland Fiagbe

PDF

Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe

PDF

Bootstrap Regression for Investigating Macroeconomics Factors Affecting USA Home Prices, Benedict Kongyir and Emil Agbemade

Submissions from 2023

PDF

Developing a Data-Driven Statistical Model for Accurately Predicting the Superconducting Critical Temperature of Materials using Multiple Regression and Gradient-Boosted Methods, Emil Agbemade

PDF

Predicting Heart Disease using Tree-based Model, Emil Agbemade

PDF

Silent Agony: Automated Detection of Ethnic and Religious Cyberbullying Using Machine Learning, Emil Agbemade

PDF

Variable Selection and Regression Analysis, Emil Agbemade

PDF

Analyzing the Impact of Health, Economic, and Demographic Factors on Life Expectancy: A Comparative Study of Developed and Developing Countries, Mahyar Alinejad

PDF

A Linear Regression Model to Predict the Critical Temperature of a Superconductor, Amir Alipour Yengejeh

PDF

Analysis of Credit Approval by Decision Tree, Amir Alipour Yengejeh

PDF

A Recommender System for Movie Ratings with Matrix Factorization Algorithm, Amir Alipour Yengejeh

PDF

Genome-Wide Association Study of The Maize Crop by The Lasso Regression Analysis, Amir Alipour Yengejeh

PDF

Machine Learning-based Approaches for Predicting the Critical Temperature of Superconductor, Pradip Dhakal

PDF

Variable Selection Using Lasso and Elastic Net Regression on High Dimensional Genetic Architecture Data of Maize Flowering Time, Pradip Dhakal

PDF

Classification of Adult Income Using Decision Tree, Roland Fiagbe

PDF

Linear Regression with Regularization on the Genetic Architecture of Maize Flowering Time, Roland Fiagbe

PDF

Movie Recommender System Using Matrix Factorization, Roland Fiagbe