Keywords

Ridge, Lasso, Elastic Net, Mean Square Error(MSE)

Abstract

In this project, we investigate several variable selection procedures to give an overview of how well they perform on a genomic dataset using three different penalized regression approaches. Comparisons between different methods were performed. These methods include Ridge, lasso, and Elastic Net. We utilized 4494 observations with 7389 SNPs gene scores to predict time to male flowering (dtoa). We assessed the performance of these three models in terms of mean square error. Not surprisingly, Lasso and Elastic Net perform better than Ridge Regression. Overall, Elastic Net performed better in predicting the time of male flowering (dtoa).

Course Name

STA 6366 Data Science 1

Instructor Name

Dr Rui Xie

Share

COinS