thua101 / Lasso-RidgeRegularization-R



  • Logistic model to predict a college's ownership (private or public)
  • Lasso/Ridge regression models to predict the graduation rate
  • Model training
  • Feature selection
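The steps listed above could be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: it assumes the College data frame comes from the ISLR package and that the glmnet package is used for the ridge and lasso fits. The 70/30 split, the seed, and all variable names are hypothetical choices.

```r
# Assumption: the ISLR and glmnet packages are installed.
library(ISLR)    # provides the College data frame
library(glmnet)  # ridge and lasso regression

set.seed(1)
data(College)

# Hold out 30% of the colleges for testing (the split ratio is an assumption).
train_idx <- sample(nrow(College), size = round(0.7 * nrow(College)))
train <- College[train_idx, ]
test  <- College[-train_idx, ]

# Logistic model for ownership: Private (Yes/No) against all other variables.
logit_fit  <- glm(Private ~ ., data = train, family = binomial)
logit_prob <- predict(logit_fit, newdata = test, type = "response")  # P(Private = Yes)
logit_pred <- ifelse(logit_prob > 0.5, "Yes", "No")
logit_acc  <- mean(logit_pred == as.character(test$Private))

# glmnet needs numeric design matrices; drop the intercept column.
x_train <- model.matrix(Grad.Rate ~ ., data = train)[, -1]
x_test  <- model.matrix(Grad.Rate ~ ., data = test)[, -1]
y_train <- train$Grad.Rate

# alpha = 0 is ridge, alpha = 1 is lasso; lambda is chosen by cross-validation.
ridge_cv <- cv.glmnet(x_train, y_train, alpha = 0)
lasso_cv <- cv.glmnet(x_train, y_train, alpha = 1)

# Test-set mean squared error for the graduation rate.
ridge_pred <- predict(ridge_cv, newx = x_test, s = "lambda.min")
lasso_pred <- predict(lasso_cv, newx = x_test, s = "lambda.min")
ridge_mse  <- mean((ridge_pred - test$Grad.Rate)^2)
lasso_mse  <- mean((lasso_pred - test$Grad.Rate)^2)

# Feature selection: predictors the lasso keeps (non-zero coefficients).
lasso_coef <- as.matrix(coef(lasso_cv, s = "lambda.min"))[, 1]
selected   <- setdiff(names(lasso_coef)[lasso_coef != 0], "(Intercept)")
```

The lasso penalty can shrink coefficients exactly to zero, which is why it doubles as a feature-selection step here. Ridge shrinks coefficients toward zero but normally keeps every predictor.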

Dataset - College


Statistics for a large number of US Colleges from the 1995 issue of US News and World Report.

Format: A data frame with 777 observations on the following 18 variables.

Private A factor with levels No and Yes indicating whether the university is private (Yes) or public (No)

Apps Number of applications received

Accept Number of applications accepted

Enroll Number of new students enrolled

Top10perc Pct. new students from top 10% of H.S. class

Top25perc Pct. new students from top 25% of H.S. class

F.Undergrad Number of full-time undergraduates

P.Undergrad Number of part-time undergraduates

Outstate Out-of-state tuition

Room.Board Room and board costs

Books Estimated book costs

Personal Estimated personal spending

PhD Pct. of faculty with Ph.D.s

Terminal Pct. of faculty with terminal degree

S.F.Ratio Student/faculty ratio

perc.alumni Pct. alumni who donate

Expend Instructional expenditure per student

Grad.Rate Graduation rate
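
As a quick sanity check on the format described above, the data frame can be loaded and inspected. This sketch assumes the dataset is read from the ISLR package; the repository may load it from another source.

```r
# Assumption: the College data frame is taken from the ISLR package.
library(ISLR)
data(College)

# Confirm the documented shape: 777 observations on 18 variables.
stopifnot(nrow(College) == 777, ncol(College) == 18)

# Private is the only factor; the remaining 17 columns are numeric.
is_factor_col <- sapply(College, is.factor)
factor_cols   <- names(College)[is_factor_col]
private_lvls  <- levels(College$Private)

str(College)            # column names and types
table(College$Private)  # counts of public (No) and private (Yes) colleges
```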


This dataset was taken from the StatLib library, which is maintained at Carnegie Mellon University. The dataset was used in the ASA Statistical Graphics Section’s 1995 Data Analysis Exposition.
