Guardian99 / MultivariateRegression

The implementation of the model used for analysis of linear relations in ML to be applied on some datasets from UCI ML Repository to find the total squared error using leave-one-out cross-validation.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Machine Learning Validation Techniques Resubstitution If all the data is used for training the model and the error rate is evaluated based on outcome vs. actual value from the same training data set, this error is called the resubstitution error. This technique is called the resubstitution validation technique.

Holdout To avoid the resubstitution error, the data is split into two different datasets labeled as a training and a testing dataset. This can be a 60/40 or 70/30 or 80/20 split. This technique is called the hold-out validation technique. In this case, there is a likelihood that uneven distribution of different classes of data is found in training and test dataset. To fix this, the training and test dataset is created with equal distribution of different classes of data. This process is called stratification.

K-Fold Cross-Validation In this technique, k-1 folds are used for training and the remaining one is used for testing The advantage is that entire data is used for training and testing. The error rate of the model is average of the error rate of each iteration. This technique can also be called a form the repeated hold-out method. The error rate could be improved by using stratification technique.

Leave-One-Out Cross-Validation (LOOCV) In this technique, all of the data except one record is used for training and one record is used for testing. This process is repeated for N times if there are N records. The advantage is that entire data is used for training and testing. The error rate of the model is average of the error rate of each iteration.

MultivariateRegression

The implementation of the model used for analysis of linear relations in ML to be applied on some datasets from UCI ML Repository to find the total squared error using leave-one-out cross-validation.

About

The implementation of the model used for analysis of linear relations in ML to be applied on some datasets from UCI ML Repository to find the total squared error using leave-one-out cross-validation.


Languages

Language:MATLAB 100.0%