ramapitecusment / student_academic_performance

Prediction of Students' Academic Performance Dataset: Cart Trees, Random Forest, Cross Validation and Neuralnet

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Students' Academic Performance Dataset Cart Trees, Random Forest, Cross Validation and Neuralnet

AIMS:

  • To show the accuracy Cart Trees, Random Forest and Cross Validation.
  • To choose the best approach of prediction of students’ performance.
  • To make the conclusion.

Summary of the best CART TREE model. The ACCURACY of the model 68%. Noticeably, the variable VisitedResourses is highly significant, however ParentsSatisfaction level is almost insignificant.

alt text

Summary of the best CART TREE model, which were improved by CROSS VALIDATION. The ACCURACY of the model 71%. As you can see, the accuracy of the model has been improved. CP = 0.015.

alt text

Summary of the best RANDOM FOREST model.�The ACCURACY of the model 76%. Unfortunately, Random forest can not predict as exactly as it did before

Accuracy 0.7655
95% CI (0.6476392, 0.8115522)
No Information Rate 0.4380165
P-Value [Acc > NIR] < 2.2e-16
Kappa 0.6509571

Neuralnet:

One of the most important procedures when forming a neural network is data normalization. This involves adjusting the data to a common scale so as to accurately compare predicted and actual values. Failure to normalize the data will typically result in the prediction value remaining the same across all observations, regardless of the input values.

Deciding on the number of hidden layers in a neural network is not an exact science. In fact, there are instances where accuracy will likely be higher without any hidden layers. Therefore, trial and error plays a significant role in this process.

In our example there are 2 hidden layers:

alt text

In order to identify ideal threshold fot the neuralnet, it is needed to devided our data to training and test datasets. There are 10 training and 10 test datasets. Best Threshold is 0.4

alt text

Conclusion

The key indicators to watch out for are:

  • If a student is absent under 7 days, he will get at least medium GPA.
  • If a student rises hand more than 54 times and his relation is mother or visits additional recourses more that 88 times, he will achieve high GPA.
  • If a student visits less that 28 recourses, he will get low GPA

About

Prediction of Students' Academic Performance Dataset: Cart Trees, Random Forest, Cross Validation and Neuralnet


Languages

Language:R 100.0%