espoirMur / FinalYearProjectReport

This project contains the LaTeX report of my final-year project on student orientation with data mining.

FinalYearProjectReport

In the digital age, terabytes of data are generated every second by mobile devices, digital photographs, web documents, Facebook updates, tweets, blogs, user-generated content, transactions, sensor data, surveillance data, queries, clicks, and browsing. Cheap storage has made it possible to keep this data, but much of it goes unused even though it contains useful information and insights. That is why we need data mining: the process of analyzing large amounts of data in an effort to find correlations, patterns, and insights.

Much research has been done on the use of data mining in industries such as telecoms, insurance, and banking, but the education domain has been left unexplored. That is why we decided to work on the topic: Mining educational dataset to improve student orientation at the university. We tried to answer this question: how can we use modern data mining techniques to provide universities with tools that help them improve the orientation of new students?

To answer that question, we conducted this research following the Cross-Industry Standard Process for Data Mining (CRISP-DM). We collected data from the information system called University Administrative Tool (UAT) of the Université Libre des Pays des Grands Lacs (ULPGL). Our initial dataset had 9606 rows and 22 columns, representing the records of each student who studied at ULPGL from 2012 to 2016. Based on that dataset, we explored how students are distributed by gender, option, school, percentage in the national examination, and so on.

We used statistical tests (chi-square, analysis of variance (ANOVA), and Pearson correlation) and predictive models to analyze student marks and to discover which criteria students use to choose their departments when they start university. After that, we predicted the Cumulative Grade Point Average (CGPA) in each department of ULPGL from the students' marks, the field they studied, and the name of their secondary school, by stacking 3 different regressor models and 2 different SVM models.
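As a rough illustration of the stacking step, here is a minimal sketch using sklearn's `StackingRegressor`. The report does not name the individual models here, so the three regressors (ridge, random forest, gradient boosting), the two SVM kernels, the linear final estimator, and the synthetic data are all assumptions made for the example, not the project's actual configuration or the ULPGL dataset.

```python
import numpy as np
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Synthetic stand-in for student features and a CGPA-like target
# (hypothetical data, NOT the ULPGL dataset).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = 60 + X @ np.array([5.0, 3.0, 0.0, -2.0, 1.0]) + rng.normal(scale=2.0, size=200)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Assumed base learners: 3 regressors and 2 SVM regressors.
base_models = [
    ("ridge", Ridge(alpha=1.0)),
    ("forest", RandomForestRegressor(n_estimators=50, random_state=0)),
    ("boosting", GradientBoostingRegressor(n_estimators=50, random_state=0)),
    ("svr_rbf", make_pipeline(StandardScaler(), SVR(kernel="rbf"))),
    ("svr_linear", make_pipeline(StandardScaler(), SVR(kernel="linear"))),
]

# The final estimator is trained on out-of-fold predictions of the base models.
stack = StackingRegressor(
    estimators=base_models,
    final_estimator=LinearRegression(),
    cv=5,
)
stack.fit(X_train, y_train)
predictions = stack.predict(X_test)
```

In a per-department setup, one such stacked model would be fitted on each department's records; that loop is omitted here.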
We evaluated the final model using cross-validation and RMSE, and found an RMSE score of less than 10% in each department. Finally, we built a web application that suggests an orientation to a new student according to the predicted CGPA. We used the following libraries and frameworks for this work: Python, pandas, matplotlib, seaborn, Flask, NumPy, and sklearn. The Jupyter notebooks for the project are in the Memory-working directory.
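To make the evaluation mentioned above concrete, the following sketch shows what a k-fold cross-validated RMSE computes. The project reports using sklearn; this version uses only the Python standard library so that every step is visible. The helper names (`rmse`, `fit_line`, `cross_validated_rmse`), the one-feature linear model, and the made-up exam and CGPA values on an assumed 0 to 100 scale are all hypothetical and chosen only for illustration.

```python
import math
import random


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def fit_line(xs, ys):
    """Ordinary least squares for one feature; returns (slope, intercept)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def cross_validated_rmse(xs, ys, k=5, seed=0):
    """Average RMSE over k folds, each fold held out once for testing."""
    indices = list(range(len(xs)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    scores = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in indices if i not in held_out]
        slope, intercept = fit_line([xs[i] for i in train], [ys[i] for i in train])
        predicted = [slope * xs[i] + intercept for i in fold]
        scores.append(rmse([ys[i] for i in fold], predicted))
    return sum(scores) / k


# Made-up data: national exam percentage -> CGPA on an assumed 0-100 scale.
rng = random.Random(42)
exam = [rng.uniform(50, 90) for _ in range(100)]
cgpa = [0.8 * x + 5 + rng.gauss(0, 4) for x in exam]
score = cross_validated_rmse(exam, cgpa)
```

Because each student appears in exactly one held-out fold, the averaged score estimates the error on students the model has not seen, which is the situation a new applicant represents.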

Languages

TeX 49.8%, PostScript 30.2%, HTML 19.9%