ishwarvenugopal / Imbalanced_Datasets

The final project for the CE888: Data Science and Decision Making module (Spring Term) at the University of Essex

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Imbalanced_Datasets

The datasets used are:

For the code used in pre-processing these datasets, go to the 'Preprocessing' folder

The Python notebooks used in this work are:

  • Dataset_1_LowImbalance.ipynb : The implementation using the Hepatocellular Carcinoma Dataset
  • Dataset_2_MediumImbalance.ipynb: The implementation using the Breast Cancer Dataset
  • Dataset_3_HighImbalance.ipynb: The implementation using the Porto Seguro’s Safe Driver Prediction
  • Plots.ipynb: The code for generating the plots used in the report

Read Report.pdf for a complete description and analysis of the project

About

The final project for the CE888: Data Science and Decision Making module (Spring Term) at the University of Essex


Languages

Language:Jupyter Notebook 100.0%