aryanxk02 / Breast-Cancer-Detection

Breast-Cancer-Detection

This project aims to identify the best way to predict breast cancer using machine learning algorithms. The models compared include tree-based classifiers, regression-based models, and probability-based models. The best model is selected using three metrics: mean accuracy, F1 score, and AUC score.
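To make the three metrics concrete, here is a minimal pure-Python sketch of how each can be computed for a binary classifier. The function names and the convention that label `1` is the positive class are assumptions for illustration, not taken from the project's notebooks; in practice a library such as scikit-learn provides equivalent functions.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def auc_score(y_true, scores, positive=1):
    """Area under the ROC curve, computed as the probability that a random
    positive sample receives a higher score than a random negative sample
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

Accuracy and F1 are computed from hard class predictions, while AUC needs a continuous score such as a predicted probability. The mean accuracy is then the average of `accuracy` over the cross-validation folds.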

Datasets

The project uses the Wisconsin Diagnostic and Wisconsin Prognostic breast cancer datasets, both available from the UCI Machine Learning Repository. The features in these datasets were computed from measurements taken on high-resolution scans of the tissue.

Methodology

  1. Data Preprocessing
  2. Data Visualization
  3. K-Fold Cross-Validation
  4. Model Fitting and Evaluation
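Steps 3 and 4 above can be sketched as follows. This is a simplified illustration, not the project's code: `k_fold_indices` is a hypothetical helper that shuffles the sample indices and splits them into `k` folds, and the "model" is a stand-in baseline that always predicts the majority class of the training fold. A real run would fit each candidate classifier in its place.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs for shuffled k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Striding keeps the fold sizes within one sample of each other.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train_idx, test_idx


def majority_class(labels):
    """Stand-in 'model': return the most frequent label."""
    return max(set(labels), key=labels.count)


def cross_validate_baseline(y, k=5, seed=0):
    """Mean accuracy of the majority-class baseline over k folds."""
    fold_accuracies = []
    for train_idx, test_idx in k_fold_indices(len(y), k, seed):
        prediction = majority_class([y[j] for j in train_idx])  # "fit"
        correct = sum(y[j] == prediction for j in test_idx)     # "evaluate"
        fold_accuracies.append(correct / len(test_idx))
    return mean(fold_accuracies)
```

Each sample appears in exactly one test fold, so every model is scored on data it did not see during fitting. A stratified split, which preserves the class ratio in each fold, is a common alternative when the classes are imbalanced.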

Timeline

  1. No Oversampling 1 (all columns)
  2. No Oversampling 2 (dropping columns)
  3. Oversampling 1 (all columns)
  4. Oversampling 2 (dropping columns)
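The README does not say which oversampling technique the experiments use. As one possibility, the sketch below shows plain random oversampling: minority-class rows are duplicated at random until every class is as large as the largest one. The function name `random_oversample` and the list-of-rows data layout are assumptions made for this example.

```python
import random
from collections import Counter


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen minority-class rows until all classes
    have as many samples as the largest class."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, count in counts.items():
        rows = [i for i, lbl in enumerate(y) if lbl == label]
        for _ in range(target - count):
            i = rng.choice(rows)  # sample with replacement
            X_out.append(X[i])
            y_out.append(label)
    return X_out, y_out
```

When oversampling is combined with cross-validation, it is generally applied to the training fold only, so that duplicated rows do not leak into the test fold and inflate the scores.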

Languages

Language:Jupyter Notebook 100.0%