RomDeffayet / MelanomaChallenge

A melanoma classifier built for a custom kaggle challenge at Télécom ParisTech

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MelanomaChallenge

A melanoma classifier built for a custom kaggle challenge at Télécom ParisTech.

Methods used :

Feature Extraction

In the MelanomaChallenge.ipynb file, I extracted 26 features corresponding to colors, borders, symmetry and other local features. Meanwhile, in the Inception_svm.ipynb file, the features are obtained by keeping only the convolutional layers of Google's Inception_v3 neural network.

Oversampling

Since the dataset is highly imbalanced, a resampling gives better results. Due to the small number of samples in the dataset, oversampling is desirable. I chose used the SMOTE implementation from the imbalanced-learn API.

Dimensionality reduction

Through Principal Component Analysis and/or Sequential Feature Selection, we can reach a higher score by keeping only the most important features.

Classifier

I got promising results with Quadratic Discriminant Analyisis and k-Nearest Neighbors. However the most robust classifier is by far a linear SVM. I used the implementations from sklearn for these classifiers.

Sources :

About

A melanoma classifier built for a custom kaggle challenge at Télécom ParisTech


Languages

Language:Jupyter Notebook 100.0%