jamesmoonusa / Credit_Risk_Analysis

Predicting Credit Risk by Using Several Machine Learning algorithms

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Credit_Risk_Analysis

Overview

To predict credit risk we can use several machine learning models.

Results

  1. Random Oversampling Model - Result
  2. SMOTE Oversampling - Result
  3. Undersampling - Result
  4. Combination (Over and Under) Sampling - Result
  5. Balanced Random Forest Classifier - Result
  6. Easy Ensemble AdaBoost Classifier - Result
  7. ALL MODEL Result Comparison - Chart

Summary

All models show that 100% Precision on Low Risk but low numbers on High Risk Precision. High Risk Sensitivity range is between 59% ~ 69%. Low Risk Sensitivity range is between 45% ~ 91%. Only Low Risk Precision shows 100%. All models have low High Risk F1 score which can refer that imbalance between sensitivity and precision. Sensitivity and Precision are important measure for credit risk prediction, so I do not recommand any of these models to predict credit risk. If one of model must use, then either Balanced Random Forest Classifier or Easy Ensemble AdaBoost Classifier model should use for the prediction since those two model has highest Balanced Accuracy Score and F1 scores.

About

Predicting Credit Risk by Using Several Machine Learning algorithms


Languages

Language:Jupyter Notebook 100.0%