prneidhardt / Supervised-Learning-Classification

INN Hotels Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Supervised-Learning-Classification

  • Project completed as part of Great Learning's Postgraduate Program - Data Science and Business Analytics
  • Project delivered in November 2021
  • Repository includes two files:
    • Jupyter notebook with Python code written for data analysis and model building
    • CSV file includes data imported into notebook

Problem Statement

  • Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.

Skills and Tools

  • Exploratory Data Analysis (Variable identification, Univariate analysis, Bi-Variate analysis)
  • Data Pre-processing
  • Logistic regression
    • Multicollinearity
    • Optimal threshold using AUC-ROC curve
  • Decision trees
    • Pruning

About

INN Hotels Project

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 100.0%