sharmasapna / titanic

This repository contains the python code used for predicting the survivors

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository contains the python code used for predicting the survivors.

titanic_EDA_18june2021.ipynb.
Highlights:
This notebook contains EDA

titanic_prediction_with_score_0.78708_21july2021.ipynb
This is my best Titanic prediction with score of 78.708% till now!
Highlights:

  1. Family feature created by summing Parch, SibSp
  2. Isalone feature created where Family = 0
  3. Title feature created from Name by extracting the titles from them and then grouping them
  4. Missing value of Embarked filled with mode value
  5. Missing value of Age filled with random numbers generated as per mean and standard deviation
  6. Bins created for age and farehyoer
  7. Columns dropped: Cabin, Ticket,Age,Name,Fare
  8. Used Random Forest, Logistic Regression and XGBoost for prediction

Future efforts:

  1. Changing the missing value imputation for Age
  2. Keeping the cabin and ticket features
  3. Feature selection
  4. Stacking results from different models

About

This repository contains the python code used for predicting the survivors


Languages

Language:Jupyter Notebook 100.0%