A kaggle data analysis
Running the notebook:
- The requirements are in
requirements.txt
- Install with
pip install -r requirements.txt
- NOTE: Python 3.6+ only
- Install with
- The dataset is the two sigma connect rental
- Download it with the kaggle api
Run the following after activating your python environment:
kaggle competitions download -c two-sigma-connect-rental-listing-inquiries
unzip two-sigma-connect-rental-listing-inquiries.zip
unzip train.json.zip
unzip test.json.zip
# unzip images_sample.zip
# Optionally remove zip files
rm *.zip
Files:
- exploratory-data-analysis.ipynb - Data exploration notebook.
- XGboosting.ipynb - Data mining using various versions of XGboosting.
- Random Forest - Data mining using random forests
- K-Nearest-Neighbours Classifier - Data mining using KNN classifier