tonysln / ids2020-project

IDS 2020 Course Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Airbnb analysis & price prediction

IDS 2020 Course Project

Using New York City Airbnb Open Data on Kaggle.

The project is described in-depth in the report.

Files

/
├── data
│     ├── AB_NYC_2019.csv        # the dataset from Kaggle, contains information on Airbnb listing activity and metrics in New York, 2019
│     ├── AB_NYC_2019_xlsx.xlsx  # the original dataset ported to Microsoft Excel
│     └── NYC.jpg                # illustrative map of New York City
├── notebooks
│     ├── Analysis.ipynb         # Jupyter notebook (Python 3.8) for analyzing the dataset
│     └── Project.ipynb          # Jupyter notebook (Python 3.8) for the project, containing debugging and testing code
├── output                       # dataset visualizations in .png format, created by the main script
│     ├── fig1.png
│     └── ...
├── C4_report.pdf                # CRISP-DM report of the project
├── C4-KAGGLE-NYC-AIRBNB.py      # main source code for the project, using Python 3.8
└── C4-NYC-AIRBNB_poster.pdf     # introductory poster for our project

Requirements

Python version: 3.8 or newer

Libraries: pandas, numpy, seaborn, matplotlib, sklearn

Usage

  1. Clone this repository
$ git clone https://github.com/tonysln/ids2020-project.git
  1. Install missing libraries using pip

  2. Run the main script

$ python C4-KAGGLE-NYC-AIRBNB.py

Team

  • Anton Slavin (Group 6)
  • Elen Liivapuu (Group 6)

About

IDS 2020 Course Project


Languages

Language:Jupyter Notebook 99.7%Language:Python 0.3%