ahlag / kaggle_airbnb

Airbnb Kaggle Competition: New User Bookings

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Airbnb Kaggle Competition: New User Bookings

This repository contains the code developed for the Airbnb Kaggle competition. It's written in Python 2.7.

The code produces a prediction with a score around 0.88670, winner of the 3rd place out of 1463 teams in the competition.

Description (from competition website)

Where will a new guest book their first travel experience?

Instead of waking to overlooked "Do not disturb" signs, Airbnb travelers find themselves rising with the birds in a whimsical treehouse, having their morning coffee on the deck of a houseboat, or cooking a shared regional breakfast with their hosts.

New users on Airbnb can book a place to stay in 34,000+ cities across 190+ countries. By accurately predicting where a new user will book their first travel experience, Airbnb can share more personalized content with their community, decrease the average time to first booking, and better forecast demand.

In this competition, the challenges is to predict in which country a new user will make his or her first booking. In this challenge, you are given a list of users along with their demographics, web session records, and some summary statistics. You are asked to predict which country a new user's first booking destination will be. All the users in this dataset are from the USA.

There are 12 possible outcomes of the destination country: US, FR, CA, GB, ES, IT, PT, NL, DE, AU, NDF (no destination found), and other. Please note that NDF is different from other because other means there was a booking, but is to a country not included in the list, while NDF means there wasn't a booking.

The training and test sets are split by dates. In the test set, you will predict all the new users with first activities after 7/1/2014. In the sessions dataset, the data only dates back to 1/1/2014, while the users dataset dates back to 2010.

Data

The data should be download from the [competition][data_page] website and copied into the 'data' folder. The required files are: train_users_2.csv, test_users.csv and sessions.csv [data_page]: https://www.kaggle.com/c/airbnb-recruiting-new-user-bookings/data

Main Ideas of this solution

...TODO

Requirements

To execute the code in this repository you will need the next Python packages:

License

BSD 3 clause

About

Airbnb Kaggle Competition: New User Bookings


Languages

Language:Python 100.0%