twostarxx / Codes-for-WSDM-CUP-Music-Rec-1st-place-solution

Home Page:https://www.kaggle.com/c/kkbox-music-recommendation-challenge/discussion/45942

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Codes-for-WSDM-CUP-Music-Rec-1st-place-solution

This is the corresponding codes for WSDM CUP 2018 Music Recommendation Challenge's 1st place solution.

Please create following folders before testing:

input/training/source_data/

input/training/temporal_data/

input/validation/source_data/

input/validation/temporal_data/

temp_nn/

submission/

Put the data in the folder "source_data", then run script/run.sh, features will be extracted. For validation, you need to prepare data by hand, and create a "test_label.csv" file with "target" field.

The hyper-parameters is recorded in lgb_record.csv and nn_record.csv, you can try it directly. If everything is right, you should be able to get 0.744+ with LightGBM, and 0.742+ with 30-ensemble of NNs. 0.6 * LightGBM + 0.4 * NN should be able to get you ~0.749.

The code is tested on a small part of the data under python 2.7, if you find any bug, please contract me under the topic on Kaggle.

The versions of dependencies:

pandas: 0.20.1, sklearn: 0.18.1, keras: 2.0.4, lightgbm: 0.1, numpy: 1.12.1, scipy: 0.19.0, Tensorflow 1.0.1

About

https://www.kaggle.com/c/kkbox-music-recommendation-challenge/discussion/45942


Languages

Language:Python 99.7%Language:Shell 0.3%