Leavingseason / RecsysChallenge2017

Our solution for Recsys Challenge 2017.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This is our source code for Recsys Challenge 2017 http://2017.recsyschallenge.com/. The official rank of our team is 5th, and our final model ranked 2 for the last 2 consecutive weeks. It is a huge pity that we didn't use the best model in the first two weeks (in the most of other competitions, updating the best model before the last minute of deadline is enough, however, this is not true in this competiton, which is different from what we were expected.).

The code is written with Microsoft's internal big data platform named COSMOS and the language is Scope. If you are interested in running it, you can try the public version in Azure, which is called Data Lake and U-SQL https://docs.microsoft.com/en-us/azure/data-lake-analytics/data-lake-analytics-data-lake-tools-get-started .

Scripts under folder 'models' are scripts for extracting features, training model, making predictions, and many more post processing. The final features are in sparse format as SVMLight.

Programs under folder 'auto-pipeline' are c# source code for our automatic pipeline.

RecsysChallenge2017.pdf is our workshop paper, "Practical Lessons for Job Recommendations in the Cold-Start Scenario".
https://dl.acm.org/citation.cfm?id=3124794

Jianxun Lian, Fuzheng Zhang, Min Hou, Hongwei Wang, Xing Xie, and Guangzhong Sun. 2017. Practical Lessons for Job Recommendations in the Cold-Start Scenario. In Proceedings of the Recommender Systems Challenge 2017 (RecSys Challenge '17). ACM, New York, NY, USA, Article 4, 6 pages. DOI: https://doi.org/10.1145/3124791.3124794

About

Our solution for Recsys Challenge 2017.


Languages

Language:C# 89.5%Language:Python 10.5%Language:Smalltalk 0.0%