xumx / kaggle-advertised-salaries

Predicting job salaries from ads - a Kaggle competition

Home Page: http://fastml.com/predicting-advertised-salaries/

Predicting advertised salaries

See http://fastml.com/predicting-advertised-salaries/ for a description.

2vw.py - convert a combined train+test file to VW format
2vw_loc.py - the same, but for data transformed with update_locations.py
add_dummy_salaries.py - add two dummy salary columns to a test file and drop the header row
first.py - take some lines from the input file and save them to the output file
split.py - randomly split a file into two, line by line
unlog_predictions.r - convert VW's log predictions back to the normal scale by taking exp()
update_locations.py - replace the location columns from the original file with a parsed location (five columns)
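
The scripts above can be pictured with a minimal Python sketch. It is not the repository's code: the function names, the namespace name `title`, the tokenisation rule, the split probability, and the choice of log(salary) as the VW label are all assumptions made for illustration. The last one is inferred only from the fact that predictions are later passed through exp().

```python
import math
import random
import re


def to_vw_line(salary, fields):
    """Build one VW example: log(salary) as the label, one namespace per field.

    `fields` maps a hypothetical namespace name to raw text.
    """
    label = math.log(salary)
    parts = [f"{label:.6f}"]
    for namespace, text in fields.items():
        # '|' and ':' are special in VW's text format, so keep only
        # lowercase letters and digits as tokens.
        tokens = re.sub(r"[^a-z0-9]+", " ", text.lower()).split()
        parts.append(f"|{namespace} " + " ".join(tokens))
    return " ".join(parts)


def split_lines(lines, p=0.9, seed=0):
    """Send each line to the first list with probability p, else to the second."""
    rng = random.Random(seed)
    first, second = [], []
    for line in lines:
        (first if rng.random() < p else second).append(line)
    return first, second


def unlog(predictions):
    """Map log-scale predictions back to the salary scale with exp()."""
    return [math.exp(p) for p in predictions]
```

For example, `to_vw_line(25000, {"title": "Senior Engineer: C++|Java"})` yields a line whose label is log(25000) followed by `|title senior engineer c java`. Deciding line by line means the two output sizes only approximate the requested proportion, but the file never has to be loaded into memory at once when the same idea is applied to a stream.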

Languages

Python 96.7%, R 1.7%, Shell 1.6%