bolaik / RenthopApartmentInterestLevel

Solution posts to Kaggle competition Two sigma connect Renthop for rental listing inquries

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RenthopApartmentInterestLevel

Solution posts to Kaggle competition Two sigma connect Renthop for rental listing inquries

features selected

The basic features include features from Branden Murray's post and some further revisions.

advanced features

  • encoded features: This is based on the fact that some categorical turns out to have pretty high impact on prediction.
    • encode some numerical features, such as bedrooms, distance_city on manager_id, by taking the mean of features grouped by the categorical feature.
    • inspired by this, one can also encode important numerical features, such as price, on other features.
    • encode prediction (i.e. interest_level) on manager_id, requires cross validation.
    • encode (group mean, other statistics could also be defined) other numerical features (e.g. price) on categorical features (e.g. manager_id) conditioned on interest_level, also request cross validation.
  • geological features: local price fluctuation, from plantsgo
  • images features: also see magic feature

classifiers

Ensemble

  • Train level-2 models, including xgb, nn, knn, lr, lightgbm, where xgb prediction is submitted because of best cv score.

many helpful links for reference

About

Solution posts to Kaggle competition Two sigma connect Renthop for rental listing inquries


Languages

Language:Jupyter Notebook 54.7%Language:HTML 45.0%Language:Python 0.3%Language:R 0.0%