Future Sales Prediction Challenge - Kaggle
This submssion from Dec. 2017 was good for a 0.94x private leaderboard score and 13th place by the close of CY 2017. Final submission was a bagged version of XGB, LightGBM, and ExtraTreesRegressor (each individually built then averaged by a separate utility script).
Start with the submission.py file to see how the final submissions are created; this script calls contest.py (where all of the feature engineering and data cleansing takes place) and build.py (where ML models are constructed and parameterized). If you want to use the pickle files, which is much faster than loading the CSV files, refer to the pickle.py file.
email- cbenge509, 2017
- Competition:
- Challenge description: 1C - Predict Future Sales
- Documentation
- All documentation covered in the Jupyter Notebook files (3 x EDA and 1 x validation analysis).
- Models used
- SciKit-Learn:
- XGBoost
- Microsoft LightGBM
- MLXTend
- StackingRegressorCV
- this was used in the stacking branch of code (see submission.py); did not result in improved scores.
- StackingRegressorCV
- This project is released under a permissive MIT open source license (LICENSE-MIT.txt). There is no warranty; not even for merchantability or fitness for a particular solution.