Kaggle competition described here: https://www.kaggle.com/passnyc/data-science-for-good/home
Folder structure based on article found here: https://www.kdnuggets.com/2018/07/cookiecutter-data-science-organize-data-project.html
IPython notebook submitted for the Kaggle competition: https://www.kaggle.com/gbolla/bolla-passnyc-final-submission
Scripts to collect data: src/data (output goes to data/external)
Scripts to clean data and build features: src/features (output goes to data/interim)
Scripts to build single regression and final models: src/models (output goes to models/ and data/processed, respectively)
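
The layout above can be sketched as a small mapping from each stage to its script directory and output directories. This is only an illustrative sketch, not code from this repository: the stage labels ("collect", "features", "models") and the helper name ensure_layout are hypothetical, and only the directory paths come from the description above.

```python
from pathlib import Path

# Stage label (hypothetical) -> (script directory, output directories).
# Only the paths are taken from the folder description in this README.
PIPELINE = {
    "collect": ("src/data", ["data/external"]),
    "features": ("src/features", ["data/interim"]),
    "models": ("src/models", ["models", "data/processed"]),
}


def ensure_layout(root):
    """Create every script and output directory under root.

    Returns the list of directory paths, in pipeline order.
    """
    root = Path(root)
    created = []
    for script_dir, output_dirs in PIPELINE.values():
        for rel in [script_dir, *output_dirs]:
            path = root / rel
            # exist_ok=True makes the call safe to repeat on an existing checkout.
            path.mkdir(parents=True, exist_ok=True)
            created.append(path)
    return created
```

Calling ensure_layout(".") from the project root would create any missing directories without touching existing files.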