- Background
- Goals
- Data Collection and Data Cleaning
- Feature Engineering and Selection
- Modeling
- Results
- Final Thoughts
- Next Steps
- References
- Repo Structure
- Describe the background of the problem you are trying to solve and why it is important.
- List 2-3 goals
- Describe where the data came from
- Describe data shape
- Describe cleaning process
- Discuss if there is missing data and how you approached dealing with it
- Describe data science pipeline (include image if you have one)
- Discuss feature engineering process
- Discuss feature selection process
- Discuss model selection process
- Discuss hyperparameter tuning
- Accuracy and other metrics
- Confusion matrix
- Cross-validation
- Bias and variance trade-off
- Overfitting and underfitting
- Feature importance
- Interpretability
- Discuss any obstacles in developing the model and how you overcame them
- Pros/Cons of each decision you made throughout the project and what you have learned overall from the project
- Increase the amount of data
- Incorporate more features
- Experiment with different algorithms
- Validate the model on a different dataset
- Develop a user interface
├── /data (data)
├── /img (contains all images for repo)
├── 1-DataCleanManip.py
├── 2-DataModeling.py
├── README.md
└── dataset.csv