This repository is a compilation of resources for reducing the data size and optimizing python pandas.
- Basics of datatype optimizations - https://www.youtube.com/watch?v=u4_c2LDi4b8
- Basics of storage optimizations - https://www.youtube.com/watch?v=u4rsA5ZiTls
- How to reduce data size? Detailed steps - https://www.kaggle.com/competitions/amex-default-prediction/discussion/328054
- Speed up python dataframe loops - https://www.youtube.com/watch?v=SAFmrTnEHLg