General data science repository architecture guidelines
Data science projects are hosted in individual repositories, each containing:
- A code folder ('code') including:
- Code developed in Python and/or R;
- Code formatted as .py Python files oy .ipynb Jupyter Notebooks;
- A dataset folder ('dataset') including:
- Dataset input files in a variety of formats;
- An output folder ('output') including:
- Temporary or final files generated by the code from input datasets;
- A support folder ('support') including:
- Any additional file that may serve an ancillary project purpose.