paulobreviglieri / data-science-guidelines

General data science repository architecture guidelines

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Science Guidelines

General data science repository architecture guidelines

Data science projects are hosted in individual repositories, each containing:

  • A code folder ('code') including:
    • Code developed in Python and/or R;
    • Code formatted as .py Python files oy .ipynb Jupyter Notebooks;
  • A dataset folder ('dataset') including:
    • Dataset input files in a variety of formats;
  • An output folder ('output') including:
    • Temporary or final files generated by the code from input datasets;
  • A support folder ('support') including:
    • Any additional file that may serve an ancillary project purpose.

About

General data science repository architecture guidelines

License:MIT License