rstats-wtf / what-they-forgot

"What They Forgot to Teach You About R" website / eBook

Home Page: https://rstats.wtf

Data and version control

eamcvey opened this issue

A question we discussed that might be nice to include in the future: can/should we version control our data in Git/GitHub? If not, what are the other options?

Seconded! Can there be a section on how to add subdirectories (e.g. /data) to .gitignore? This matters especially when the data is too large to upload to GitHub, or should not be uploaded to GitHub at all.

"Small" data may well live on GitHub, but for anything larger than, say, 1 MB, you want to find a better place. I suggested using symlinks in #13 (comment); the symlinks themselves should also be added to .gitignore.
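
For anyone who wants to try the symlink idea, here is a rough sketch of how it might look from a shell. It is only an illustration, not the approach from #13: the directory names (`my-project`, `big-data-store`) and the file `measurements.csv` are made up, and everything happens in a temporary scratch directory so it is safe to run.

```shell
#!/bin/sh
set -eu

# Hypothetical locations inside a throwaway scratch directory.
work=$(mktemp -d)
project="$work/my-project"      # stands in for your Git repository
store="$work/big-data-store"    # stands in for wherever the large data lives

mkdir -p "$project" "$store"
echo "id,value" > "$store/measurements.csv"   # placeholder data file

cd "$project"

# Link data -> external store, so analysis code can keep using
# relative paths such as data/measurements.csv.
ln -s "$store" data

# Ignore the link. The leading slash anchors the pattern to the
# repository root. There is deliberately no trailing slash: a pattern
# ending in "/" matches directories only, and Git treats a symlink as
# a non-directory entry, so "data/" would not match the link.
printf '%s\n' '/data' >> .gitignore
```

If `data` is a real directory inside the repository rather than a symlink, the pattern `data/` works as well. Inside an actual repository, `git check-ignore -v data` should report which .gitignore line matched.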