DataScienceSpecialization / courses

Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Clean up and rewrite Git repository (or break it up)

kynan opened this issue · comments

This Git repository alone weighs in at 1073 MB (!!!) presently, which is infeasible to clone over anything but a decent broadband connection. A local clone (including work tree) is > 2GB.

It seems over time a lot of unnecessary and large binary files have accumulated (e.g. zip files with another copy of files already in the repository). Some have been removed again, but they're still present in the history.

It would be great to clean up this repository, remove unnecessary files and rewrite the history to get the repository size down using e.g. BFG Repo Cleaner.

Alternatively it might be better to split it up into repositories for each of the courses.