EpistasisLab / pmlb

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.

Home Page:https://epistasislab.github.io/pmlb/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Only fetch needed datasets from git LFS in github actions workflow

lacava opened this issue · comments

Currently we fetch all the LFS files using the lfs: true option here which is described here.

It would be better to only fetch datasets that need to be updated, especially since disk space and bandwidth are limited in github actions.