karpathy / llm.c

LLM training in simple, raw C/CUDA

Repository from Github https://github.comkarpathy/llm.cRepository from Github https://github.comkarpathy/llm.c

python dev/data/fineweb.py --version 10B

bigsnarfdude opened this issue · comments

fineweb.py requires datasets python module


pip install datasets
python dev/data/fineweb.py --version 10B

currently not included during install in the requirements.txt

commented

fixed