abacaj / train-with-fsdp

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

train-with-fsdp

Code used to fine-tune this model: abacaj/starcoderbase-1b-sft.

Note the data in folder data/ is not the full training data used. You can find the full set here: evol-codealpaca-v1

How to run

Install dependencies:

python -m venv env \
  && source env/bin/activate \
  && pip install -r requirements.txt

Run training code:

torchrun --nnodes=1 --nproc-per-node=<REPLACE_WITH_NUMBER_OF_GPUS> train.py

To add data place jsonl files in data/ and edit train.py line :154, :155.

Benchmarks

image

Charts

image

See: wandb

About

License:MIT License


Languages

Language:Python 100.0%