iwiwi / epochraft-hf-fsdp

Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP

epochraft-hf-fsdp

Simple example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP.

🌟 News: We are thrilled to announce the release of two new models: Japanese Stable LM Gamma 7B and Japanese StableLM 3B, both trained using our codebase.

Quick start

pip install -e .
python train.py gpt2_testrun.yaml  # 1 GPU
torchrun --nproc-per-node=8 train.py gpt2_testrun.yaml  # 8 GPUs
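The two launch commands above differ mainly in what the training process sees in its environment. torchrun exports RANK, LOCAL_RANK, and WORLD_SIZE to every worker, while a plain python launch sets none of them. The following is a minimal sketch of how a script could detect which mode it is in. LaunchInfo and read_launch_info are hypothetical names used for illustration; this is not necessarily how train.py in this repository handles it.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class LaunchInfo:
    """Hypothetical container for the process layout of a training run."""

    rank: int        # global index of this process across all nodes
    local_rank: int  # index of this process on its node (usually the GPU index)
    world_size: int  # total number of processes in the job

    @property
    def is_distributed(self) -> bool:
        return self.world_size > 1


def read_launch_info(env: Optional[Mapping[str, str]] = None) -> LaunchInfo:
    """Read the variables torchrun exports, falling back to a 1-process layout.

    With `python train.py ...` none of these variables are set, so the
    defaults describe a single process with rank 0.
    """
    env = os.environ if env is None else env
    return LaunchInfo(
        rank=int(env.get("RANK", 0)),
        local_rank=int(env.get("LOCAL_RANK", 0)),
        world_size=int(env.get("WORLD_SIZE", 1)),
    )


if __name__ == "__main__":
    # Prints the layout for however this script was launched.
    print(read_launch_info())
```

Passing the environment as an argument keeps the helper easy to check without actually starting several processes.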

Development

pip install -e ".[development]"  # quotes keep shells such as zsh from expanding the brackets
mypy .; black .; flake8 .; isort .

License: MIT License


Languages

Language: Python 100.0%