JY-Ren

JY-Ren

Geek Repo

Github PK Tool:Github PK Tool

JY-Ren's repositories

transpeeder

train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

neox-LLAMA2

update neox framework to train llama1&llama2

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License:Apache-2.0Stargazers:0Issues:0Issues:0