EleutherAI / DeeperSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Home Page:https://www.deepspeed.ai/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

License MIT

DeeperSpeed

DeeperSpeed is a fork of Microsoft's Deepspeed library that is tailor-made for the GPT-NeoX by EleutherAI.

Prior to 3/9/2023, DeeperSpeed was based on an old version of DeepSpeed (0.3.15). In order to migrate to the latest upstream DeepSpeed version while allowing users to access the old versions of GPT-NeoX and DeeperSpeed, we have introduced two versioned releases for both libraries:

About

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

https://www.deepspeed.ai/

License:Apache License 2.0


Languages

Language:Python 68.9%Language:C++ 20.6%Language:Cuda 9.6%Language:Shell 0.4%Language:C 0.4%Language:Dockerfile 0.1%Language:Batchfile 0.0%