rioyokotalab / Megatron-DeepSpeed-Ylab

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rioyokotalab/Megatron-DeepSpeed-Ylab Stargazers