microsoft/Megatron-DeepSpeed Issues
Bugs in GPT2 Inference Example
Updated 3MOE TFLOPS calculation
Updatedwhy moe can not use zero3
Updated屎山代码DeepSpeed
Updated 2Support TransformerEngine
Closed 1Doubts about GPU memory
Updatederror in generate_text.sh
Updated 1[BUG] suspicious bf16 config
Updated 2Zero2/3 segmentation fault
Closed 1