kwai / Megatron-Kwai

[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism

Home Page:https://www.usenix.org/conference/atc24/presentation/yuan

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

kwai/Megatron-Kwai Stargazers