eth-easl / fmengine

Utilities for Training Very Large Models

Home Page:https://fmengine.readthedocs.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyTorch DTensor

xzyaoi opened this issue · comments

PyTorch now has native support for distributed tensor, might be a better way to do TP than megatron's MPU.