hpcaitech / EnergonAI

Large-scale model inference.


[Feature]: Automatic Pipeline Parallelism

dujiangsu opened this issue · comments


Describe the feature:
We plan to introduce automatic pipeline parallelism into EnergonAI, so that users only need to specify a few simple arguments to obtain pipeline parallelism.
Built on torch.fx, the pipelinable directory provides functions that can split a model into multiple submodules.
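As a rough illustration of the idea (not EnergonAI's actual implementation), torch.fx's built-in `split_module` utility can partition a traced model into per-stage submodules from a node-to-stage mapping; `ToyModel` and the naive halfway split below are made up for this sketch:

```python
import torch
import torch.nn as nn
import torch.fx as fx
from torch.fx.passes.split_module import split_module

# Toy model used only for illustration.
class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer0 = nn.Linear(16, 16)
        self.layer1 = nn.Linear(16, 16)

    def forward(self, x):
        x = torch.relu(self.layer0(x))
        return self.layer1(x)

model = ToyModel()
traced = fx.symbolic_trace(model)

# Assign every fx node to a pipeline stage; here we simply switch
# stages at the midpoint of the graph.
nodes = list(traced.graph.nodes)
stage_of = {node: (0 if i < len(nodes) // 2 else 1) for i, node in enumerate(nodes)}

split = split_module(traced, model, lambda node: stage_of[node])
print(split)  # contains submod_0 and submod_1, one GraphModule per stage
```

In practice the stage assignment would come from a partitioning policy (e.g. balancing parameters or compute per stage) rather than a fixed halfway cut.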

Difficulty:

  1. Use the meta device during fx.GraphModule generation to reduce peak memory usage (see the sketch after this list).
  2. auto_pipeline_wrapper.py is not yet fully automated.
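As a rough sketch of point 1 (not existing EnergonAI code): the model can be constructed on PyTorch's meta device so that no real parameter storage is allocated while the fx graph is generated; the `nn.Sequential` model below is only a placeholder:

```python
import torch.nn as nn
import torch.fx as fx

# Construct the model on the meta device: parameters carry shapes and dtypes
# but no real storage, so building the graph does not pay for the weights.
meta_model = nn.Sequential(
    nn.Linear(4096, 4096, device="meta"),
    nn.ReLU(),
    nn.Linear(4096, 4096, device="meta"),
)

# Symbolic tracing records the graph structure without executing real
# tensor kernels, so it works on meta parameters.
traced = fx.symbolic_trace(meta_model)
print(traced.graph)
```

The per-stage submodules could then be materialized on their target devices, so each pipeline stage only ever holds the weights it actually needs.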