EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Home Page: https://www.eleuther.ai/

convert_hf_to_module(pipeline_parallel>1)

liuxinxin123 opened this issue

hi,
I see that `convert_hf_to_sequential.py` supports converting a HuggingFace transformers model to a NeoX model, but only without pipeline parallelism.
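
For reference, this is roughly how I drive the existing script for the no-PP case; the script path and flag names below are assumptions from my local checkout, so the real interface may differ (check the script's `--help`):

```python
# Minimal sketch of driving the existing HF -> NeoX conversion for the no-PP
# case. Script path and flag names are assumptions from my checkout of
# gpt-neox; run the script with --help to confirm the actual interface.
import subprocess

subprocess.run(
    [
        "python", "tools/ckpts/convert_hf_to_sequential.py",
        "--hf-model-name", "pythia-70m-v0",            # assumed flag: HF checkpoint to convert
        "--output-dir", "checkpoints/neox_converted",  # assumed flag: where NeoX weights land
        "--config", "configs/pythia/70M.yml",          # assumed flag: NeoX config matching the model
    ],
    check=True,
)
```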

Can support be added for converting a HuggingFace transformers model to a NeoX model with pipeline parallelism greater than 1? Or is there a way to do this now?
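
If there is no built-in support yet, would the right workaround be to shard the HF weights into DeepSpeed pipeline layer files by hand? Below is a rough sketch of what I have in mind; the layer offset and the key-name mapping are my assumptions, not what `convert_hf_to_sequential.py` actually does, so both would need to be verified against the script:

```python
# Sketch of a manual workaround for PP > 1: split the HF state dict into the
# per-layer files that DeepSpeed's PipelineModule loads
# (named layer_XX-model_YY-model_states.pt). Everything below is illustrative:
# the "+2" layer offset and the assumption that HF GPT-NeoX parameter names
# match NeoX's per-layer names would both need to be checked against what
# convert_hf_to_sequential.py actually does (e.g. QKV fusion/ordering).
import os
import torch
from transformers import GPTNeoXForCausalLM

OUT_DIR = "neox_pp_ckpt"
os.makedirs(OUT_DIR, exist_ok=True)

hf_model = GPTNeoXForCausalLM.from_pretrained("EleutherAI/pythia-70m")
sd = hf_model.state_dict()

def save_layer(idx: int, tensors: dict) -> None:
    # model_00 = tensor-parallel rank 0; with TP > 1 each rank saves its shard
    torch.save(tensors, os.path.join(OUT_DIR, f"layer_{idx:02d}-model_00-model_states.pt"))

# Layer 0: word embeddings get their own pipeline layer in NeoX.
save_layer(0, {"word_embeddings.weight": sd["gpt_neox.embed_in.weight"]})

# One file per transformer block; the +2 offset assumes two non-parameter
# layers sit between the embedding and the first block in NeoX's pipeline.
for i in range(hf_model.config.num_hidden_layers):
    prefix = f"gpt_neox.layers.{i}."
    block = {k[len(prefix):]: v for k, v in sd.items() if k.startswith(prefix)}
    save_layer(i + 2, block)
```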

Thank you for the issue. The language you're using is a little confusing to me. Am I correct in thinking you want to go HF -> NeoX w/ PP > 1? That is, you can currently convert with no PP, but not with PP > 1?

@liuxinxin123 Hey, I wanted to follow up on this. Can you elaborate on what your issue is?