TODO : switch code base of expert parallel from colossalai to deepspeed
scsc0511 opened this issue · comments
scsc0511 commented
Describe a TODO feature
- Existing code is based on colossalai but this code is not proper for multi parallelism.
- Switch the code base of expert parallel to deepspeed