There are 1 repository under large-scale-language-modeling topic.
Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*