yutian-mt's repositories
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language:PythonApache-2.0000
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:PythonNOASSERTION000