Jerry Li's repositories
PAI-Megatron-LM-240718
Ongoing research training transformer models at scale
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python | License: Apache-2.0
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python | License: MIT
Language: Python | License: Apache-2.0
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python | License: Apache-2.0