There are 0 repository under ml-efficiency topic.
Supercharge Your Model Training
(Unofficial) building Hugging Face SmolLM-blazingly fast and small language model with PyTorch implementation of grouped query attention (GQA)