Rumen Mihaylov's repositories
fiddler
Fast Inference of MoE Models with CPU-GPU Orchestration
Apache-2.0000
llm-autoeval
Automatically evaluate your LLMs in Google Colab
Language:PythonMIT000
lm-evaluation-harness
A framework for few-shot evaluation of language models.
MIT000
falcontune
Tune any FALCON in 4-bit
Apache-2.0000