Rumen Mihaylov's repositories
falcontune
Tune any FALCON in 4-bit
Language: Python, License: Apache-2.0
fiddler
Fast Inference of MoE Models with CPU-GPU Orchestration
Language: Python, License: Apache-2.0
llm-autoeval
Automatically evaluate your LLMs in Google Colab
Language: Python, License: MIT
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python, License: MIT