Reza Yazdani's repositories
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python | License: Apache-2.0 | 0 | 0 | 0
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python | License: Apache-2.0 | 0 | 0 | 0
Language: C++ | License: Apache-2.0 | 0 | 0 | 0
FlashAttention
Fast and memory-efficient exact attention
Language: Python | License: BSD-3-Clause | 0 | 0 | 0
llama
Inference code for LLaMA models
Language: Python | License: NOASSERTION | 0 | 0 | 0
llama2.c
Inference Llama 2 in one file of pure C
Language: Python | License: MIT | 0 | 0 | 0
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python | License: MIT | 0 | 0 | 0
transformers-1
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
Language: Python | License: Apache-2.0 | 0 | 0 | 0
transformers-bloom-inference
Fast Inference Solutions for BLOOM
Language: Python | License: Apache-2.0 | 0 | 0 | 0
triton
Development repository for the Triton language and compiler
Language: C++ | License: MIT | 0 | 0 | 0
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
License: Apache-2.0 | 0 | 0 | 0