Arun A. Kumar's repositories
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
python-unidiff
Unified diff python parsing/metadata extraction library
seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
state-spaces
Sequence Modeling with Structured State Spaces
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
triton
Development repository for the Triton language and compiler
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
whisper
Robust Speech Recognition via Large-Scale Weak Supervision