Sanchit Gandhi's repositories
whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
parler-tts
Inference and training library for high-quality TTS models.
pyannote-audio-ka
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
alignment-handbook
Robust recipes to align language models with human and AI preferences
audio-transformers-course
The Hugging Face Course on Transformers for Audio
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
candle
Minimalist ML framework for Rust
faster-whisper
Faster Whisper transcription with CTranslate2
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
trl
Train transformer language models with reinforcement learning.
whisper.cpp
Port of OpenAI's Whisper model in C/C++