sanchit-gandhi

followers

following

stars

@huggingface

London, UK

Sanchit Gandhi's repositories

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookApache-2.04146 42 172

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.08200

notebooks

A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).

Language:Jupyter Notebook32 2 1

codesnippets

Language:Jupyter Notebook7 20

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonApache-2.03 20

simple-machine-setup

Language:Shell3 20

whisper-calm

Language:Python3 20

pyannote-audio-ka

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

MIT200

small-language-models

Language:PythonApache-2.0200

whisper-pt2

Language:Python2 20

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.01 10

audio-transformers-course

The Hugging Face Course on Transformers for Audio

Language:MDXApache-2.01 10

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonNOASSERTION1 10

benchmark-asr

Language:Python1 10

blog

Public repo for HF blog posts

Language:Jupyter Notebook1 10

dataspeech

MIT100

diarizers

100

insanely-fast-whisper

Language:Jupyter NotebookApache-2.01 10

MusicLDM

The latent diffusion model for text-to-music generation.

Language:PythonNOASSERTION1 10

open_asr_leaderboard

Language:Python100

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language:PythonApache-2.01 10

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT010

candle

Minimalist ML framework for Rust

Apache-2.0000

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT000

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonApache-2.0010

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:CNOASSERTION010

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0010

triton

Development repository for the Triton language and compiler

Language:C++MIT010

trl

Train transformer language models with reinforcement learning.

Apache-2.0000

whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT000