Sanchit Gandhi (sanchit-gandhi)



Company: @huggingface

Location: London, UK


Sanchit Gandhi's repositories

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4146 | Issues: 42 | Issues: 172

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 82 | Issues: 0 | Issues: 0

notebooks

A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).

Language: Jupyter Notebook | Stargazers: 32 | Issues: 2 | Issues: 1
Language: Jupyter Notebook | Stargazers: 7 | Issues: 2 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 3 | Issues: 2 | Issues: 0

pyannote-audio-ka

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 2 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

audio-transformers-course

The Hugging Face Course on Transformers for Audio

Language: MDX | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

AudioLDM2

Text-to-Audio/Music Generation

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

blog

Public repo for HF blog posts

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0
License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
Stargazers: 1 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

MusicLDM

The latent diffusion model for text-to-music generation.

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

candle

Minimalist ML framework for Rust

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

trl

Train transformer language models with reinforcement learning.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0