Vector Ventures's starred repositories
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023.
SpeechPrompt-v2
"SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks": speech processing with the prompting paradigm
zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
tts-trainer
Generate audio datasets for training text-to-speech models through smart audio splitting with silence detection and transcription using Whisper.
ArticulateAI
Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs APIs
naturalspeech2-pytorch
Implementation of NaturalSpeech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch