Vector Ventures's starred repositories
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
encodec.cpp
Port of Meta's Encodec in C/C++
UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
VocalForge
Your one-stop solution for voice dataset creation
ml-spatial-librispeech
A large synthetic dataset of spatial audio with multiple labels
PromptTTS2
[WIP] Unofficial Implementation of Microsoft's PromptTTS2
whisper-cpp-server
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
rvc-onnx-test
for onnx export test from rvc