Vector Ventures's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python, License: Apache-2.0, Stargazers: 26115, Issues: 225, Issues: 4331

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

mamba

Mamba SSM architecture

Language: Python, License: Apache-2.0, Stargazers: 12461, Issues: 102, Issues: 493

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python, License: BSD-3-Clause, Stargazers: 10358, Issues: 104, Issues: 145

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python, License: Apache-2.0, Stargazers: 7583, Issues: 108, Issues: 152
Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 7222, Issues: 63, Issues: 186

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python, License: Apache-2.0, Stargazers: 7179, Issues: 63, Issues: 149

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python, License: MIT, Stargazers: 4689, Issues: 79, Issues: 188

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language: Python, License: MIT, Stargazers: 3475, Issues: 64, Issues: 98

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language: C++, License: Apache-2.0, Stargazers: 1640, Issues: 33, Issues: 642

UniAudio

The Open Source Code of UniAudio

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language: Python, License: MIT, Stargazers: 281, Issues: 5, Issues: 43
Language: Python, License: Apache-2.0, Stargazers: 248, Issues: 13, Issues: 15

ar-vits

Text-to-speech using an autoregressive transformer and VITS

Language: Python, License: MIT, Stargazers: 222, Issues: 15, Issues: 4

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language: Python, License: MIT, Stargazers: 207, Issues: 14, Issues: 42

nendo

The Nendo AI Audio Tool Suite

Language: Python, License: MIT, Stargazers: 206, Issues: 7, Issues: 8

bigvsan

PyTorch implementation of BigVSAN

Language: Python, License: MIT, Stargazers: 196, Issues: 29, Issues: 6

encodec.cpp

Port of Meta's Encodec in C/C++

ttts

Train the next generation of TTS systems.

Language: Python, License: MPL-2.0, Stargazers: 159, Issues: 14, Issues: 19

UniCATS-CTX-vec2wav

[AAAI 2024] Code for CTX-vec2wav in UniCATS

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language: Python, License: MIT, Stargazers: 107, Issues: 5, Issues: 16

VocalForge

Your one-stop solution for voice dataset creation

Language: Python, License: MIT, Stargazers: 106, Issues: 8, Issues: 12

VecTok

Official implementation of Vec-Tok Speech

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

License: NOASSERTION, Stargazers: 84, Issues: 17, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 61, Issues: 5, Issues: 12

PromptTTS2

[WIP] Unofficial Implementation of Microsoft's PromptTTS2

Language: Python, Stargazers: 49, Issues: 5, Issues: 0

whisper-cpp-server

whisper-cpp-server: real-time speech recognition with OpenAI's Whisper model in C/C++

Language: HTML, License: MIT, Stargazers: 33, Issues: 2, Issues: 5

rvc-onnx-test

For testing ONNX export from RVC

Language: Python, License: MIT, Stargazers: 4, Issues: 3, Issues: 0