Beast code in Giters

Vector Ventures's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.026115 225 4331

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Apache-2.014252 672 90

mamba

Mamba SSM architecture

Language:PythonApache-2.012461 102 493

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause10358 104 145

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.07583 108 152

insanely-fast-whisper

Language:Jupyter NotebookApache-2.07222 63 186

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.07179 63 149

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonMIT4689 79 188

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonMIT3475 64 98

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++Apache-2.01640 33 642

UniAudio

The Open Source Code of UniAudio

Language:Python505 38 32

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python285 17 13

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonMIT281 5 43

MahaTTS

Language:PythonApache-2.0248 13 15

ar-vits

text to speech using autoregressive transformer and VITS

Language:PythonMIT222 15 4

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonMIT207 14 42

nendo

The Nendo AI Audio Tool Suite

Language:PythonMIT206 7 8

bigvsan

Pytorch implementation of BigVSAN

Language:PythonMIT196 29 6

encodec.cpp

Port of Meta's Encodec in C/C++

Language:C++187 10 4

ttts

Train the next generation of TTS systems.

Language:PythonMPL-2.0159 14 19

UniCATS-CTX-vec2wav

[AAAI 2024] Code for CTX-vec2wav in UniCATS

Language:Python115 10 9

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language:PythonMIT107 5 16

VocalForge

Your one-stop solution for voice dataset creation

Language:PythonMIT106 8 12

VecTok

Official implementation of Vec-Tok Speech

91 19 1

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

NOASSERTION84 170

CLARA

Language:PythonApache-2.061 5 12

PromptTTS2

[WIP] Unofficial Implementation of Microsoft's PromptTTS2

Language:Python49 50

whisper-cpp-server

whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++

Language:HTMLMIT33 2 5

vits3_pytorch

Language:PythonMIT26 7 1

rvc-onnx-test

for onnx export test from rvc

Language:PythonMIT4 30