Vector Ventures's starred repositories

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9702Issues:84Issues:246

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:1145Issues:51Issues:15

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1017Issues:26Issues:62

chatpdf

Chat and Ask on your own data. Accelerator to quickly upload your own enterprise data and use OpenAI services to chat to that uploaded data and ask questions

Language:TypeScriptLicense:MITStargazers:680Issues:25Issues:30

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonLicense:MITStargazers:610Issues:17Issues:42

GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Language:PythonLicense:Apache-2.0Stargazers:580Issues:11Issues:9

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:427Issues:14Issues:35

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:405Issues:29Issues:27

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language:PythonLicense:Apache-2.0Stargazers:399Issues:40Issues:15

mayavoz

Pytorch based speech enhancement toolkit.

Language:PythonLicense:MITStargazers:322Issues:14Issues:16

XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Language:PythonLicense:MITStargazers:291Issues:10Issues:19

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:266Issues:14Issues:13

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

spear-tts-pytorch

An unofficial PyTorch implementation of SPEAR-TTS.

Language:Jupyter NotebookLicense:MITStargazers:209Issues:32Issues:15

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language:PythonLicense:MITStargazers:199Issues:16Issues:46

CoMoSpeech

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Language:PythonLicense:MITStargazers:168Issues:11Issues:10

DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:143Issues:6Issues:9
Language:Jupyter NotebookLicense:MITStargazers:133Issues:12Issues:11

Barkify

Barkify: an unoffical training implementation of Bark TTS by suno-ai

SpeechPrompt-v2

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

Language:PythonLicense:MITStargazers:69Issues:13Issues:0

zeus-llm-trainer

Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:66Issues:3Issues:0

WaveODE

An ODE-based generative neural vocoder using Rectified Flow

tts-trainer

Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using Whisper.

fluenttts

FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS

Language:PythonStargazers:20Issues:0Issues:0

ArticulateAI

Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's

Language:PythonLicense:MITStargazers:14Issues:2Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:6Issues:2Issues:0