There are 18 repositories under the vits topic.
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Easily train a good VC model with 10 minutes of voice data or less!
SoftVC VITS Singing Voice Conversion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
so-vits-svc fork with realtime support, improved interface and more features.
vits2 backbone with multilingual-bert
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, a websocket server/client, and 12 programming languages.
AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on a local machine. It uses TTS technology [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声] to generate spoken replies and can optionally change the voice with [so-vits-svc/DDSP-SVC]; commands can also trigger SD image generation.
Core Engine of Singing Voice Conversion & Singing Voice Clone
A simple, high-quality voice conversion tool focused on ease of use and performance.
Best-practice TTS based on BERT and VITS, with some features from Microsoft's NaturalSpeech; supports ONNX streaming output!
A C++ inference library for multiple SVC/TTS models.
A simple VITS HTTP API, developed by extending Moegoe with additional features.
So-VITS-SVC local deployment and usage guide, with a Colab notebook provided.
🦖PyTorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥
SummerTTS is a standalone, C++-based Chinese and English speech synthesis (TTS) project. It runs locally without a network connection and has no additional dependencies; a one-step build is all it takes to start synthesizing Chinese and English speech.
AivisSpeech: AI Voice Imitation System - Text to Speech Software
Probing the representations of Vision Transformers.
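Persian/Farsi text-to-speech (TTS) training using Coqui TTS
Development is currently paused, but this repository is planned to be substantially revamped in the future as Aivis-Project/AivisBuilder.
Full local deployment of ASR (K2) - NLP (Rasa, Spacy) - LLM (Chatglm2) - TTS (Vits)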
🌻 VITS ONNX TTS server designed for fast inference 🔥
Application of MB-iSTFT-VITS components to vits2_pytorch
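An image search engine, made simple. Build your own image search engine in three steps and learn vector databases, image-to-image search, and text-to-image search.
VITS for Mandarin. Supports Windows and Linux, on both low-end and high-end hardware.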
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine