barryhunt's starred repositories
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with Langchain and LLMs such as ChatGLM, Qwen, and Llama
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Clash-for-Windows_Chinese
Chinese-localized version of Clash for Windows. Provides the localized build, a localization patch, and a localized installer.
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Bert-VITS2
vits2 backbone with multilingual-bert
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
video-subtitle-extractor
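A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction.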
VITS-fast-fine-tuning
A VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion
parler-tts
Inference and training library for high-quality TTS models.
ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++; more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
RapidVideOCR
Extract video hard subtitles and automatically generate corresponding srt files.
GPT-SoVITS-Inference
Inference Specialization
audio-preprocess
Preprocess Audio for training
RVC-Studio
The best looking and most functional webui for RVC related tasks. See website for UI demo: