Yuhang's starred repositories
e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
detail_tts
All generative models in one for a better TTS model
stable-audio-tools
Generative models for conditional audio generation
OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPL phonemizer.
SpeechAlgorithms
Speech Algorithms
metavoice-src
Foundational model for human-like, expressive TTS
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
DeepFilterNet
Noise suppression using deep filtering
hello-algo
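"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing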
chatgpt_system_prompt
A collection of GPT system prompts and various prompt injection/leaking knowledge.
streamlit-audio-recorder
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Mixly_Arduino
A visual programming editor based on Blockly for Arduino, Microbit, MicroPython, and Python
Free-Certifications
A curated list of free courses & certifications.
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
RemoveAdblockThing
The intrusive "Ad blockers are not allowed on YouTube" message is annoying. This open-source project aims to address this issue by providing a solution to bypass YouTube's ad blocker detection.
magvit2-pytorch
Implementation of MagViT2 Tokenizer in PyTorch
wukong-robot
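🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may also be the first open-source smart speaker project to support brain-computer interaction.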