Yuhang's starred repositories

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Language: Python | License: MIT | Stargazers: 161 | Issues: 0

chatllama

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT, runnable on a single GPU. 15x faster training process than ChatGPT

Language: Python | Stargazers: 1202 | Issues: 0

detail_tts

All generative models in one for a better TTS model

Language: Python | Stargazers: 36 | Issues: 0

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 2333 | Issues: 0

OpenPhonemizer

An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPL phonemizer.

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 72 | Issues: 0

SpeechAlgorithms

Speech Algorithms

License: Apache-2.0 | Stargazers: 1 | Issues: 0

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language: Python | License: MIT | Stargazers: 305 | Issues: 0

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

Language: Python | License: MIT | Stargazers: 4105 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20843 | Issues: 0

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 1354 | Issues: 0

metavoice-src

Foundational model for human-like, expressive TTS

Language: Python | License: Apache-2.0 | Stargazers: 3550 | Issues: 0

gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Language: Python | License: MIT | Stargazers: 8180 | Issues: 0

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)

Language: Python | License: MIT | Stargazers: 29604 | Issues: 0

DeepFilterNet

Noise suppression using deep filtering

Language: Python | License: NOASSERTION | Stargazers: 2206 | Issues: 0

megatts2

Unofficial implementation of Megatts2

Language: Python | License: MIT | Stargazers: 245 | Issues: 0

awesome

😎 Awesome lists about all kinds of interesting topics

License: CC0-1.0 | Stargazers: 312265 | Issues: 0

hello-algo

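"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing.

Language: Java | License: NOASSERTION | Stargazers: 89101 | Issues: 0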

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language: HTML | License: MIT | Stargazers: 7757 | Issues: 0

SpeechAlgorithms

Speech Algorithms

Language: C | License: Apache-2.0 | Stargazers: 729 | Issues: 0

streamlit-audio-recorder

Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)

Language: TypeScript | License: MIT | Stargazers: 394 | Issues: 0

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Language: Python | License: BSD-3-Clause | Stargazers: 8200 | Issues: 0

snowboy

Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy

Language: C++ | License: NOASSERTION | Stargazers: 3042 | Issues: 0

Mixly_Arduino

A visual programming editor based on Blockly for Arduino, Microbit, MicroPython, and Python

Language: C | License: Apache-2.0 | Stargazers: 241 | Issues: 0

Free-Certifications

A curated list of free courses & certifications.

License: MIT | Stargazers: 24843 | Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | License: Apache-2.0 | Stargazers: 6151 | Issues: 0

RemoveAdblockThing

The intrusive "Ad blockers are not allowed on YouTube" message is annoying. This open-source project aims to address this issue by providing a way to bypass YouTube's ad blocker detection.

Language: JavaScript | License: MIT | Stargazers: 5955 | Issues: 0

MoeGoe

Executable file for VITS inference

Language: Python | License: MIT | Stargazers: 2315 | Issues: 0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language: Python | License: MIT | Stargazers: 498 | Issues: 0

AudioSep

Official implementation of "Separate Anything You Describe"

Language: Python | License: MIT | Stargazers: 1512 | Issues: 0

wukong-robot

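🤖 wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.

Language: Python | License: MIT | Stargazers: 6073 | Issues: 0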