Liumeng Xue's starred repositories
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
fish-speech
Brand new TTS solution
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
ComfyUI-Workflows-ZHO
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
resemble-enhance
AI powered speech denoising and enhancement
HierSpeechpp
The official implementation of HierSpeech++
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.