AlexYangli's repositories
a-week-in-wild-ai
A 360-degree view of AI/ML/DL applications
Arabic-Tashkeela-Model
A diacritization model for the Arabic language, trained on Tashkeela, the Arabic diacritization corpus on Kaggle.
Bert-VITS2
VITS2 backbone with BERT
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
EasySpider
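A visual no-code/code-free web crawler/spider. EasySpider (易采集) is a visual browser automation testing, data collection, and crawler tool that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.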
english-conversation-corpus
English conversation corpus for conversational TTS.
ESLTTS
ESLTTS dataset
g2p_id
g2p ID: Indonesian Grapheme-to-Phoneme Converter
Latent-GLAT
Implementation of latent-GLAT (ACL-2022)
low-resource-languages
Resources for conservation, development, and documentation of low-resource (human) languages.
metahuman-stream
Real-time streaming digital human based on NeRF
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
performant
A toolset for easy formant extraction and visualization from wav files and TTS models
Singing-Voice-Conversion
A singing voice conversion project.
speechbrain
A PyTorch-based Speech Toolkit
StoryDiffusion
Create Magic Story!
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
voicefixer
General Speech Restoration
weekly
Tech Enthusiast Weekly (科技爱好者周刊), published every Friday
WikipediaHomographData
Labeled data for homograph disambiguation