AlexYangli's repositories
EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
StoryDiffusion
Create Magic Story!
ESLTTS
ESLTTS dataset
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
metahuman-stream
Real time streaming digital human based on nerf
weekly
科技爱好者周刊,每周五发布
Singing-Voice-Conversion
Project of Singing Voice Conversion.
Bert-VITS2
vits2 backbone with bert
Arabic-Tashkeela-Model
This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on Kaggle
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
voicefixer
General Speech Restoration
WikipediaHomographData
Labeled data for homograph disambiguation
english-conversation-corpus
English conversation corpus for conversational TTS.
g2p_id
g2p ID: Indonesian Grapheme-to-Phoneme Converter
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
NeMo
NeMo: a toolkit for conversational AI
sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
speechbrain
A PyTorch-based Speech Toolkit
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
a-week-in-wild-ai
360 view on ai/ml/dl applications
performant
A toolset for easy formant extraction and visualization from wav files and TTS models
NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
Latent-GLAT
Implementation of latent-GLAT (ACL-2022)
rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.