yingfenging's repositories
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
cantonese-books-data
粵音資料集叢:典籍資料
CharsiuG2P
Multilingual G2P in over 100 languages
chinese_speech_pretrain
chinese speech pretrained models
FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
FastGithub
github加速神器,解决github打不开、用户头像无法加载、releases无法上传下载、git-clone、git-pull、git-push失败等问题
g2pW
Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音
GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
LPCNet
Efficient neural speech synthesis
Meta-TTS
Official repository of https://arxiv.org/abs/2111.04040v1
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
NeMo
NeMo: a toolkit for conversational AI
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
paper2gui
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
Parselmouth
Praat in Python, the Pythonic way
PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
pycantonese
Cantonese Linguistics and NLP in Python
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
so-vits-svc
基于vits与softvc的歌声音色转换模型
STYLER
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
TTS-Objective-Metrics
Objective metrics used in several text-to-speech (TTS) papers.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vits_chinese
vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统,兼容性非常好的合成框架
voicefixer_main
General Speech Restoration
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.