hertz's repositories
phonemizer
Simple text to phones converter for multiple languages
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
cn2an
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
espnet
End-to-End Speech Processing Toolkit
facialanimation
Source code for: Expressive Speech-driven Facial Animation with controllable emotions
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
g2pW
Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
music-generation-research
A straightforward collection of Music Generation research resources.
Muskits
An opensource music processing toolkit
NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
new-pac
翻墙-科学上网、免费科学上网、免费翻墙、油管youtube、fanqiang、VPN、一键翻墙浏览器,vps一键搭建翻墙服务器脚本/教程,免费shadowsocks/ss/ssr/v2ray/goflyway账号/节点,免费自由上网、翻墙梯子,电脑、手机、iOS、安卓、windows、Mac、Linux、路由器翻墙、科学上网
nnsvs
Neural network-based singing voice synthesis library for research
pinyin-pro
中文转拼音、拼音音调、拼音声母、拼音韵母、多音字拼音、拼音首字母
ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
reverse-interview-zh
技术面试最后反问面试官的话
SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
stable-diffusion
A latent text-to-image diffusion model
TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Easy to adapt for other languages)
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ttts
Train the next generation of TTS systems.
UniAudio
The Open Source Code of UniAudio
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
VITS_Singing
Use VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.
x-ui
支持多协议多用户的 xray 面板