hertz-pj

hitsz

hertz's repositories

phonemizer

Simple text to phones converter for multiple languages

Language: Python | License: GPL-3.0 | 200

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | 100

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Language: JavaScript | License: NOASSERTION | 100

a3t

Language: Python | License: Apache-2.0 | 000

cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Language: Python | License: MIT | 000

ddsp-singing-vocoders

Official implementation of SawSing (ISMIR'22)

Language: Python | License: AGPL-3.0 | 000

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | 000

facialanimation

Source code for: Expressive Speech-driven Facial Animation with controllable emotions

Language: Python | License: Apache-2.0 | 000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | 000

g2pW

Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin.

Language: Python | License: Apache-2.0 | 000

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language: Python | License: MIT | 000

music-generation-research

A straightforward collection of Music Generation research resources.

000

Muskits

An open-source music processing toolkit

Language: Python | License: Apache-2.0 | 000

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Language: Python | 000

new-pac

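Internet censorship circumvention ("scientific internet access"): free circumvention tools, YouTube access, fanqiang, VPN, one-click circumvention browsers, one-click VPS scripts and tutorials for setting up a circumvention server, free shadowsocks/ss/ssr/v2ray/goflyway accounts and nodes, free unrestricted internet access, and circumvention for computers, phones, iOS, Android, Windows, Mac, Linux, and routers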

000

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python | License: MIT | 000

pinyin-pro

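Chinese-to-Pinyin conversion: Pinyin tones, Pinyin initials, Pinyin finals, Pinyin for polyphonic characters, and Pinyin first letters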

Language: TypeScript | License: MIT | 000

ProDiff

PyTorch Implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline

Language: Python | License: MIT | 000

reverse-interview-zh

Questions to ask the interviewer at the end of a technical interview

License: NOASSERTION | 000

SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Language: Python | License: MIT | 000

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | 000

SoundStorm

000

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | 000

TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, Korean, Chinese, and German, and is easy to adapt to other languages)

Language: Python | License: Apache-2.0 | 000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 000

ttts

Train the next generation of TTS systems.

License: MPL-2.0 | 000

UniAudio

The Open Source Code of UniAudio

Language: Python | 000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | 000

VITS_Singing

Use VITS and Opencpop to develop singing voice synthesis; it may become VISinger.

Language: Python | License: Apache-2.0 | 000

x-ui

An xray panel supporting multiple protocols and multiple users

Language: JavaScript | License: GPL-3.0 | 000