xingxy's repositories
AISystem
AISystem covers AI systems: full-stack low-level AI technologies, including AI chips, AI compilers, and AI inference and training frameworks
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
apachecn-dl-zh
ApacheCN collection of deep learning translations
Bert-vits2-V2.3
Bert-vits2-V2.3 training and inference
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
encodecmae
Audio codec
g2p-zh-en
Bilingual Chinese and English G2P
GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
malaya-speech
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch
naturalspeech
A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)
paddlespeech_tts_cpp
PaddleSpeech TTS cpp
resemble-enhance
AI powered speech denoising and enhancement
SNAC
Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
TensorFlowASR
⚡ TensorFlowASR: Almost state-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ttts
Train the next generation of TTS systems.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vampnet
music generation with masked transformers!
vits_chinese
Best TTS based on BERT and VITS, with some features from Microsoft's NaturalSpeech