aixingxy

xingxy's repositories

AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Apache-2.0000

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

MIT000

apachecn-dl-zh

ApacheCN 深度学习译文集

NOASSERTION000

Bert-vits2-V2.3

Bert-vits2-V2.3 训练和推理

000

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

000

cached_conv

NOASSERTION000

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Apache-2.0000

encodecmae

音频编解码

000

g2p-zh-en

Chinese and English Bilinguish G2P

Language:PythonNOASSERTION000

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language:PythonMIT000

hifi-gan-misrnet

MIT000

IceDemo

Language:PythonApache-2.0000

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Apache-2.0000

malaya-speech

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Language:Jupyter NotebookMIT000

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonMIT000

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

000

NeuralSpeech

MIT000

paddlespeech_tts_cpp

PaddleSpeech TTS cpp

000

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonMIT000

SNAC

Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech

Language:PythonMIT000