xingxy's repositories

fastcws

轻量级高性能中文分词项目

Language:C++License:BSD-2-ClauseStargazers:1Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

License:Apache-2.0Stargazers:0Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

License:MITStargazers:0Issues:0Issues:0

apachecn-dl-zh

ApacheCN 深度学习译文集

License:NOASSERTIONStargazers:0Issues:0Issues:0

Bert-vits2-V2.3

Bert-vits2-V2.3 训练和推理

Stargazers:0Issues:0Issues:0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

License:Apache-2.0Stargazers:0Issues:0Issues:0

encodecmae

音频编解码

Stargazers:0Issues:0Issues:0

g2p-zh-en

Chinese and English Bilinguish G2P

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

License:Apache-2.0Stargazers:0Issues:0Issues:0

malaya-speech

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

paddlespeech_tts_cpp

PaddleSpeech TTS cpp

Stargazers:0Issues:0Issues:0

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SNAC

Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SoundStorm-pytorch

Google's SoundStorm: Efficient Parallel Audio Generation

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

License:Apache-2.0Stargazers:0Issues:0Issues:0

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

ttts

Train the next generation of TTS systems.

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vampnet

music generation with masked transformers!

License:MITStargazers:0Issues:0Issues:0

vits_chinese

Best TTS based on BERT and VITS with some Natural Speech Features Of Microsoft

Language:PythonLicense:MITStargazers:0Issues:0Issues:0