Beast code in Giters

sunnnnnnnny's repositories

XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

MIT000

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Language:PythonMIT000

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

MIT000

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

000

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

MIT000

emospeech

Apache-2.0000

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language:PythonMIT000

ltu

Github Repo for Paper "Listen, Think, and Understand".

000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

MIT000

lora-svc

singing voice change based on whisper, and lora for singing voice clone

MIT000

CLAP

Learning audio concepts from natural language supervision

MIT000

bark

🔊 Text-Prompted Generative Audio Model

MIT000

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

MIT000

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

MIT000

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Apache-2.0000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT000

g2pW_chinese

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Apache-2.0000

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Apache-2.0000

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

MIT000

StarGAN-Voice-Conversion-2

A pytorch implementation of StarGAN-VC2

000

pytorch-StarGAN-VC

Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .

000

polyphone-g2pL

The implementation of g2pL with a new open dataset.

Apache-2.0000

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

MIT000

sunnnnnnnny

sunnnnnnnny's repositories

onnx_for_loop

XPhoneBERT

attention_onnx_exp

Tacotron2-PyTorch

fs2_mfa_phone

VALL-E-X

AcademiCodec

audiocraft

emospeech

DiffGAN-TTS

ltu

ParallelWaveGAN

lora-svc

CLAP

bark

melgan-neurips

DiffSinger

wetts

unilm

g2pW_chinese

vall-e

FastSpeech2

StarGAN-Voice-Conversion-2

pytorch-StarGAN-VC

polyphone-g2pL

INTERSPEECH-2023-Papers

TTS-TextAnalyzer

Dict-TTS-test

SpeechT5

FastSpeech_sing