Beast code in Giters

lzc's starred repositories

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language: Python | License: Apache-2.0 | 4649 | 0 | 0

VISinger2

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Language: Python | 301 | 0 | 0

avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Language: Python | License: NOASSERTION | 150 | 0 | 0

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Language: Python | 452 | 0 | 0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language: Python | License: MIT | 1239 | 0 | 0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | 9904 | 0 | 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | 51913 | 0 | 0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | 1940 | 0 | 0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | 3591 | 0 | 0

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language: Python | License: MIT | 303 | 0 | 0

Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Language: Python | 142 | 0 | 0

Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language: Python | License: MIT | 317 | 0 | 0

Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

Language: Python | License: MIT | 52 | 0 | 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python | License: Apache-2.0 | 2160 | 0 | 0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language: Python | License: MIT | 2920 | 0 | 0

CDFSE_FastSpeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”

Language: Python | License: MIT | 78 | 0 | 0

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Language: Python | License: MIT | 237 | 0 | 0

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Language: Python | License: MIT | 186 | 0 | 0

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Language: Python | 185 | 0 | 0

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Language: Python | License: MIT | 177 | 0 | 0

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python | License: MIT | 328 | 0 | 0

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language: Python | License: MIT | 115 | 0 | 0

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language: Python | License: Apache-2.0 | 215 | 0 | 0

VI-SVS

Singing Voice Synthesis based on VITS, different from VISinger

Language: Python | License: Apache-2.0 | 182 | 0 | 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | 12479 | 0 | 0

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language: JavaScript | License: MIT | 1204 | 0 | 0

Coding-Offer

Language: Python | 1 | 0 | 0

hello-algorithm

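🌍 Algorithm training for beginners | Four parts: ① interview write-ups from major tech companies ② illustrated LeetCode solutions ③ a thousand open-source e-books ④ a hundred technical mind maps (the project took hundreds of hours; please support it with a star, 🌹 thanks~). Recommends free ChatGPT websites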

Language: Java | 34978 | 0 | 0

SortingNetwork

Implement a bitonic sorting network on FPGA

Language: Verilog | License: Apache-2.0 | 36 | 0 | 0
