lzc (lzcsjtu)

lzcsjtu

Geek Repo

Github PK Tool:Github PK Tool

lzc's starred repositories

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:4649Issues:0Issues:0

VISinger2

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Language:PythonStargazers:301Issues:0Issues:0

avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Language:PythonLicense:NOASSERTIONStargazers:150Issues:0Issues:0

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Language:PythonStargazers:452Issues:0Issues:0
Language:PythonLicense:MITStargazers:69Issues:0Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1239Issues:0Issues:0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9904Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:51913Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1940Issues:0Issues:0

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language:Jupyter NotebookLicense:MITStargazers:3591Issues:0Issues:0

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language:PythonLicense:MITStargazers:303Issues:0Issues:0

Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Language:PythonStargazers:142Issues:0Issues:0

Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Language:PythonLicense:MITStargazers:317Issues:0Issues:0

Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2160Issues:0Issues:0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonLicense:MITStargazers:2920Issues:0Issues:0

CDFSE_FastSpeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”

Language:PythonLicense:MITStargazers:78Issues:0Issues:0

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Language:PythonLicense:MITStargazers:237Issues:0Issues:0

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Language:PythonLicense:MITStargazers:186Issues:0Issues:0

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Language:PythonStargazers:185Issues:0Issues:0

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Language:PythonLicense:MITStargazers:177Issues:0Issues:0

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language:PythonLicense:MITStargazers:328Issues:0Issues:0

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language:PythonLicense:MITStargazers:115Issues:0Issues:0

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language:PythonLicense:Apache-2.0Stargazers:215Issues:0Issues:0

VI-SVS

Singing Voice Synthesis based on VITS, different from VISinger

Language:PythonLicense:Apache-2.0Stargazers:182Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12479Issues:0Issues:0

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language:JavaScriptLicense:MITStargazers:1204Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

hello-algorithm

🌍 针对小白的算法训练 | 包括四部分:①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图(项目花了上百小时,希望可以点 star 支持,🌹感谢~)推荐免费ChatGPT使用网站

Language:JavaStargazers:34978Issues:0Issues:0

SortingNetwork

Implement a bitonic sorting network on FPGA

Language:VerilogLicense:Apache-2.0Stargazers:36Issues:0Issues:0