Repositories under the fastspeech2 topic:
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-time state-of-the-art speech synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German; easy to adapt to other languages)
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
Chinese Mandarin text-to-speech using FastSpeech 2, implemented in PyTorch, with WaveGlow as the vocoder, trained on the BiaoBei and AISHELL-3 datasets
A non-autoregressive Transformer-based text-to-speech model, supporting a family of SOTA Transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming toward the ultimate TTS
Trained further on the BiaoBei dataset, with improvements to the original FastSpeech2 model: prosody representations and a prosody prediction module are introduced to make Mandarin speech more vivid and rhythmic
PyTorch implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
AdaSpeech: Adaptive Text to Speech for Custom Voice
A non-autoregressive end-to-end text-to-speech model (text-to-wav), supporting a family of SOTA unsupervised duration modeling approaches. This project grows with the research community, aiming toward the ultimate E2E-TTS
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
An implementation of FastSpeech2 based on PyTorch.
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
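Most of the repositories above share the same two-stage inference pattern: an acoustic model (e.g. FastSpeech2) maps text or phoneme IDs to a mel spectrogram, and a neural vocoder (e.g. HiFi-GAN) turns the spectrogram into a waveform. A minimal sketch of that pattern, using stand-in functions rather than any repository's actual API:

```python
import numpy as np

# Stand-in acoustic model: phoneme IDs -> mel spectrogram.
# A real FastSpeech2 also predicts per-phoneme duration, pitch,
# and energy; here each phoneme is expanded to a fixed 5 frames.
def acoustic_model(phoneme_ids, n_mels=80, frames_per_phoneme=5):
    n_frames = len(phoneme_ids) * frames_per_phoneme
    rng = np.random.default_rng(0)
    return rng.standard_normal((n_mels, n_frames))

# Stand-in vocoder: mel spectrogram -> waveform.
# HiFi-GAN upsamples each mel frame to hop_length audio samples.
def vocoder(mel, hop_length=256):
    n_frames = mel.shape[1]
    return np.zeros(n_frames * hop_length, dtype=np.float32)

phonemes = [12, 7, 33, 4]       # toy phoneme ID sequence
mel = acoustic_model(phonemes)  # shape (80, 20)
audio = vocoder(mel)            # shape (20 * 256,) = (5120,)
```

The key design point is the interface between the stages: as long as the vocoder accepts mel spectrograms with the same number of bins and hop length the acoustic model was trained with, the two components can be swapped independently (WaveGlow, HiFi-GAN, and Parallel WaveGAN all fill the vocoder slot in the repositories listed here).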
Refactored version of https://github.com/ming024/FastSpeech2
A TensorFlow implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
This repository contains the code for the main part of my master's thesis at Politecnico di Torino in Data Science & Engineering
Multi-speaker FastSpeech2 applicable to Korean, with detailed descriptions of training and synthesis.
An Android application that allows visually impaired people to hear which bus lines are passing next to them.
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
Aligning latent space of speaking style with human perception using a re-embedding strategy
An Android application that acts as a speaking assistant for hearing-impaired people.
This repository accompanies my MSc thesis for the Voice Technology degree, storing all referenced data and other relevant resources.
Created as part of the project "Speech Technologies in Indian Languages". About Indic TTS: a project developing text-to-speech (TTS) synthesis systems for Indian languages, improving synthesis quality, and building small-footprint TTS integrated with disability aids and other applications.
Convert images to audio using ViT, GPT, and FastSpeech