non-autoregressive

There are 1 repository under non-autoregressive topic.

lucidrains / soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
artificial-intelligence attention-mechanism audio-generation deep-learning non-autoregressive transformers
Language:Python 1114
Matcha-TTS
shivammehta25 / Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
deep-learning diffusion-model diffusion-models flow-matching machine-learning non-autoregressive probabilistic probabilistic-machine-learning text-to-speech tts tts-api tts-engines
Language:Jupyter Notebook 381
keonlee9420 / PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
text-to-speech normalizing-flows generative-model deep-neural-networks pytorch tts speech-synthesis neural-tts non-autoregressive portable-tts vae fastspeech hifi-gan non-ar mel-gan high-quality
Language:Python 330
keonlee9420 / Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
text-to-speech supervised unsupervised non-autoregressive non-ar multi-speaker ultimate-tts tts pytorch comprehensive single-speaker fastspeech transformer neural-tts fastspeech2 hifi-gan mel-gan sota speech-synthesis deep-learning
Language:Python 313
keonlee9420 / DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
text-to-speech deep-neural-networks pytorch tts speech-synthesis generative-model ddpm diffusion neural-tts non-autoregressive diffspeech diffgan-tts gan non-ar hifi-gan diffusion-models fastspeech multi-speaker-tts single-speaker-tts
Language:Python 293
keonlee9420 / Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
text-to-speech tts speech-synthesis non-autoregressive emotional-tts emotional-speech-synthesis expressive-tts expressive-speech-synthesis korean-speech-synthesis korean-tts conversational-tts conversational-speech-synthesis
Language:Python 258
keonlee9420 / DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
text-to-speech diffusion ddpm pytorch singing-voice tts speech-synthesis english diffusion-models neural-tts non-autoregressive fastspeech diffsinger
Language:Python 224
keonlee9420 / Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
neural-tts non-autoregressive vae self-attention duration parallel-tacotron parallel-tacotron2 speech-synthesis pytorch tts text-to-speech english fastspeech
Language:Python 186
keonlee9420 / StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
text-to-speech pytorch tts speech-synthesis english style speech-style prosody neural-tts non-autoregressive fastspeech stylespeech meta-learning speaker speaker-adaptation meta-stylespeech unseen-speaker one-shot
Language:Python 179
keonlee9420 / DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)
conversational-ai conversational-data conversational-tts dataset non-autoregressive pytorch speech-synthesis text-to-speech tts tts-dataset
Language:Python 177
keonlee9420 / Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
emotion-transfer cross-speaker global-style-tokens conditional-layer-normalization text-to-speech deep-neural-networks pytorch tts speech-synthesis generative-model parallel-tacotron neural-tts non-ar non-autoregressive semi-supervised-learning
Language:Python 170
xcfcode / What-I-Have-Read
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
nlp summarization acl emnlp aaai naacl slides presentation gnn knowledge-distillation pretrain gan non-autoregressive generation graph-neural-networks notes presentations data-augmentation meta-learning conversation
162
keonlee9420 / Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
deep-learning end-to-end fastspeech2 hifi-gan jets multi-speaker neural-tts non-ar non-autoregressive pytorch single-speaker sota speech-synthesis text-to-speech text-to-wav tts ultimate-tts unsupervised
Language:Python 140
HKUNLP / reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
diffusion-models fairseq language-model machine-learning natural-language-processing non-autoregressive python3 pytorch text-generation
Language:Python 84
keonlee9420 / FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
text-to-speech end-to-end neural-tts pytorch tts speech-synthesis pitch timbre fastpitch fastspeech non-autoregressive pitch-control
Language:Python 70
keonlee9420 / VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
vae glow transforer non-autoregressive tts text-to-speech duration pytorch speech-synthesis self-attention neural-tts non-ar unsupervised-learning unsupervised-duration
Language:Python 70
bearcatt / LaBERT
A length-controllable and non-autoregressive image captioning model.
controllable-image-captioning eccv2020 image-captioning non-autoregressive
Language:Python 67
keonlee9420 / WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
text-to-speech phoneme-to-waveform neural-tts audio synthesis non-autoregressive score-matching duration robust pytorch tts speech-synthesis text-to-audio end-to-end
Language:Python 66
keonlee9420 / Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
text-to-speech style pytorch tts speech-synthesis english speaker prosody prosody-transfer gaussian-upsampling neural-tts non-autoregressive
Language:Python 54
HKUNLP / diffusion-of-thoughts
Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
chain-of-thought-reasoning diffusion-models machine-learning mathematical-reasoning natural-language-processing non-autoregressive pytorch text-generation diffusion-lm
Language:Python 44
henry-yeh / GLOP
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
autoregressive-neural-networks capacitated-vehicle-routing-problem deep-reinforcement-learning divide-and-conquer graph-neural-networks hierarchical-reinforcement-learning neural-combinatorial-optimization non-autoregressive prize-collecting-travelling-salesman-problem reinforcement-learning transformer travelling-salesman-problem vehicle-routing-problem
Language:Python 41
hemingkx / SpecDec
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
non-autoregressive speculative-decoding
Language:Python 23
yzhangcs / ctc-copy
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
ctc non-autoregressive text-editing
Language:Python 17
keonlee9420 / Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
text-to-speech pytorch tts speech-synthesis deep-learning fastspeech non-autoregressive neural-tts template
Language:Python 14
kan-bayashi / NonARSeq2SeqVC
Non-autoregressive sequence-to-sequence voice conversion
end-to-end non-autoregressive voice-conversion
6
RistoAle97 / ContinualNAT
M.Sc. thesis on Continual Learning for Non-Autoregressive Neural Machine Translation
continual-learning experience-replay nat natural-language-processing neural-machine-translation nlp nmt non-autoregressive non-autoregressive-translation
Language:Python 6
LARC-CMU-SMU / Enconter
Implementation of 2021 EACL paper Enconter
nlg language-model non-autoregressive
Language:Jupyter Notebook 2
aistairc / BERT-NAR-BERT
BERT-based pre-trained non-autoregressive sequence-to-sequence model
bert language-modelling machine-translation natural-language-processing non-autoregressive question-answering sequence-to-sequence summarization
Language:Python 1