ensky0's repositories
Autoformer
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
asrp
ASR text preprocessing utility
cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
charsiu
Charsiu: A neural phonetic aligner.
ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
iCanHazShortcut
simple shortcut manager for macOS
korean_tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
NSP-BERT
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, ClariNet, WaveNet, Deep Voice 3, Transformer TTS and FastSpeech)
Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
rust-kissfft
Rust binding for KissFFT
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
WaveRNN
Pytorch implementation of Deepmind's WaveRNN model