chazo1994's repositories
Phonetisaurus-1
Phonetisaurus G2P
project-android
My Android project
Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks, also known as the Lessac or Blizzard 2013 dataset.
gst-tacotron
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
hyde
A brazen two-column theme for Jekyll.
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
merlin
This is now the official location of the Merlin project.
NaturalSpeech2
Personal Work
phonetisaurus
Automatically exported from code.google.com/p/phonetisaurus
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Theano
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.