Đỗ Trí Nhân's repositories
MOS-TTS-Evaluate
Website for evaluation speech synthesis system with MOS
NormEspeak
G2P from Espeak: Combine Vinorm and Espeak to normalize text then convert Graphoneme to IPA
ViTacotron2
Vietnamese Speech Synthesis with End-to-End Model and Text Normalization: 2020 7th NAFOSTED Conference on Information and Computer Science
VietnameseSpeedySpeech
VNUHCM-US_CONF_2020: SPEEDYSPEECH MODEL WITH NORMALIZATION FOR FASTER VIETNAMESE SPEECH SYNTHESIS
MediaEval2020
HCMUS at MediaEval 2020: Emotion Classification Using Wavenet Features with SpecAugment and EfficientNet
ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
FrameworkSetup
"The mother of all demo apps" — Exemplary fullstack Medium.com clone powered by React, Angular, Node, Django, and many more 🏅
PythonAlgorithm
All Algorithms implemented in Python
SentimentClassification
Django Project to demo Sentiment Classification using: SVM, Naive Bayes, Xgboost, Decision Tree
SpeechSynthesisSurvey
List of speech synthesis papers.
stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
ViProsodic
Prosodic is a metrical-phonological parser written in Python, this repo is clone version backup from Prosodic in Pypi, handle English case which mix code in Vietnamese phonetic processing
VoiceCloneSystem
Pipeline Voice Cloning System: Tacotron - Waveglow - Verification - AutoVC - Wavenet
awesome-audio-visualization
A curated list about Audio Visualization.
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
g2pK
g2pK: g2p module for Korean
Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Phonetisaurus
Phonetisaurus G2P
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Self_Driving_Car_specialization
Assignments and notes for the Self Driving Cars course offered by University of Toronto on Coursera
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
TTS_TFLite
This repository is a collection of TTS Models in TFLite
webMUSHRA
a MUSHRA compliant web audio API based experiment software