Đỗ Trí Nhân's repositories

MOS-TTS-Evaluate

Website for evaluation speech synthesis system with MOS

Language:CSSStargazers:8Issues:1Issues:0

NormEspeak

G2P from Espeak: Combine Vinorm and Espeak to normalize text then convert Graphoneme to IPA

Language:PythonStargazers:5Issues:1Issues:0

Vi_G2P

grapheme-to-phoneme method, converts any Vietnamese word from grapheme-based into a phoneme-based pronunciation that integrates tone information. It is usefull to create a lexicon for deverloping a Vi LVCSR system.

Language:TclStargazers:5Issues:1Issues:0

ViTacotron2

Vietnamese Speech Synthesis with End-to-End Model and Text Normalization: 2020 7th NAFOSTED Conference on Information and Computer Science

VietnameseSpeedySpeech

VNUHCM-US_CONF_2020: SPEEDYSPEECH MODEL WITH NORMALIZATION FOR FASTER VIETNAMESE SPEECH SYNTHESIS

MediaEval2020

HCMUS at MediaEval 2020: Emotion Classification Using Wavenet Features with SpecAugment and EfficientNet

Language:PythonStargazers:2Issues:1Issues:0
Language:Jupyter NotebookStargazers:2Issues:1Issues:0

NLPSample

Source-based for two NLP problems: Sentiment Analysis and Sequence Labeling, using BERT, CNN, RNNs, Attention and its variants.

Language:Jupyter NotebookStargazers:2Issues:1Issues:0

ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

Language:PythonStargazers:1Issues:0Issues:0

flowtron

Auto-regressive flow-based generative network for text to speech synthesis

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

FrameworkSetup

"The mother of all demo apps" — Exemplary fullstack Medium.com clone powered by React, Angular, Node, Django, and many more 🏅

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

PythonAlgorithm

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

SentimentClassification

Django Project to demo Sentiment Classification using: SVM, Naive Bayes, Xgboost, Decision Tree

Language:JavaScriptStargazers:1Issues:1Issues:0

SpeechSynthesisSurvey

List of speech synthesis papers.

License:MITStargazers:1Issues:0Issues:0

stanford-cs-229-machine-learning

VIP cheatsheets for Stanford's CS 229 Machine Learning

License:MITStargazers:1Issues:0Issues:0

ViProsodic

Prosodic is a metrical-phonological parser written in Python, this repo is clone version backup from Prosodic in Pypi, handle English case which mix code in Vietnamese phonetic processing

Language:PythonStargazers:1Issues:1Issues:0

VoiceCloneSystem

Pipeline Voice Cloning System: Tacotron - Waveglow - Verification - AutoVC - Wavenet

Language:Jupyter NotebookStargazers:1Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

awesome-audio-visualization

A curated list about Audio Visualization.

Language:ShellStargazers:0Issues:0Issues:0

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

g2pK

g2pK: g2p module for Korean

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Phonetisaurus

Phonetisaurus G2P

Language:ShellLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Self_Driving_Car_specialization

Assignments and notes for the Self Driving Cars course offered by University of Toronto on Coursera

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:0Issues:0Issues:0

TTS_TFLite

This repository is a collection of TTS Models in TFLite

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

webMUSHRA

a MUSHRA compliant web audio API based experiment software

Language:JavaScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0