Trần Cao Sơn's repositories
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
panpp
Scene Text Detection
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
DSRNet
Image super-resolution via dynamic network (CAAI Transactions on Intelligence Technology, 2023)
synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Vietnamese-Handwritten-Text-Recognition
By team BK.BigHand
sonchuate
Config files for my GitHub profile.