tuanio

Nguyễn Văn Anh Tuấn's repositories

noisy-student-training-asr

Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

Language:Python78 2 5

image2latex

Image to Latex using Encoder-Decoder architecture

Language:Python11 1 1

create-visa-account

Language:Jupyter Notebook4 10

tuanio.github.io

This is an academic blog

Language:Jupyter NotebookMIT3 20

whisper-ctc

Whisper Encoder (extracted from pretrained) with a Linear on top and solve using CTC criterion

Language:Python3 10

custom-chatbot-ui

Language:TypeScriptMIT2 10

ling-wav2vec2

Official implementation of LingWav2Vec2: Linguistic-augmented Wav2Vec2 for Mispronunciation Detection

Language:Python2 10

single-or-multiple-speakers-detection

2 10

tuanio

2 10

audio-llm-prefix-tuning

Language:Python1 10

Discriminator-Constrained-Optimal-Transport-Network

Language:Python100

gradio-chat-rag

Language:Python1 10

setup-vicuna

Language:Shell1 10

VisionLLM-Vietnamese

Language:Python1 10

diffusion-work

010

EfficientConformer-Edit

[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Language:PythonApache-2.0000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

Language:PythonApache-2.0000

gan_workspace_for_speech

Language:Jupyter NotebookNOASSERTION000

GANSpeechAugment

Language:Python000

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT000

LaVy-revised

Pioneering in Vietnamese Multimodal Large Language Model

Language:Python000

llava-working-space

Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.

Language:PythonApache-2.0000

MaskCycleGAN-VC

Fork from https://github.com/GANtastic3/MaskCycleGAN-VC

Language:PythonMIT010

MaskCycleGAN-VC-pytorch-lightning

Language:Python020

melgan

Fork from melgan-neurips

Language:PythonMIT020

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.0000

simulate-noisy-speech-from-clean

Language:Python010

SpeechAttentionGAN

Language:Python010

SpeechUVCGANv2

Rethinking CycleGAN: Improving Quality of GANs for Unpaired Image-to-Image Translation

Language:PythonNOASSERTION000