Tien-Hong Lo's repositories
automated-english-transcription-grader
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions (ACL 2020)
CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
espnet
End-to-End Speech Processing Toolkit
fluency_scorer
It's my implementation for speech fluency assessment model
gop-dnn-l2arctic
Goodness of Pronunciation using Kaldi on Epa-DB database
GPU-monitor
A gpu clusters status monitoring tool online.
HeterSumGraph
Code for ACL2020 paper "Heterogeneous Graph Neural Networks for Extractive Document Summarization"
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
teinhonglo.github.io
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
voice-chatgpt
A simple demo site that implement by gradio.
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)