Jason's Lab's repositories
PaddleSpeech
An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
hugo
The world’s fastest framework for building websites.
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
rnnoise
Recurrent neural network for audio noise reduction
deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
ASR-decoder
it's ASR decoder and make graph project
kenlm
KenLM: Faster and Smaller Language Model Queries
bazel
a fast, scalable, multi-language and extensible build system
k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
warp-transducer
A fast parallel implementation of RNN Transducer.
How-To-Ask-Questions-The-Smart-Way
本文原文由知名 Hacker Eric S. Raymond 所撰寫,教你如何正確的提出技術問題並獲得你滿意的答案。
neural_sp
End-to-end ASR/LM implementation with PyTorch
espnet
End-to-End Speech Processing Toolkit
DeepSpeech
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
kaldi
This is the official location of the Kaldi project.
avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
Discriminative-Multi-modality-Speech-Recognition
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
Python
All Algorithms implemented in Python
Speech-Emotion-Recognition
Speech emotion recognition using LSTM, SVM and MLP, implemented in Keras | 语音情感识别
masr
中文语音识别; Mandarin Automatic Speech Recognition;
Awesome
:computer: 🎉 An awesome & curated list of best applications and tools for Windows.
Job_test
研究生毕业秋招期间的笔试题目
lip-reading-deeplearning
:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures