Chao Yan's repositories
algo
数据结构和算法必知必会的50个代码实现
ASR_WORD
采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Awesome-Interview
Collection of awesome interview references.
awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Coursera-ML-AndrewNg-Notes
吴恩达老师的机器学习课程个人笔记
deeplearning_ai_books
deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)
dscore
Diarization scoring tools.
grpc-gateway
gRPC to JSON proxy generator following the gRPC HTTP spec
grpcpp-bidi-streaming
gRPC C++ bidirectional streaming example
insightface
Face Analysis Project on MXNet
jsalt2019-diadet
Repository of recipes for the JSALT2019 workshop on "Speaker Detection in Adverse Scenarios with a Single Microphone"
kaldi-decoders
Custom decoders for Kaldi
LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
night-reading-go
Night-Reading-Go《Go 夜读》 > Share the related technical topics of Go every week through zoom online live broadcast, every day on the WeChat/Slack to communicate programming technology topics. 每周通过 zoom 在线直播的方式分享 Go 相关的技术话题,每天大家在微信/Slack 上及时沟通交流编程技术话题。
pansori
Tools for ASR Corpus Generation from Online Video
pychain
PyTorch implementation of LF-MMI for End-to-end ASR
ReplayGainAnalysis
ReplayGainAnalysis - analyzes input samples and give the recommended dB change
rnnoise
Recurrent neural network for audio noise reduction
shendusuipian
To know stats by heart
speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
sphereface
Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.
sphereface-plus
SphereFace+ Implementation for <Learning towards Minimum Hyperspherical Energy> in NIPS'18.
tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
VBx
Variational Bayes HMM over x-vectors diarization on DIHARD II
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
WebRTC_VAD
Voice Activity Detector Module Port From WebRTC