yanchaomars

followers

following

stars

Chao Yan's repositories

algo

数据结构和算法必知必会的50个代码实现

Language:CApache-2.0000

ASR_WORD

采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。

Language:PythonAGPL-3.0000

athena-signal

Apache-2.0000

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache-2.0000

Awesome-Interview

Collection of awesome interview references.

000

awesome-kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

MIT000

Coursera-ML-AndrewNg-Notes

吴恩达老师的机器学习课程个人笔记

Language:HTML000

deeplearning_ai_books

deeplearning.ai（吴恩达老师的深度学习课程笔记及资源）

Language:HTML000

dscore

Diarization scoring tools.

BSD-2-Clause000

grpc-gateway

gRPC to JSON proxy generator following the gRPC HTTP spec

Language:GoBSD-3-Clause000

grpcpp-bidi-streaming

gRPC C++ bidirectional streaming example

Language:C++MIT000

insightface

Face Analysis Project on MXNet

MIT000

jsalt2019-diadet

Repository of recipes for the JSALT2019 workshop on "Speaker Detection in Adverse Scenarios with a Single Microphone"

Apache-2.0000

kaldi-decoders

Custom decoders for Kaldi

Language:C++MIT000

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）

Language:Java000

night-reading-go

Night-Reading-Go《Go 夜读》 > Share the related technical topics of Go every week through zoom online live broadcast, every day on the WeChat/Slack to communicate programming technology topics. 每周通过 zoom 在线直播的方式分享 Go 相关的技术话题，每天大家在微信/Slack 上及时沟通交流编程技术话题。

MIT000

pansori

Tools for ASR Corpus Generation from Online Video

Language:PythonMIT000

pychain

PyTorch implementation of LF-MMI for End-to-end ASR

000

ReplayGainAnalysis

ReplayGainAnalysis - analyzes input samples and give the recommended dB change

Language:C000

rnnoise

Recurrent neural network for audio noise reduction

Language:CBSD-3-Clause000

RPNSD

PyTorch implementation of RPNSD

Language:PythonMIT010

shendusuipian

To know stats by heart

Language:HTML000

speaker-embedding-with-phonetic-information

The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"

000

sphereface

Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.

MIT000

sphereface-plus

SphereFace+ Implementation for <Learning towards Minimum Hyperspherical Energy> in NIPS'18.

MIT000

tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language:PythonApache-2.0000

VBDiarization

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

Apache-2.0000

VBx

Variational Bayes HMM over x-vectors diarization on DIHARD II

000

wav2letter

Facebook AI Research Automatic Speech Recognition Toolkit

Language:C++NOASSERTION000

WebRTC_VAD

Voice Activity Detector Module Port From WebRTC

BSD-3-Clause000