wantongtang

Richard M Wan's starred repositories

python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Language: Python | License: MIT | 235300

ninja

A small build system with a focus on speed

Language: C++ | License: Apache-2.0 | 1084400

TensorSlow

Re-implementation of TensorFlow in pure Python, with an emphasis on code understandability

Language: Jupyter Notebook | 67900

wakeword

A library to detect wake words and aid in responding to them

Language: JavaScript | 200

c_speech_features

A port of python_speech_features to C.

Language: C | License: NOASSERTION | 4600

Chimay-Red

Working POC of Mikrotik exploit from Vault 7 CIA Leaks

Language: Python | 200

Chimay-Red

Working POC of Mikrotik exploit from Vault 7 CIA Leaks

Language: Python | 64600

tensorflow-cmake

TensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake

Language: CMake | License: Apache-2.0 | 44100

Domain_QA

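A restricted-domain question answering system comprising automatic knowledge base construction, question retrieval, and a question answering system built on the WeChat platform. All code in this project is open source. With simple configuration, users can quickly and automatically build a fairly complete domain knowledge base. For how to configure and set up the question answering system on the WeChat platform, see readme.txt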

Language: Java | 7200

Domain_QA

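A restricted-domain question answering system comprising automatic knowledge base construction, question retrieval, and a question answering system built on the WeChat platform. All code in this project is open source. With simple configuration, users can quickly and automatically build a fairly complete domain knowledge base. For how to configure and set up the question answering system on the WeChat platform, see readme.txt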

Language: Java | 100

chime5

Download URL for the ASR transcripts and the lattices

300

chime5-synchronisation

CHiME-5 Baseline Array Synchronisation

Language: Python | License: MIT | 1100

ASR-module

Language: C | 200

RokidPhone

Rokid intelligent speech recognition demo (Android Studio project), running on the Android 6.0 platform

Language: Java | 500

silk-v3-decoder

[Skype Silk Codec SDK] Decode Silk v3 audio files (such as WeChat amr and aud files, and QQ slk files) and convert them to other formats (such as mp3). Batch conversion is supported.

Language: C | License: MIT | 258700

MTTS

A demo of a Mandarin/Chinese TTS frontend

Language: Python | License: MIT | 27300

make-a-smart-speaker

A collection of resources to make a smart speaker

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.

Language: Jupyter Notebook | License: GPL-3.0 | 5800

kws

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

Language: Python | License: MIT | 36900

pykaldi

A Python wrapper for Kaldi

Language: Python | License: Apache-2.0 | 98600

RecordRTC

RecordRTC is a WebRTC JavaScript library for recording audio, video, and screen activity. It supports Chrome, Firefox, Opera, Android, and Microsoft Edge. Platforms: Linux, Mac and Windows.

Language: JavaScript | License: MIT | 648200

DeepSpeaker-pytorch

Speaker embedding (verification and recognition) using PyTorch

Language: Python | License: MIT | 36300

vocore2

VoCore2 firmware drivers

Language: C | 9500

PyBaiduYuyin

This project has been deprecated

Language: Python | License: MIT | 7700

mic_array

DOA, VAD and KWS for ReSpeaker Microphone Array

Language: Python | License: Apache-2.0 | 28400

UrbanSound8K-JAMS

JAMS annotation files for the original and augmented UrbanSound8K dataset

3500

muda

A library for augmenting annotated audio data

Language: Python | License: ISC | 23000

audio-classifier-keras-cnn

Audio classifier in Keras using a convolutional neural network

Language: Python | License: MIT | 15800

extreme-sound-stretch

Stretch any audio to extreme lengths

Language: Python | License: MIT | 600

pydub

Manipulate audio with a simple and easy high-level interface

Language: Python | License: MIT | 862500