Richard M Wan's starred repositories
python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
TensorSlow
Re-implementation of TensorFlow in pure python, with an emphasis on code understandability
c_speech_features
A port of python_speech_features to C.
Chimay-Red
Working POC of Mikrotik exploit from Vault 7 CIA Leaks
Chimay-Red
Working POC of Mikrotik exploit from Vault 7 CIA Leaks
tensorflow-cmake
TensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake
chime5-synchronisation
CHiME-5 Baseline Array Synchronisation
RokidPhone
Rokid智能语音识别Demo(AS工程),运行在Android6.0平台
silk-v3-decoder
[Skype Silk Codec SDK]Decode silk v3 audio files (like wechat amr, aud files, qq slk files) and convert to other format (like mp3). Batch conversion support.
make-a-smart-speaker
A collection of resources to make a smart speaker
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
PyBaiduYuyin
This project has been deprecated
UrbanSound8K-JAMS
JAMS annotation files for the original and augmented UrbanSound8K dataset
audio-classifier-keras-cnn
Audio Classifier in Keras using Convolutional Neural Network
extreme-sound-stretch
Stretch any audio to extreme lengths