GRU's repositories
KWS
基于HTK工具箱的音频检索系统
Project
Speaker Verification
CASApythonPort
Python code that implements the DUET blind source separation algorithm. Converted from the MATLAB code from here - https://github.com/yvesx/casa495
voiceenhance
voice enhancement algorithm
AudioMLProject1
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a classifier on this dataset for distinguishing voiced from non-voiced sections, a task called voice activity detection, VAD for short. This, of course, requires a ground truth in terms of VAD annotations.
cvaf
Use computer vision techniques for fingerprinting.
autoencoder
Use denoising auto-encoder to extracting fingerprints.
kaldi-lstm
C++ implementation of LSTM (Long Short Term Memory), in Kaldi's nnet1 framework. Used for automatic speech recognition, possibly language modeling etc, the training can be switched between CPU and GPU(CUDA). This repo is now merged into official Kaldi codebase(Karel's setup), so this repo is no longer maintained, please check out the Kaldi project instead.
fingerprint
Baseline method from Haitsma et al. for fingerprinting.
STRAIGHT
This is a speech analysis, modification and synthesis system
opendcd.github.io
Open Source WFST-based Decoder Toolkit
py-arpa-lm
Python API for reading and querying ARPA formatted language models.
wfst-lm-decoder
wfst-based language model decoder
Tether-iOS
Tethering for non-jailbroken iOS Devices over USB
pfp
Pretty fast parser for probabilistic context free grammars
waveprint
My implementation of Waveprint, an algorithm for audio searching
gmm
Gaussian Mixture Models in Python
half_float
C++ implementation of a 16 bit floating-point type mimicking most of the IEEE 754 behaviour. Compatible with the half data type used as texture format by OpenGl/Direct3D.
adbputty
Putty enhanced with the ability to connect to Android Debug Bridge