Suldier's repositories
prefix-beam-search
Code for prefix beam search tutorial by @labodk
caffe-cvprw15
:heart::coffee: Deep Learning of Binary Hash Codes for Fast Image Retrieval (CVPRW15)
dtw
DTW (Dynamic Time Warping) Python module
eesen
The official repository of the Eesen project
eesen-for-thchs30
ASR for Chinese Mandarin
fastdtw
A Python implementation of FastDTW
graph-based-nn
Graph Convolutional Networks (GCNs)
improved_wgan_training
Code for reproducing experiments in "Improved Training of Wasserstein GANs"
kaldi
This is now the official location of the Kaldi project.
kaldi-python
Python wrappers for Kaldi data
librosa
Python library for audio and music analysis
Low-Latency-Android-Audio-iOS-Audio-Engine
Superpowered Audio Engine for Games, Virtual Reality, Music and Interactive Audio Apps. Cross-platform on Android, iOS, Mac OSX, tvOS and Linux. Real-time, low-latency. Free.
madmom
Python audio and music signal processing library
magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
merlin
This is now the official location of the Merlin project.
pinyin-data
Pinyin data for Chinese characters
python-pinyin
Convert Chinese characters to pinyin (pypinyin)
python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
pytorch-rnn-sequence-generation-classification
Lyrics and piano music generation in PyTorch
sparse-subspace-clustering-python
Python implementation of the Sparse Subspace Clustering algorithm.
speech-to-text-wavenet
Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow
tacotron-1
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
wavenet
Keras WaveNet implementation
WavGenSR
Waveform generator based on signal reshaping for SPSS