Suldier's repositories

prefix-beam-search

Code for prefix beam search tutorial by @labodk

Language:PythonStargazers:1Issues:1Issues:0

psola

Python package implementing the TD-PSOLA algorithm for speech processing

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

audiotsm

A python library for real-time audio time-scale modification procedures

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

caffe-cvprw15

:heart::coffee: Deep Learning of Binary Hash Codes for Fast Image Retrieval (CVPRW15)

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

dtw

DTW (Dynamic Time Warping) python module

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

eesen

The official repository of the Eesen project

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

eesen-for-thchs30

ASR for Chinese Mandarin

Language:PerlStargazers:0Issues:0Issues:0

fastdtw

A Python implementation of FastDTW

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

graph-based-nn

Graph Convolutional Networks (GCNs)

Stargazers:0Issues:0Issues:0

improved_wgan_training

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

kaldi

This is now the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi-python

Python wrappers for Kaldi data

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

librosa

Python library for audio and music analysis

Language:PythonLicense:ISCStargazers:0Issues:0Issues:0

Low-Latency-Android-Audio-iOS-Audio-Engine

Superpowered Audio Engine for Games, Virtual Reality, Music and Interactive Audio Apps. Cross Platform on Android, iOS, Mac OSX, tvOS and Linux. Real-time, low-Latency. Free.

Language:C++Stargazers:0Issues:0Issues:0

madmom

Python audio and music signal processing library

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

merlin

This is now the official location of the Merlin project.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pinyin-data

汉字拼音数据

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

python-pinyin

汉字转拼音(pypinyin)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch-rnn-sequence-generation-classification

Lyrics and piano music generation in Pytorch

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

sparse-subspace-clustering-python

Python implementation of Sparse Subspace Clustering algorithm.

License:MITStargazers:0Issues:0Issues:0

speech-to-text-wavenet

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tacotron-1

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

wavenet

Keras WaveNet implementation

Language:PythonStargazers:0Issues:0Issues:0

WavGenSR

Waveform generator based on signal reshaping for SPSS

Language:MatlabLicense:Apache-2.0Stargazers:0Issues:0Issues:0