timit

There are 2 repositories under timit topic.

pytorch-kaldi
mravanelli / pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
speech-recognition gru dnn kaldi rnn-model pytorch timit deep-learning deep-neural-networks recurrent-neural-networks multilayer-perceptron-network lstm lstm-neural-networks speech asr rnn dnn-hmm
Language:Python 2391
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Language:Python 1195
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speech-emotion-recognition speechrecognition speech-recognizer deeplearning neural-network neural-networks beamforming timit librispeech speech-analysis speech-api
Language:HTML 371
hirofumi0810 / tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
speech-recognition ctc tensorflow timit csj timit-dataset attention-mechanism automatic-speech-recognition asr librispeech end-to-end end-to-end-learning speech-to-text joint-ctc-attention beam-search
Language:Python 315
philipperemy / timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
darpa speech timit timit-dataset
308
Diamondfan / CTC_pytorch
CTC end -to-end ASR for timit and 863 corpus.
ctc pytorch timit kaldi decoder
Language:Python 220
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
end-to-end asr transducers mxnet rnn-transducer timit speech-recognition sequence-transduction rnnt-joint rnnt-model
Language:Python 139
WindQAQ / listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.
listen-attend-and-spell speech-recognition tensorflow timit seq2seq speech-to-text
Language:Python 89
grausof / keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
deep-learning audio waveform filtering cnn convolutional-neural-networks speaker-recognition speaker-verification speech-recognition asr audio-processing speech-processing digital-signal-processing neural-network machine-learning artificial-intelligence timit tensorflow keras
Language:Python 72
hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
speech-recognition ctc attention-mechanism timit timit-dataset switchboard csj automatic-speech-recognition librispeech end-to-end transcription preprocessing dataset
Language:Python 69
matthijsvk / TIMITspeech
Speech recognition on the TIMIT (or any other) dataset
speech timit neural-network theano phonemes speech-recognition
Language:Python 42
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit
Language:Perl 38
AppleHolic / PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
pytorch paper timit speechrecognition minimalgru cbhg
Language:Python 35
mravanelli / theano-kaldi-rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit
Language:Perl 33
zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
automatic-speech-recognition tensorflow ctc lstm deep-learning timit rnn acoustic-model
Language:Python 19
biyoml / PyTorch-End-to-End-ASR-on-TIMIT
Attention-based end-to-end ASR on TIMIT in PyTorch
end-to-end asr timit pytorch attention-seq2seq
Language:Python 17
orbxball / timit-preprocessor
Extract mfcc vectors and phones from TIMIT dataset
timit-dataset timit data-preprocessing mfcc phone deep-learning speech-recognition
Language:Shell 16
anicolson / SPN-ASI
Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.
spn-speaker-model robust-speaker-recognition robustness sum-product-networks deep-xi ideal-binary-mask timit timit-dataset robust-speaker-identification speaker-identification speaker-verification robust-speaker-verification missing-data missing-feature-theory marginalisation marginalization
Language:Python 11
colinator / timit_utils
Python/numpy/pandas convenience wrapper for the TIMIT database.
timit timit-database phonemes audio python phoneme-transcriptions timit-utils transcription audio-recordings
Language:Jupyter Notebook 11
drkostas / bench-utils
A collection of benchmarking tools.
benchmark timit benchmarking timer
Language:Python 11
dingzeyuli / SpEAR-speech-database
A database of clean and noisy speech for audio research
speech dataset timit audio waveform
10
WindQAQ / tensorflow-wavenet
Implementation of WaveNet network based on Tensorflow.
tensorflow wavenet speech-recognition speech-to-text timit
Language:Python 9
KrishnaDN / LAS-Pytorch
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
speech-re speech-to-text listen-attend-and-spell seq2seq-model timit asr-model asr
Language:Python 7
jackyzha0 / Speech2Braille
[🏆 Silver Medal at CWSF] Tensorflow Implementation of TIMIT Deep BLSTM-CTC with Tensorboard Support
braille raspberry-pi tensorflow blstm ctc blstm-ctc timit
Language:Python 6
HanSeokhyeon / Speech_recognition_for_English_and_Korean
다양한 feature를 이용한 음성인식 LAS model입니다. (한국어는 개발예정)
timit phoneme mfcc las
Language:Python 4
BradleyHe / TIMIT-Alignment
TIMIT forced alignment with the Montreal Forced Aligner
timit
Language:Python 2
BradleyHe / TIMIT-Phoneme-Mixer
Python project that mixes phonemes from the TIMIT dataset
timit
Language:Python 2
hammaad2002 / SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
asr asr-model librispeech pytorch pytorch-implementation pytorch-tutorial speech-recognition supervised-learning timit timit-dataset crdnn
Language:Jupyter Notebook 2
kipmccharen / sys6016_DL_project
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
timit speechbrain per apr wav2vec2
Language:Python 2
BradleyHe / TIMIT-Voice-Mixer
Python project which mixes and tests sentences from the TIMIT dataset using LAS
timit
Language:Python 1
freha-mezzoudj / Magister_works1
My magister (Bac+5+2) topic is about the Timit phonems multi_classification using GA and SVM. My works are presented here to help the research community, thanks !
svm ga speech timit feature mfcc
1
AntonDemchenko / voiceprint_maker
deeplearning audio voice speaker-identification keras-tensorflow timit
Language:Python 0
benivalotker / benchmarking_and_profiling
simple use for benchmarking and profiling module
python cprofile timit benchmark performance
Language:Python 0

timit

mravanelli / pytorch-kaldi

mravanelli / SincNet

speechbrain / speechbrain.github.io

hirofumi0810 / tensorflow_end2end_speech_recognition

philipperemy / timit

Diamondfan / CTC_pytorch

HawkAaron / RNN-Transducer

WindQAQ / listen-attend-and-spell

grausof / keras-sincnet

hirofumi0810 / asr_preprocessing

matthijsvk / TIMITspeech

mravanelli / pytorch_MLP_for_ASR

AppleHolic / PytorchSR

mravanelli / theano-kaldi-rnn

zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models

biyoml / PyTorch-End-to-End-ASR-on-TIMIT

orbxball / timit-preprocessor

anicolson / SPN-ASI

colinator / timit_utils

drkostas / bench-utils

dingzeyuli / SpEAR-speech-database

WindQAQ / tensorflow-wavenet

KrishnaDN / LAS-Pytorch

jackyzha0 / Speech2Braille

HanSeokhyeon / Speech_recognition_for_English_and_Korean

BradleyHe / TIMIT-Alignment

BradleyHe / TIMIT-Phoneme-Mixer

hammaad2002 / SimpleASRmodel

kipmccharen / sys6016_DL_project

BradleyHe / TIMIT-Voice-Mixer

freha-mezzoudj / Magister_works1

AntonDemchenko / voiceprint_maker

benivalotker / benchmarking_and_profiling