There are 2 repositories under timit topic.
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
SincNet is a neural architecture for efficiently processing raw audio samples.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Python implementation of pre-processing for End-to-End speech recognition
Speech recognition on the TIMIT (or any other) dataset
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Pytorch based phoneme recognition (TIMIT phoneme classification)
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
Attention-based end-to-end ASR on TIMIT in PyTorch
Extract mfcc vectors and phones from TIMIT dataset
Python/numpy/pandas convenience wrapper for the TIMIT database.
A database of clean and noisy speech for audio research
Implementation of WaveNet network based on Tensorflow.
Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch
[๐ Silver Medal at CWSF] Tensorflow Implementation of TIMIT Deep BLSTM-CTC with Tensorboard Support
๋ค์ํ feature๋ฅผ ์ด์ฉํ ์์ฑ์ธ์ LAS model์ ๋๋ค. (ํ๊ตญ์ด๋ ๊ฐ๋ฐ์์ )
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
Python project which mixes and tests sentences from the TIMIT dataset using LAS
My magister (Bac+5+2) topic is about the Timit phonems multi_classification using GA and SVM. My works are presented here to help the research community, thanks !
simple use for benchmarking and profiling module