mravanelli

Mirco Ravanelli's repositories

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:Python2391 92 214

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language:PythonMIT1195 33 106

pySpeechRev

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Language:Python95 9 8

pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Language:Perl38 4 1

theano-kaldi-rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Language:Perl33 70

joint_training_ASR

Joint Training for (Distant) Speech Recognition

3 4 1

pretrain_speech_model

Speech Model Pre-training for End-to-End Spoken Language Understanding

Language:PythonApache-2.03 30

benchmarks

This repository contains the SpeechBrain Benchmarks

Language:PythonApache-2.0010

speechbrain-1

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0010