There are 3 repositories under the librispeech topic.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.
Voice Activity Detection based on Deep Learning & TensorFlow
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
PyTorch and TensorFlow data loaders for several audio datasets
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Python implementation of pre-processing for End-to-End speech recognition
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
End-to-End Speech Recognition Using TensorFlow
End-to-End Speech Recognition using Neural Networks.
PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
Automatic speech recognition using neural networks
Speech Recognition Using TensorFlow
Baselines for the Zero Resource Speech Challenge using Visually Grounded Models of Spoken Language, 2021 edition
Some approaches based on deep learning to build the acoustic model for an end-to-end automatic speech recognition (ASR) pipeline.
A web-app/library for transcribing speech
Scripts to generate the reverberant LibriCHiME-5 dataset.
Few-shot learning experiments mostly on speaker recognition.
Speaker gender classification on the LibriSpeech dataset
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
Replication of Jasper speech-to-text network using Intel optimized TensorFlow.
A simple CRDNN-based ASR model, built for my own understanding of how ASR systems work and are trained. (Work in progress.) If anyone finds an error or has a suggestion, please let me know.
Implementation of the Deep Speech 2 paper with BiGRU and BiLSTM, using the LibriSpeech dataset
An end-to-end speech recognition engine similar to DeepSpeech2
This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features
End-to-End Automatic Speech Recognition on LibriSpeech: PyTorch implementation
Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline
A LibriSpeech-style project that does not use the datasets available on the internet. Instead, you create your own audio files and use them as input to produce text as output. Because the datasets available online have problems, this project comes in handy for people interested in this task.
PyTorch implementation of Transformer-based Automatic Speech Recognition with attention mechanisms, SpecAugment, CTC loss, and mixed precision training. Achieves competitive WER/CER on LibriSpeech.
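Several of the repositories above report WER (word error rate) and CER (character error rate) on LibriSpeech. As a rough illustration of what those numbers measure, here is a minimal sketch that computes both from a Levenshtein edit distance. The function names `edit_distance`, `wer`, and `cer` are hypothetical and not taken from any listed repository, and the sketch assumes plain whitespace tokenization with no text normalization, which real evaluation scripts usually apply.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the first i-1 items of ref and the first j items of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from ref
                            curr[j - 1] + 1,      # insertion into ref
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance divided by the reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must contain at least one character")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    print(wer("the cat sat on the mat", "the cat sit on mat"))
    print(cer("abc", "abd"))
```

Because insertions count as errors, both rates can exceed 1.0 when the hypothesis is much longer than the reference.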