librispeech

There are 29 repositories under the librispeech topic.

speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speech-emotion-recognition speechrecognition speech-recognizer deeplearning neural-network neural-networks beamforming timit librispeech speech-analysis speech-api
Language:HTML 371
filippogiruzzi / voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
voice-activity-detection deep-learning speech tensorflow time-series time-series-classification resnet speech-recognition speech-detection python mfcc-features machine-learning vad deeplearning artificial-intelligence deep-neural-networks librispeech librispeech-dataset
Language:Python 369
hirofumi0810 / tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
speech-recognition ctc tensorflow timit csj timit-dataset attention-mechanism automatic-speech-recognition asr librispeech end-to-end end-to-end-learning speech-to-text joint-ctc-attention beam-search
Language:Python 315
juliagusak / dataloaders
PyTorch and TensorFlow data loaders for several audio datasets
dataloader pytorch tfrecords dataset librispeech gtzan nsynth esc audio-processing
Language:Python 112
pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
automatic-speech-recognition specaugment librispeech data-augmentation spectrogram masking
Language:Python 77
hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
speech-recognition ctc attention-mechanism timit timit-dataset switchboard csj automatic-speech-recognition librispeech end-to-end transcription preprocessing dataset
Language:Python 69
wq2012 / SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
attention-mechanism deep-learning librispeech lstm neural-network pytorch speaker-recognition speaker-recognition-systems transformer transformer-models
Language:Python 44
30stomercury / Automatic-Speech-Recognition
End-to-End Speech Recognition Using TensorFlow
speech-recognition listen-attend-and-spell tensorflow automatic-speech-recognition asr librispeech tfrecord location-aware-attention
Language:Python 42
soheil-mp / Speech-Recognition
End-to-End Speech Recognition using Neural Networks.
asr audio automatic-speech-recognition librispeech
Language:Jupyter Notebook 35
jreremy / conformer
PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.
conformer pytorch librispeech librispeech-dataset machine-learning speech-recognition asr
Language:Python 27
oleges1 / quartznet-pytorch
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
quartznet quartznet-pytorch automatic-speech-recognition asr asr-model pytorch common-voice librispeech
Language:Jupyter Notebook 27
stefanpantic / asr
Automatic speech recognition using neural networks
machine-learning automatic-speech-recognition jasper quartznet asr neural-networks python tensorflow librispeech common-voice
Language:Python 18
zssloth / TF-Speech-Recognition
Speech Recognition Using TensorFlow
speech-recognition librispeech tensorflow deep-learning neural-networks
Language:Python 13
bhigy / zr-2021vg_baseline
Baselines for the Zero-Resource Speech Challenge using Visually Grounded Models of Spoken Language, 2021 edition
challenge deep-neural-networks librispeech multimodal-learning pytorch representation-learning speech-processing spokencoco visually-grounded-speech weakly-supervised-learning
Language:Python 7
nvmoyar / aind2-speech-recognition
Some approaches based on deep learning to build the acoustic model for an end-to-end automatic speech recognition (ASR) pipeline.
asr automatic-speech-recognition asr-pipeline speech-recognition speech-recognizer librispeech acoustic-model
Language:Jupyter Notebook 6
BenAAndrew / speech-transcriber
A web-app/library for transcribing speech
cmu-sphinx librispeech silero transcription
Language:Python 5
UDASE-CHiME2023 / reverberant-LibriCHiME-5
Scripts to generate the reverberant LibriCHiME-5 dataset.
chime-5 chime-7-udase chime-challenge librispeech speech-enhancement voice-home
Language:Python 5
vjoki / fsl-experi
Few-shot learning experiments mostly on speaker recognition.
speaker-recognition speaker-identification few-shot-learning siamese-neural-network resnet34 librispeech voxceleb metric-learning
Language:Python 5
jayaneetha / GenderClassifierLibriSpeech
Gender Classification of the speaker from LibriSpeech Dataset
librispeech librispeech-dataset cnn keras lstm classification speaker-identification audio-processing
Language:Python 4
EmanuelAlogna / Gender-Classification-using-ML
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
speech-recognition speech-classification machine-learning machine-learning-algorithms logistic-regression naive-bayes perceptron svm mlp deep-learning convolutional-neural-networks librispeech librispeech-dataset k-nearest-neighbors
Language:Jupyter Notebook 3
nick-monto / Specs2text
Replication of Jasper speech-to-text network using Intel optimized TensorFlow.
librispeech jasper intel keras-api fully-convolutional-networks skip-connections residual-networks speech-to-text deep-learning deep-neural-networks tensorflow
Language:Python 3
hammaad2002 / SimpleASRmodel
A simple CRDNN-based ASR model, built for my own understanding of how ASR models work and are trained. (Work in progress.) If anyone finds an error or has a suggestion, please let me know.
asr asr-model librispeech pytorch pytorch-implementation pytorch-tutorial speech-recognition supervised-learning timit timit-dataset crdnn
Language:Jupyter Notebook 2
LuluW8071 / Deep-Speech-2
Implementation of the Deep Speech 2 paper with BiGRU and BiLSTM using the LibriSpeech dataset
asr ctc-decode deep-speech kenlm-toolkit librispeech hacktoberfest
Language:Jupyter Notebook 2
to-schi / ASR-Deepspeech2-Tensorflow
An end-to-end speech recognition engine similar to DeepSpeech2
data-preparation librispeech mel-spectrogram speech-recognition speech-to-text tensorflow ctc-decode ctc-loss
Language:Jupyter Notebook 2
andi611 / Kaldi-LibriSpeech-fMLLR
This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features
kaldi kaldi-librispeech librispeech fmllr librispeech-fmllr
Language:Shell 1
Ephrem-ETH / E2E-ASR-on-Librispeech
End-to-End Automatic Speech Recognition on LibriSpeech: PyTorch implementation
asr ctc e2e-asr librispeech
Language:Python 1
tnakatani / dnn_speech_recognition
Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline
asr librispeech speech-recognition
Language:HTML 0
HarishGoudVennakula / SELF-SUPERVISED-REPRESENTATION-LEARNING
A useful LibriSpeech-style project that does not use the datasets available on the internet. Instead, you create your own audio files and use them as input to produce text as output. Because the datasets available online can be problematic, this project comes in handy for anyone interested in this approach.
librispeech
Language:Python
realjules / speech-transformer
PyTorch implementation of Transformer-based Automatic Speech Recognition with attention mechanisms, SpecAugment, CTC loss, and mixed precision training. Achieves competitive WER/CER on LibriSpeech.
encoder-decoder speech-recognition speech-to-text transformer asr-model attention-mechanism audio-processing ctc deep-learning librispeech machine-learning natural-language-processing pytorch specaugment wandb
Language:Python
