luquesky's repositories
speech_dataset_generator
Generate speech data sets using the audios and transcriptions of YouTube videos.
ASR_Audio_Data_Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audioset_models
📊 Easily apply 527 machine learning models trained on AudioSet.
Bison-BCN-practical-info
Practical info for BISON BCN meeting Sept. 25th
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
dejavu
Audio fingerprinting and recognition in Python
download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
dysts
More than a hundred strange attractors
gcommands
Speech Commands Recognition using end-to-end deep learning models in PyTorch
kaldi-ios-poc
Proof of concept app that demonstrates use of KeenASR speech recognition framework
Keras-Trigger-Word
How to do Real Time Trigger Word Detection with Keras | DLology
kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
KWS-1
Keyword Spotting for detecting a word in an audio file
make-a-smart-speaker
A collection of resources to make a smart speaker
ML-KWS-for-MCU
Keyword spotting on Arm Cortex-M Microcontrollers
mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
reinforcement-learning-an-introduction
Python implementation for Reinforcement Learning: An Introduction
SGC
Official implementation for the paper "Simplifying Graph Convolutional Networks"
TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
tesis
Repo for my thesis
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).
voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python - 10 chapters and 200+ scripts.
vosk-android-demo
Runnable demo for Kaldi on Android
vrain
vrAIn
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
Wav2Letter-1
Speech recognition model based on a FAIR research paper, built using PyTorch.
zerospeech2017
All you need to get started for the Zero Speech Challenge 2017