luquesky's repositories

speech_dataset_generator

Generate speech data sets using the audios and transcriptions of YouTube videos.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

ASR_Audio_Data_Links

A list of publically available audio data that anyone can download for ASR or other speech activities

License:Apache-2.0Stargazers:0Issues:1Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audioset_models

📊 Easily apply 527 machine learning models trained on AudioSet.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Bison-BCN-practical-info

Practical info for BISON BCN meeting Sept. 25th

Stargazers:0Issues:2Issues:0

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dejavu

Audio fingerprinting and recognition in Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

download_audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

dysts

More than a hundred strange attractors

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gcommands

Speech Commands Recognition using end-to-end deep learning models in pytorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kaldi-ios-poc

Proof of concept app that demonstrates use of KeenASR speech recognition framework

Language:Objective-CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Keras-Trigger-Word

How to do Real Time Trigger Word Detection with Keras | DLology

License:NOASSERTIONStargazers:0Issues:0Issues:0

kws

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

License:MITStargazers:0Issues:0Issues:0

KWS-1

Keyword Spotting for detecting a word in an audio file

Stargazers:0Issues:0Issues:0

make-a-smart-speaker

A collection of resources to make a smart speaker

Stargazers:0Issues:0Issues:0

ML-KWS-for-MCU

Keyword spotting on Arm Cortex-M Microcontrollers

License:Apache-2.0Stargazers:0Issues:0Issues:0

mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

License:Apache-2.0Stargazers:0Issues:0Issues:0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

License:NOASSERTIONStargazers:0Issues:0Issues:0

reinforcement-learning-an-introduction

Python implementation for Reinforcement Learning: An Introduction

License:Apache-2.0Stargazers:0Issues:0Issues:0

SGC

official implementation for the paper "Simplifying Graph Convolutional Networks"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TC-ResNet

Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices

License:Apache-2.0Stargazers:0Issues:0Issues:0

tesis

Repo de mi tesis

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).

Stargazers:0Issues:0Issues:0

voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python - 10 chapters and 200+ scripts.

License:Apache-2.0Stargazers:0Issues:0Issues:0

vosk-android-demo

Runnable demo for Kaldi android

License:Apache-2.0Stargazers:0Issues:0Issues:0

vrain

vrAIn

Stargazers:0Issues:0Issues:0

wav2letter

Facebook AI Research Automatic Speech Recognition Toolkit

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Wav2Letter-1

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stargazers:0Issues:0Issues:0

zerospeech2017

All you need to get started for the Zero Speech Challenge 2017

License:GPL-3.0Stargazers:0Issues:0Issues:0