Saurabh Vyas's starred repositories
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
libfacedetection
An open source library for face detection in images. The face detection speed can reach 1000FPS.
speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
DeepPavlov
An open source library for deep learning end-to-end dialog systems and chatbots.
BERT-pytorch
Google AI 2018 BERT pytorch implementation
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
zamia-speech
Open tools and data for cloudless automatic speech recognition
kaldi-dnn-ali-gop
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.
speech_separation
Include some core functions and model to handle speech separation
KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
kaldi-long-audio-alignment
Long audio alignment using Kaldi
prep4kaldi
Data preparation code for building Kaldi ASR system
kaldi-helpers
Helper scripts to work with Kaldi
kaldi_scripts
a few useful kaldi scripts for my own use