Saurabh Vyas (saurabhvyas)

saurabhvyas

User data from Github https://github.com/saurabhvyas

Location:India

GitHub:@saurabhvyas

Saurabh Vyas's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:55519Issues:943Issues:1101

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:39511Issues:997Issues:1144

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22943Issues:1263Issues:101

libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000FPS.

Language:C++License:NOASSERTIONStargazers:12622Issues:532Issues:321

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Language:PythonLicense:BSD-3-ClauseStargazers:8861Issues:274Issues:632

ffsubsync

Automagically synchronize subtitles with video.

Language:PythonLicense:MITStargazers:7342Issues:77Issues:161

DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Language:PythonLicense:Apache-2.0Stargazers:6930Issues:207Issues:643

BERT-pytorch

Google AI 2018 BERT pytorch implementation

Language:PythonLicense:Apache-2.0Stargazers:6447Issues:124Issues:88

yolact

A simple, fully convolutional model for real-time instance segmentation.

Language:PythonLicense:MITStargazers:5171Issues:103Issues:795

waveglow

A Flow-based Generative Network for Speech Synthesis

Language:PythonLicense:BSD-3-ClauseStargazers:2333Issues:76Issues:257

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

Language:PythonLicense:BSD-2-ClauseStargazers:1087Issues:68Issues:222

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Language:PythonLicense:NOASSERTIONStargazers:943Issues:43Issues:54

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:859Issues:28Issues:96

FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language:PythonLicense:MITStargazers:491Issues:41Issues:21

zamia-speech

Open tools and data for cloudless automatic speech recognition

Language:PythonLicense:LGPL-3.0Stargazers:447Issues:37Issues:90

obamanet

ObamaNet : Photo-realistic lip-sync from audio (Unofficial port)

Language:PythonLicense:MITStargazers:238Issues:13Issues:27

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.

Language:C++License:NOASSERTIONStargazers:228Issues:15Issues:0

pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

Language:PythonLicense:MITStargazers:174Issues:12Issues:14

speech_separation

Include some core functions and model to handle speech separation

Language:PythonLicense:MITStargazers:155Issues:11Issues:29

KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Language:PythonLicense:MITStargazers:154Issues:16Issues:4

audiomate

Python library for handling audio datasets.

Language:PythonLicense:MITStargazers:137Issues:11Issues:81

idlak

Official home of the Idlak Speech Synthesis Toolkit

Language:ShellLicense:NOASSERTIONStargazers:66Issues:11Issues:31

ventib

:chart_with_upwards_trend: Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

DiViMe

ACLEW Diarization Virtual Machine

Language:ShellLicense:Apache-2.0Stargazers:33Issues:13Issues:152

asr24

24-hour Automatic Speech Recognition

Language:C++License:GPL-3.0Stargazers:27Issues:10Issues:0

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Language:ShellLicense:Apache-2.0Stargazers:23Issues:3Issues:1

prep4kaldi

Data preparation code for building Kaldi ASR system

Language:PythonLicense:GPL-3.0Stargazers:14Issues:2Issues:1

kaldi-helpers

Helper scripts to work with Kaldi

Language:PythonLicense:MITStargazers:6Issues:5Issues:3

kaldi_scripts

a few useful kaldi scripts for my own use

Language:ShellStargazers:1Issues:0Issues:0