lalimili6's repositories

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

asv-subtools

An Open Source Tools for Speaker Recognition

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bash-fun

Functional programming in bash

Language:ShellLicense:MITStargazers:0Issues:0Issues:0

DeepLearningExamples

Deep Learning Examples

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

gentle

gentle forced aligner

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

goclassy

An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

iMoCap

dataset for ECCV 2020 "Motion Capture from Internet Videos"

Stargazers:0Issues:0Issues:0

lattice_combination

Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LPCNet

Efficient neural speech synthesis

Language:CLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

mcnCrossModalEmotions

Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"

Language:MATLABLicense:MITStargazers:0Issues:0Issues:0

neural_sp

End-to-end ASR/LM implementation with PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

opendcd

Open Source WFST-based Decoder Toolkit

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Self-Supervised-Speech-Pretraining-and-Representation-Learning

The S3PRL speech toolkit: self-supervised pre-training and representation learning of Mockingjay, TERA, A-ALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts including phone classification, speaker recognition, and ASR. (All in Pytorch!)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ShEMO

Sharif Emotional Speech Database

Stargazers:0Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonStargazers:0Issues:0Issues:0

speechsquad

Conversational AI Benchmark.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vegeta

HTTP load testing tool and library. It's over 9000!

Language:GoLicense:MITStargazers:0Issues:0Issues:0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

watson-voice-bot

Create a Watson Assistant chatbot that uses voice over a web browser.

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

WavAugment

A library for speech data augmentation in time-domain

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WaveRNN

WaveRNN Vocoder + TTS

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

Stargazers:0Issues:0Issues:0