Beast code in Giters

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Language:CNOASSERTION000

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION020

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT000

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:Perl000

rnnoise_orignal

Recurrent neural network for audio noise reduction

Language:CBSD-3-Clause020

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language:PythonMIT000

SourceLocalization

SPR-PHAT source localization

Language:Matlab020

speech-denoising-wavenet

A neural network for end-to-end speech denoising

Language:PythonMIT000

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0000

srp_phat

Language:Matlab000

tf-kaldi-speaker-master

Language:PythonApache-2.0010

Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Language:MATLAB010

Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization

A two-stage polyphonic sound event detection and localization method. It can achieve the best scores for both SED and DOA.

Language:Python000

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonMIT010

WebRTC_AGC

Automatic Gain Control Module Port From WebRTC

Language:CBSD-3-Clause020