Beast code in Giters

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language:HTMLMIT010

onssen

An open-source speech separation and enhancement library

Language:Python010

pase

Problem Agnostic Speech Encoder

Language:Python010

pulsemodel

Pulse Model vocoder

Language:PythonApache-2.0010

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT010

Python

All Algorithms implemented in Python

Language:PythonMIT010

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Language:Perl010

resemble-enhance

AI powered speech denoising and enhancement

MIT000

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:CNOASSERTION000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonMIT000

speech-dereverberation

speech-dereverberation-using-GANs

Language:Python010

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

010

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language:PythonMIT000

Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Language:PythonMIT010

tacotron2-1

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language:Jupyter NotebookBSD-3-Clause010

TasNet-tensorflow

A tensorflow implementation of TasNet (ICASSP 2018)

Language:Python010

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language:PythonNOASSERTION010

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Apache-2.0000

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT000

waveglow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Language:PythonApache-2.0010

wyn314

Yannan Wang's repositories

asteroid

Beamforming-for-speech-enhancement

bss

clone-voice

Conv-TasNet

FloWaveNet

Forward

jhu-neural-wpe

LPCNet

MeloTTS

MS-SNSD

onssen

pase

pulsemodel

pyroomacoustics

Python

pytorch-kaldi

resemble-enhance

seamless_communication

silero-vad

speech-dereverberation

Speech-Separation-Paper-Tutorial

tacotron

Tacotron-2

tacotron2-1

TasNet-tensorflow

uis-rnn

vall-e

VALL-E-X

waveglow