Yannan Wang (wyn314)

Company: University of Science and Technology of China

Location: Hefei

Yannan Wang's repositories

asteroid

The PyTorch-based audio source separation toolkit for researchers. Pretrained models available.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR beamformers

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

bss

Engineering Expo: Sound Source Separation Team

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

clone-voice

A voice cloning tool with a web interface that uses your own voice or any other sound to record audio

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Conv-TasNet

Deep Neural Network for Speaker Separation

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

FloWaveNet

A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Forward

A library for high performance deep learning inference on NVIDIA GPUs.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

jhu-neural-wpe

Neural Dereverberation

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

LPCNet

Efficient neural speech synthesis

Language: C | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

onssen

An open-source speech separation and enhancement library

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

pase

Problem Agnostic Speech Encoder

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

pulsemodel

Pulse Model vocoder

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Python

All Algorithms implemented in Python

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Language: Perl | Stargazers: 0 | Issues: 1 | Issues: 0

resemble-enhance

AI powered speech denoising and enhancement

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

speech-dereverberation

Speech dereverberation using GANs

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Speech-Separation-Paper-Tutorial

Must-read papers on speech separation based on neural networks

Stargazers: 0 | Issues: 1 | Issues: 0

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Tacotron-2

DeepMind's Tacotron-2 TensorFlow implementation

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

tacotron2-1

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

TasNet-tensorflow

A TensorFlow implementation of TasNet (ICASSP 2018)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

waveglow

A PyTorch implementation of "WaveGlow: A Flow-based Generative Network for Speech Synthesis"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0