Yannan Wang's repositories
Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Conv-TasNet
Deep Neural Network for Speaker Separation
FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Forward
A library for high performance deep learning inference on NVIDIA GPUs.
jhu-neural-wpe
Neural Dereverberation
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
resemble-enhance
AI powered speech denoising and enhancement
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
speech-dereverberation
speech-dereverberation-using-GANs
Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
tacotron2-1
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
TasNet-tensorflow
A tensorflow implementation of TasNet (ICASSP 2018)
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io