PengyuWang's repositories

VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

Deep-Learning-Specialization-Coursera

This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PengyuWang.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:ShellStargazers:0Issues:0Issues:0

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:MATLABLicense:GPL-3.0Stargazers:0Issues:0Issues:0

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VPIDM

This is official repository of new SOTA diffusion models based method for speech enhancement

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0