PengyvWANG

PengyuWang's repositories

VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Language:PythonMIT100

Deep-Learning-Specialization-Coursera

This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.

Language:Jupyter NotebookApache-2.0000

PengyuWang.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Language:Shell000

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Language:PythonMIT000

ShiArthur03

Language:MATLABGPL-3.0000

VAENAR-TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Language:PythonMIT000

VPIDM

This is official repository of new SOTA diffusion models based method for speech enhancement

Language:PythonGPL-3.0000