wnhsu

http://people.csail.mit.edu/wnhsu/

Wei-Ning Hsu's repositories

FactorizedHierarchicalVAE

This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"

Language: Python | License: Apache-2.0 | 151 7 8

ScalableFHVAE

This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders"

Language: Python | 52 5 5

SpeechVAE

This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".

Language: Python | License: Apache-2.0 | 51 6 7

ResDAVEnet-VQ

Official code for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Language: Jupyter Notebook | License: BSD-3-Clause | 26 20

ReVISE

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

Language: HTML | 13 3 1

PGLSTM_ASR

This repo contains code to reproduce the core results of "A Prioritized Grid Long Short-Term Memory RNN for Speech Recognition"

Language: Shell | 3 10

semi-supervised-pytorch

Implementations of different VAE-based semi-supervised and generative models in PyTorch

Language: Python | License: MIT | 3 20

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language: Python | License: MIT | 2 10

tacotron2_dev

Language: Jupyter Notebook | License: BSD-3-Clause | 1 10

wavenet_vocoder

WaveNet vocoder

Language: Python | License: NOASSERTION | 1 20

ZeroSpeech2019_RLE_eval

ZeroSpeech 2019 evaluation with run-length encoding (RLE), metrics reported in ResDAVEnet-VQ.

Language: Python | 1 30

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language: Python | 020

ABXpy

ABX discrimination task in Python

Language: Python | License: MIT | 010

CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Language: C++ | License: NOASSERTION | 010

einops

Deep learning operations reinvented (for pytorch, tensorflow, jax and others)

Language: Python | License: MIT | 000

espnet_tts_frontend

Text frontend for ESPnet tts recipes

000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | 010

image-to-speech-demo

020

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | 010

show-attend-and-tell

TensorFlow Implementation of "Show, Attend and Tell"

Language: Jupyter Notebook | License: MIT | 010

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language: C++ | License: NOASSERTION | 020

wnhsu.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | 000