Wei-Ning Hsu's repositories

FactorizedHierarchicalVAE

This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"

Language:PythonLicense:Apache-2.0Stargazers:151Issues:7Issues:8

ScalableFHVAE

This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders"

SpeechVAE

This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".

Language:PythonLicense:Apache-2.0Stargazers:51Issues:6Issues:7

ResDAVEnet-VQ

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:26Issues:2Issues:0

ReVISE

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

PGLSTM_ASR

This repo contains codes to reproduce the core results of "A Prioritized Grid Long Short-Term Memory RNN for Speech Recognition"

Language:ShellStargazers:3Issues:1Issues:0

semi-supervised-pytorch

Implementations of different VAE-based semi-supervised and generative models in PyTorch

Language:PythonLicense:MITStargazers:3Issues:2Issues:0

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language:PythonLicense:MITStargazers:2Issues:1Issues:0
Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1Issues:1Issues:0

wavenet_vocoder

WaveNet vocoder

Language:PythonLicense:NOASSERTIONStargazers:1Issues:2Issues:0

ZeroSpeech2019_RLE_eval

ZeroSpeech 2019 evaluation with run-length encoding (RLE), metrics reported in ResDAVEnet-VQ.

Language:PythonStargazers:1Issues:3Issues:0

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language:PythonStargazers:0Issues:2Issues:0

ABXpy

ABX discrimination task in python

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

einops

Deep learning operations reinvented (for pytorch, tensorflow, jax and others)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

espnet_tts_frontend

Text frontend for ESPnet tts recipes

Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:1Issues:0

show-attend-and-tell

TensorFlow Implementation of "Show, Attend and Tell"

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

wnhsu.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0