Heng-Jui Chang's repositories
End-to-end-ASR-Pytorch-DLHLP
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
Face-Image-Morphing
Face Image Morphing: an OpenCV and NumPy Implementation
awesome-self-supervised-speech-representation-learning
A comprehensive list of awesome self-supervised speech representation learning papers.
Switchboard-WSJ-Utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python 3
Course-Map-Visualization
A simple website for visualizing course maps.
SBCSAE-preprocess
Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
ZJ-Solutions-in-Python
Solutions to ZeroJudge in Python
Algorithms2019Fall
Solutions to the three programming assignments of the course Algorithms 2019 Fall, NTU EE.
benchmarks
A command-line tool for working with the "Zero Resource Challenge" benchmarks
End-to-end-ASR-Pytorch
An open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR, implemented with PyTorch, the well-known deep learning toolkit.
espnet
End-to-End Speech Processing Toolkit
eval-word-vectors
Easy to use scripts for evaluating word vectors on a variety of tasks.
GeoRect-Demo
Demo of Deep Learning-based Image Geometric Rectification
ICG2020Spring-HW1
HW1 (shading and transformation) of the course Interactive Computer Graphics, NTU CSIE.
phone-seg-ssl
Phoneme segmentation using pre-trained speech models
receptive-field-calculator
A simple receptive field calculator for convolutional neural networks (CNN).
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
spectra-review-paper-competition
Competition for best expository article on cutting-edge ML research
SpeechTokenizer
This is the code for SpeechTokenizer, presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
vectominist.github.io.old
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
zr-2021vg_baseline
Baselines for the Zero-Resource Speech Challenge using Visually Grounded Models of Spoken Language, 2021 edition