vectominist

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonMIT000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0000

eval-word-vectors

Easy to use scripts for evaluating word vectors on a variety of tasks.

Language:PythonMIT000

GeoRect-Demo

Demo of Deep Learning-based Image Geometric Rectification

Language:Jupyter Notebook010

ICG2020Spring-HW1

🎨 HW1 (shading and transformation) of the course Interactive Computer Graphics, NTU CSIE.

Language:JavaScript010

ml-ta-helper

Language:Python000

phone-seg-ssl

Phoneme segmentation using pre-trained speech models

Language:PythonGPL-3.0000

receptive-field-calculator

A simple receptive field calculator for convolutional neural networks (CNN).

Language:Python010

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonApache-2.0000

spectra-review-paper-competition

Competition for best expository article on cutting-edge ML research

000

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonApache-2.0000

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonMIT000

vectominist

010

vectominist.github.io

Language:HTML010

vectominist.github.io.old

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

website

Language:HTML000

zr-2021vg_baseline

Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition

Language:PythonApache-2.0000