許湛然(Jeff Hsu)'s repositories
Tomofun-Challenge-Audio-Classificaiton
This is a repository for Tomofun 狗音辨識 AI 百萬挑戰賽, a audio classification challenge focusing on dog sounds and noises inside the house.
wav2vec-u-patch
Repository for "Analyzing the Robustness of Unsupervised Speech Recognition", including patches to wav2vec-u and analysis code
ML_submission_parser
ML submission parser
Codejam2020
python implementation of codejam 2020
annotated_deep_learning_paper_implementations
🧑🏫 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit), optimizers (adam, radam, adabelief), gans(dcgan, cyclegan, stylegan2), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, etc. 🧠
datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
medium-appendix
codes and other refs for posts on medium
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
MR-Models
聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究或產業界使用。
research-contributions
Implementations of recent research prototypes/demonstrations using MONAI.
s3prl-ssl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
SpeechMix
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit