許湛然(Jeff Hsu)'s repositories

T5lephone

Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5

Language: Python | Stargazers: 19 | Issues: 3 | Issues: 0

Zhuan

Small seal script learner (小篆學習器): learn ancient Chinese characters

Language: CSS | License: NOASSERTION | Stargazers: 8 | Issues: 2 | Issues: 1

Tomofun-Challenge-Audio-Classificaiton

This is a repository for the Tomofun Dog Sound Recognition AI Million Challenge (Tomofun 狗音辨識 AI 百萬挑戰賽), an audio classification challenge focusing on dog sounds and noises inside the house.

Language: Python | Stargazers: 6 | Issues: 0 | Issues: 0

wav2vec-u-patch

Repository for "Analyzing the Robustness of Unsupervised Speech Recognition", including patches to wav2vec-u and analysis code

Language: Roff | Stargazers: 5 | Issues: 0 | Issues: 0

ML_submission_parser

ML submission parser

Language: Python | Stargazers: 3 | Issues: 2 | Issues: 0

hfDUAL

DUAL with run_squad

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

XDBERT

Code for ACL 2022 Conference Paper "XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding"

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

CLIP

Contrastive Language-Image Pretraining

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Codejam2020

Python implementations for Code Jam 2020

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

darkchess

Multiple assignments for the course "Theory of Computer Games"

Language: Shell | Stargazers: 1 | Issues: 2 | Issues: 0

FRAIG

Functionally Reduced And-Inverter Graph

Language: C++ | Stargazers: 1 | Issues: 2 | Issues: 0

ALIEN

A game that can be used as a corpus collector

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit), optimizers (adam, radam, adabelief), gans(dcgan, cyclegan, stylegan2), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, etc. 🧠

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DUAL-textless-SQA

Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

medium-appendix

Code and other references for posts on Medium

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

MR-Models

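MediaTek Research (聯發創新基地) is dedicated to researching foundation models. We embody our research in models suited to Traditional Chinese users and, where licensing permits, provide these models for academic research or industry use.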

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

research-contributions

Implementations of recent research prototypes/demonstrations using MONAI.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

s3prl-ssl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpeechMix

Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0