Splend1d

許湛然(Jeff Hsu)'s repositories

T5lephone

Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5

Language:Python19 30

Zhuan

Small seal script (小篆) learner: learn ancient Chinese characters

Language:CSS NOASSERTION 8 2 1

Tomofun-Challenge-Audio-Classificaiton

This is a repository for the Tomofun Dog Sound Recognition AI Million Challenge (狗音辨識 AI 百萬挑戰賽), an audio classification challenge focusing on dog sounds and noises inside the house.

Language:Python600

wav2vec-u-patch

Repository for "Analyzing the Robustness of Unsupervised Speech Recognition", including patches to wav2vec-u and analysis code

Language:Roff500

ML_submission_parser

ML submission parser

Language:Python3 20

wav2vec-u

300

hfDUAL

DUAL with run_squad

Language:Python200

XDBERT

Code for ACL 2022 Conference Paper "XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding"

Language:Python200

CLIP

Contrastive Language-Image Pretraining

MIT100

Codejam2020

Python implementation of Code Jam 2020

Language:Python100

darkchess

Multiple assignments for the course "Theory of Computer Games"

Language:Shell1 20

FRAIG

Functionally Reduced AND-Inverter Graph

Language:C++1 20

Splend1d

Language:HTML1 20

ALIEN

A game that can be used as a corpus collector

Language:HTML020

annotated_deep_learning_paper_implementations

🧑‍🏫 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit), optimizers (adam, radam, adabelief), gans(dcgan, cyclegan, stylegan2), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, etc. 🧠

MIT000

datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Apache-2.0 000

DUAL-textless-SQA

Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.

Language:Python000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT000

lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

MIT000

medium-appendix

Code and other references for posts on Medium

Language:Python000

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

NOASSERTION000

MR-Models

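MediaTek Research (聯發創新基地) is dedicated to research on foundation models. We apply our research to models suited to Traditional Chinese users and, where licensing permits, make those models available for academic research or industry use.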

NOASSERTION000

Reddit_Showerthought_Analysis

Language:Python000

research-contributions

Implementations of recent research prototypes/demonstrations using MONAI.

Apache-2.0 000

s3prl-ssl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:Python Apache-2.0 000

SpeechMix

Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT)

Language:Python000

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Language:Python Apache-2.0 010

Voice-Conversion

000

voidful

000

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language:C++ NOASSERTION 010