jtang-asapp

jtang-asapp's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python · MIT · 55516 456 132

alt-tab-macos

Windows alt-tab on macOS

Language: Swift · GPL-3.0 · 10857 43 3420

multilingual-t5

Language: Python · Apache-2.0 · 1248 22 48

Multimodal-Transformer

[ACL'19] [PyTorch] Multimodal Transformer

Language: Python · MIT · 812 15 49

neuspell

NeuSpell: A Neural Spelling Correction Toolkit

Language: Python · MIT · 667 10 74

voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Language: Python · NOASSERTION · 509 18 22

Graph2Seq

Graph2Seq is a simple codebase for building a graph encoder and sequence decoder for NLP and other AI/ML/DL tasks.

Language: Python · Apache-2.0 · 238 15 10

PLOME

Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021

Language: Python · Apache-2.0 · 228 3 31

Overview-of-Non-autoregressive-Applications

Apache-2.0 · 161 3 1

bidirectional-cross-attention

A simple cross attention that updates both the source and target in one step

Language: Python · MIT · 147 4 2

speech-datasets

Various speech datasets made available to the public

Language: Jupyter Notebook · 98 15 12

confidence-aware-learning

Confidence-Aware Learning for Deep Neural Networks (ICML2020)

Language: Python · MIT · 72 6 4

CRASpell

The code for our ACL 2022 Findings paper: CRASpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Correction

Language: Python · MIT · 72 2 5

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

Apache-2.0 · 72 2 2

transducer-loss-benchmarking

Language: Python · NOASSERTION · 64 5 8

WhisperBiasing

Language: Jupyter Notebook · MIT · 63 2 9

sentence-doctor

Many natural language processing tasks rely on sentence boundary detection (SBD). Although excellent libraries like spaCy provide state-of-the-art SBD, they often depend on text extractors (e.g., PDF text extractors or OCR). The quality of these extractors greatly influences the quality of SBD libraries and, as a consequence, the performance of downstream models. To help address this problem, we fine-tuned a T5 model from the Hugging Face Hub that attempts to reconstruct “broken sentences”.

Language: Python · 61 3 4

gridspace-stanford-harper-valley

The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.

Language: Python · CC-BY-4.0 · 41 10 2

Awesome-Failure-Detection

A list of papers that study out-of-distribution (OOD) detection and misclassification detection (MisD)

MIT · 40 1 1

emoASR

End-to-end MOdeling of ASR (Automatic Speech Recognition)

Language: Python · 33 3 27

OpenMix

PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"

Language: Python · MIT · 23 2 6

fbai-speech

Repo for the FB AI Speech team.

Language: Python · MIT · 22 6 1

aligned-cross-entropy

Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655

Language: Jupyter Notebook · MIT · 21 4 1

contextual-attention-nlm

Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.

Language: Python · NOASSERTION · 14 20

SoundsLike

A python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.

Language: Python · Apache-2.0 · 13 10

Contextual-Biasing-Dataset

Open-source Mandarin biased-word dataset

10 1 2

benchmarking-uncertainty-estimation-performance

Language: Python · MIT · 8 1 1

fairseq-dual-loss

Language: Python · MIT · 5 10

NeMo

NeMo: a toolkit for conversational AI

Language: Python · Apache-2.0 · 100

tapir

Code for "TAPIR: Learning Adaptive Revision for Incremental Natural Language Understanding with a Two-Pass Model", Findings of ACL 2023

Language: Python · MIT · 100