jtang-asapp's starred repositories
annotated_deep_learning_paper_implementations
🧑🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
alt-tab-macos
Windows alt-tab on macOS
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
bidirectional-cross-attention
A simple cross attention that updates both the source and target in one step
speech-datasets
Various speech datasets made available to the public
confidence-aware-learning
Confidence-Aware Learning for Deep Neural Networks (ICML 2020)
awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
sentence-doctor
Many natural language processing tasks rely on sentence boundary detection (SBD). Although excellent libraries like spaCy provide state-of-the-art SBD, they often depend on text extractors (e.g., PDF text extractors or OCR). The quality of these extractors greatly influences the quality of SBD libraries and, as a consequence, the performance of downstream models. To help address this problem, we fine-tuned a T5 model from the Hugging Face Hub that attempts to reconstruct “broken sentences”.
gridspace-stanford-harper-valley
The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.
Awesome-Failure-Detection
A list of papers that study out-of-distribution (OOD) detection and misclassification detection (MisD)
fbai-speech
Repo for the FB AI Speech team.
aligned-cross-entropy
Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655
contextual-attention-nlm
Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.
SoundsLike
A Python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.
Contextual-Biasing-Dataset
Open-source Mandarin biased-word dataset