Chenyang Huang's repositories
Emotionator
The best emotion detector on this planet 🌍
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Awesome-Text-Diffusion-Models
[IJCAI'23] The official GitHub page for the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
BANG
BANG is a new pretraining model that Bridges the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generation can be viewed uniformly as differing in the extent to which previous tokens can be attended to, and BANG bridges the two with a novel model structure designed for large-scale pretraining. The pretrained BANG model supports AR, NAR, and semi-NAR generation to meet different requirements.
benchmarks
Collection of benchmarks written by researchers at Amii
chenyangh.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
eflomal
Efficient Low-Memory Aligner
esm
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
fairseq-1
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
fairseq2
FAIR Sequence Modeling Toolkit 2
giza-py
A simple, Python-based, command-line runner for MGIZA++.
GLUE-baselines
[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations
GMA
Code for the Findings of ACL 2022 paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
MoE-Waitk
Code for the EMNLP 2021 oral paper "Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy"
NAG-BERT
[EACL'21] Non-Autoregressive with Pretrained Language Model
pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
tensor2struct-public
Semantic parsers based on encoder-decoder framework