Martin Tutek's starred repositories
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
promptsource
Toolkit for creating, sharing and using natural language prompts.
state-spaces
Sequence Modeling with Structured State Spaces
pytorch-goodies
PyTorch Boilerplate For Research
slot-attention
Implementation of Slot Attention from GoogleAI
mlm-pytorch
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
xai-benchmark
A Diagnostic Study of Explainability Techniques for Text Classification
Interpretable-Attention
Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)
fairseq-entmax
Demo of fairseq with entmax loss and fenchel-young label smoothing
acl-search
A search-as-you-type web app for the ACL Anthology.