Ofir Press's repositories
attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
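The description only names the method, so as a reminder of what ALiBi computes: instead of position embeddings, it adds a static, head-specific linear penalty to the attention scores that grows with query-key distance. A minimal NumPy sketch (the geometric slope schedule follows the paper; the function names are illustrative, not taken from the repo, and the power-of-two head count is an assumption):

```python
import numpy as np

def alibi_slopes(n_heads):
    # Per-head slopes form a geometric sequence: 2^(-8/n), 2^(-16/n), ...
    # (assumes n_heads is a power of two, as in the paper's main setup)
    start = 2 ** (-8.0 / n_heads)
    return np.array([start ** (i + 1) for i in range(n_heads)])

def alibi_bias(n_heads, seq_len):
    # Bias added to causal attention logits: 0 on the diagonal,
    # increasingly negative for more distant (earlier) keys.
    pos = np.arange(seq_len)
    dist = pos[None, :] - pos[:, None]          # key_pos - query_pos (<= 0 below diagonal)
    slopes = alibi_slopes(n_heads)
    return slopes[:, None, None] * dist[None, :, :]  # shape (heads, seq, seq)
```

The bias is added to the query-key scores before the softmax (together with the usual causal mask), so no learned position parameters are needed and the model extrapolates to longer sequences.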
YouMayNotNeedAttention
Code for the Eager Translation Model from the paper "You May Not Need Attention"
shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
sandwich_transformer
Code for running the character-level Sandwich Transformers from the ACL 2020 paper "Improving Transformer Models by Reordering their Sublayers".
UsingTheOutputEmbedding
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
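The idea behind that paper (weight tying) can be shown in a few lines: the matrix used to embed input tokens is reused as the output projection, halving the embedding parameters. A minimal NumPy sketch under that reading (names are illustrative, not from the repo):

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, d_model = 100, 16

# A single shared matrix acts as both the input embedding and the output projection.
E = rng.normal(size=(vocab_size, d_model))

def embed(token_ids):
    # Input side: look up rows of the shared matrix.
    return E[token_ids]

def output_logits(hidden):
    # Output side: score every vocabulary item against the same rows.
    return hidden @ E.T  # shape (..., vocab_size)
```

Because both directions use `E`, gradient updates from the output layer also improve the input representations, which the paper shows helps perplexity.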
tstl_t5_bias
This is our implementation of the T5 bias for fairseq.
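For context on what the T5 bias is: a learned scalar per attention head is added to each query-key logit, indexed by a bucket of the relative distance; small distances get exact buckets and larger ones share logarithmically sized buckets. A simplified sketch of the causal bucketing (bucket counts follow the T5 paper's defaults; the exact fairseq port in this repo may differ):

```python
import math

NUM_BUCKETS, MAX_DISTANCE = 32, 128  # defaults from the T5 paper

def t5_bucket(rel_dist):
    # Causal case: rel_dist = query_pos - key_pos >= 0.
    # The first half of the buckets cover exact small distances;
    # the rest grow logarithmically up to MAX_DISTANCE.
    half = NUM_BUCKETS // 2
    if rel_dist < half:
        return rel_dist
    log_ratio = math.log(rel_dist / half) / math.log(MAX_DISTANCE / half)
    return min(NUM_BUCKETS - 1, half + int(log_ratio * half))
```

At attention time, a learned table of shape `(n_heads, NUM_BUCKETS)` is indexed with these buckets and added to the logits before the softmax, replacing absolute position embeddings.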
tensorflow_with_latest_papers
Implementations of recent RNN and Seq2Seq techniques in TensorFlow
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
composer
A library of algorithms to speed up neural network training
LeViT_ALiBi
LeViT + ALiBi
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
ofirpress.github.io
Build a Jekyll blog in minutes, without touching the command line.
RecurrentHighwayNetworks
Recurrent Highway Networks - Author implementation for Tensorflow and Torch
tensorflow
Computation using data flow graphs for scalable machine learning
the-gan-zoo
A list of all named GANs!