Ofir Press (ofirpress)

ofirpress

Geek Repo

Company:@uwnlp

Home Page:http://ofir.io/about

Twitter:@ofirpress

Github PK Tool:Github PK Tool


Organizations
princeton-nlp
uwnlp

Ofir Press's repositories

attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Language:PythonLicense:MITStargazers:480Issues:11Issues:18

YouMayNotNeedAttention

Code for the Eager Translation Model from the paper You May Not Need Attention

self-ask

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Language:Jupyter NotebookLicense:MITStargazers:283Issues:6Issues:4

shortformer

Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.

Language:PythonLicense:MITStargazers:146Issues:4Issues:4

sandwich_transformer

This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.

Language:PythonLicense:NOASSERTIONStargazers:55Issues:3Issues:2

UsingTheOutputEmbedding

Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf

Language:LuaStargazers:44Issues:4Issues:0

0plot

Use 0plot to automatically build matplotlib plots using ChatGPT.

Language:JavaScriptLicense:Apache-2.0Stargazers:19Issues:1Issues:4

tstl_t5_bias

This is our implementation of the T5 bias for fairseq.

Language:PythonLicense:MITStargazers:2Issues:1Issues:0

tensorflow_with_latest_papers

Implementation of Newest RNN and Seq2Seq Features

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

composer

library of algorithms to speed up neural network training

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

LeViT_ALiBi

LeViT + ALiBi

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ofirpress.github.io

Build a Jekyll blog in minutes, without touching the command line.

Language:SCSSLicense:MITStargazers:0Issues:1Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

RecurrentHighwayNetworks

Recurrent Highway Networks - Author implementation for Tensorflow and Torch

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:0Issues:0

sockeye

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

the-gan-zoo

A list of all named GANs!

Language:PythonLicense:MITStargazers:0Issues:2Issues:0