Amanda Bertsch's repositories
unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
perspective-shifting
Code for the Findings of EMNLP 2022 paper "He Said, She Said: Style Transfer for Shifting the Perspective of Dialogues"
conlang_generator
Playing around with conlang generation using syntax trees and IPA
minimum-bayes-risk
For the preprint "It's MBR All the Way Down"
topic-modeling
Different dimensionality reduction techniques applied to the 20 Newsgroups toy dataset for topic modeling
30-seconds-of-python
Short Python code snippets for all your development needs
dialogue-collection
Scraping social media for natural language dialogues
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
genre-labeled-bookcorpus
This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset
lattice-generation
Code for the paper "Massive-scale Decoding for Text Generation using Lattices"
lm-evaluation-harness
A framework for few-shot evaluation of language models.
lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
wikipedia-peacock-finder
Deployment for the peacock phrase detection project
wikiwatson
A partial imitation of IBM Watson using information retrieval on Wikipedia articles.