Aaron Mueller's repositories
contextualized-topic-models
A Python package to set up topic classification fine-tuning, run contextualized topic modeling, and run TCCTMs.
emergent-syntax
Code for "How to Plant Trees in Language Models" (ACL 2023).
syntax-icl
Code and data for "In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax".
aaronmueller.github.io
Aaron Mueller's personal website.
multilingual-lm-intervention
Multilingual causal mediation analysis
lm-evaluation-harness
Few-shot evaluation of language models. Fork for the BabyLM competition (CoNLL '23).
messing-with-fst
Trying out finite-state transducers.
dont-stop-pretraining
Adapting the Don't Stop Pretraining approach for multilingual applications. Modified by Aaron Mueller and Nathaniel Weir.
dotfiles
Config files for easy setup on new UNIX-based machines
earley-parser
Earley parser implementation.
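For context, the chart-based parsing strategy this repo's name refers to can be sketched as a minimal recognizer. This is an illustrative sketch, not the repo's actual code; the `earley_recognize` function name and the dict-of-tuples grammar encoding are assumptions made here for brevity:

```python
def earley_recognize(grammar, start, words):
    # grammar: dict mapping a nonterminal to a list of right-hand-side tuples;
    # any symbol not in the dict is treated as a terminal (illustrative encoding).
    # An item is (lhs, rhs, dot, origin): dot marks progress through rhs,
    # origin is the chart index where the item was predicted.
    chart = [set() for _ in range(len(words) + 1)]
    for rhs in grammar[start]:
        chart[0].add((start, rhs, 0, 0))
    for i in range(len(words) + 1):
        added = True
        while added:  # iterate to a fixed point over chart[i]
            added = False
            for lhs, rhs, dot, origin in list(chart[i]):
                if dot < len(rhs):
                    sym = rhs[dot]
                    if sym in grammar:  # predictor: expand the nonterminal
                        for prod in grammar[sym]:
                            new = (sym, prod, 0, i)
                            if new not in chart[i]:
                                chart[i].add(new)
                                added = True
                    elif i < len(words) and sym == words[i]:  # scanner
                        chart[i + 1].add((lhs, rhs, dot + 1, origin))
                else:  # completer: advance items waiting on this nonterminal
                    for plhs, prhs, pdot, porigin in list(chart[origin]):
                        if pdot < len(prhs) and prhs[pdot] == lhs:
                            new = (plhs, prhs, pdot + 1, porigin)
                            if new not in chart[i]:
                                chart[i].add(new)
                                added = True
    # accept if a completed start item spans the whole input
    return any(lhs == start and dot == len(rhs) and origin == 0
               for lhs, rhs, dot, origin in chart[len(words)])
```

For example, with `grammar = {"S": [("NP", "VP")], "NP": [("they",)], "VP": [("run",)]}`, the recognizer accepts `["they", "run"]` and rejects `["run", "they"]`.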
inverse-scaling-eval-pipeline
Basic pipeline for running different-sized GPT models and plotting the results.
mBERT-docclass
Investigation of different methods of multilingual fine-tuning for document classification with mBERT.
minicons
Utility for analyzing Transformer-based representations of language.
mt-decoders
Basic IBM-style machine translation models with various decoding methods.
neural-narrative-generation
Generating stories given prompts using GPT-2. We also try diverse decoding!
nshell
A basic shell environment written in C.
parlai-hred
Implementation of Hierarchical Recurrent Encoder-Decoder (HRED) model for narrative generation in ParlAI.
pos-hmm
A Hidden Markov Model part-of-speech tagger.
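The standard decoding step for an HMM tagger is the Viterbi algorithm, which can be sketched as follows. This is a generic sketch under assumed inputs (log-probability dicts for transitions, emissions, and initial tags), not the repo's actual implementation:

```python
import math

def viterbi(words, tags, trans, emit, start):
    # words: observed tokens; tags: candidate tag set
    # trans[(t_prev, t)], emit[(t, w)], start[t]: log-probabilities
    # (missing entries are treated as log 0, i.e. -inf)
    NEG_INF = float("-inf")
    # initialize with start probabilities times first emission
    V = [{t: start.get(t, NEG_INF) + emit.get((t, words[0]), NEG_INF)
          for t in tags}]
    back = []
    for w in words[1:]:
        scores, ptrs = {}, {}
        for t in tags:
            # best previous tag under transition score
            best = max(tags, key=lambda p: V[-1][p] + trans.get((p, t), NEG_INF))
            scores[t] = (V[-1][best] + trans.get((best, t), NEG_INF)
                         + emit.get((t, w), NEG_INF))
            ptrs[t] = best
        V.append(scores)
        back.append(ptrs)
    # follow backpointers from the best final tag
    last = max(tags, key=lambda t: V[-1][t])
    path = [last]
    for ptrs in reversed(back):
        path.append(ptrs[path[-1]])
    return list(reversed(path))
```

With a toy model where "the" is emitted by a determiner tag and "dog" by a noun tag, `viterbi(["the", "dog"], ...)` recovers the tag sequence `["D", "N"]`.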
smoothed-lm
Implementing smoothed n-gram language models.
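As an illustration of the technique named here, a bigram model with add-k smoothing can be sketched in a few lines. The `addk_bigram_lm` name and the sentence-boundary tokens are assumptions of this sketch, not details of the repo:

```python
from collections import Counter

def addk_bigram_lm(corpus, k=1.0):
    # corpus: list of sentences, each a list of tokens.
    # Returns a function prob(prev, w) = P(w | prev) under add-k smoothing:
    # (count(prev, w) + k) / (count(prev) + k * |V|)
    unigrams, bigrams = Counter(), Counter()
    vocab = set()
    for sent in corpus:
        toks = ["<s>"] + sent + ["</s>"]  # assumed boundary markers
        vocab.update(toks)
        unigrams.update(toks[:-1])        # contexts only
        bigrams.update(zip(toks, toks[1:]))
    V = len(vocab)

    def prob(prev, w):
        return (bigrams[(prev, w)] + k) / (unigrams[prev] + k * V)

    return prob
```

On the toy corpus `[["a", "b"], ["a", "b"]]` with k = 1, the vocabulary (including boundary markers) has 4 types, so P(b | a) = (2 + 1) / (2 + 4) = 0.5, while the unseen bigram P(a | a) gets the nonzero mass (0 + 1) / 6.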
sparse_coding
Using sparse coding to find distributed representations used by neural networks.
text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
transductions
A PyTorch framework for creating, running, and reproducing experiments on seq2seq models.
wiktionary-derivations-parser
For foreign editions of Wiktionary, extract derivations on each page (if they exist).