orionw

Orion Weller's repositories

RedditHumorDetection

Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"

Language:PythonMIT72 3 5

rJokesData

A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20)

Language:Python48 10

FollowIR

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Language:Python32 1 1

Multilingual-Federated-Learning

Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022

Language:Python10 10

MTLvsIFT

Code for the paper "When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning"

Language:Python6 10

humorTranslate

Using Machine Translation to "translate" non-humor into humor. Code for the paper "Humorous Headline Generation via Style Transfer" at FigLang 2020

Language:Python5 20

NevIR

Negation in Information Retrieval (EACL'24)

Language:PythonMIT4 1 1

according-to

Getting language models to quote from their pre-training data (EACL'24)

Language:PythonMIT1 10

configtune

An easy way to tune machine learning hyperparameters (especially for those that use a config file)

Language:PythonMIT1 3 17

DocumentReadingTime

Code and data from the ACL paper "You Don’t Have Time to Read This: an Exploration of Document-LevelReading Time Prediction"

Language:Python1 30

GANExperiments

My experiments with GANs

Language:Python1 30

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.0100

LM-expansions

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

Language:PythonMIT010

amti

A Mechanical Turk Interface (amti) 🤖

Language:PythonApache-2.0010

A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and Transferability of Contextual Representations", to appear at NAACL 2019.

Language:Python010

CS450

Labs and assignments for CS450: Computer Vision in Python

Language:Jupyter Notebook020

disinformation-defense

Defending Against Misinformation Attacks in Open-Domain Question Answering

Language:Python010

FedNLP

FedNLP: A Research Platform for Federated Learning in Natural Language Processing

Language:Python010

fisher-callhome-corpus

The Fisher and CALLHOME Spanish–English Speech Translation Corpus

Language:ErlangNOASSERTION010

InstructIR

IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focuses on user-aligned instructions tailored to each query instance.

Language:PythonMIT000