Orion Weller's repositories

RedditHumorDetection

Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"

Language:PythonLicense:MITStargazers:72Issues:3Issues:5

rJokesData

A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20)

Language:PythonStargazers:48Issues:1Issues:0

FollowIR

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Multilingual-Federated-Learning

Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022

Language:PythonStargazers:10Issues:1Issues:0

MTLvsIFT

Code for the paper "When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning"

Language:PythonStargazers:6Issues:1Issues:0

humorTranslate

Using Machine Translation to "translate" non-humor into humor. Code for the paper "Humorous Headline Generation via Style Transfer" at FigLang 2020

Language:PythonStargazers:5Issues:2Issues:0

NevIR

Negation in Information Retrieval (EACL'24)

Language:PythonLicense:MITStargazers:4Issues:1Issues:1

according-to

Getting language models to quote from their pre-training data (EACL'24)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

configtune

An easy way to tune machine learning hyperparameters (especially for those that use a config file)

Language:PythonLicense:MITStargazers:1Issues:3Issues:17

DocumentReadingTime

Code and data from the ACL paper "You Don’t Have Time to Read This: an Exploration of Document-LevelReading Time Prediction"

Language:PythonStargazers:1Issues:3Issues:0

GANExperiments

My experiments with GANs

Language:PythonStargazers:1Issues:3Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

LM-expansions

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

amti

A Mechanical Turk Interface (amti) 🤖

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

contextual-repr-analysis

A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and Transferability of Contextual Representations", to appear at NAACL 2019.

Language:PythonStargazers:0Issues:1Issues:0

CS450

Labs and assignments for CS450: Computer Vision in Python

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

disinformation-defense

Defending Against Misinformation Attacks in Open-Domain Question Answering

Language:PythonStargazers:0Issues:1Issues:0

FedNLP

FedNLP: A Research Platform for Federated Learning in Natural Language Processing

Language:PythonStargazers:0Issues:1Issues:0

fisher-callhome-corpus

The Fisher and CALLHOME Spanish–English Speech Translation Corpus

Language:ErlangLicense:NOASSERTIONStargazers:0Issues:1Issues:0

InstructIR

IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focuses on user-aligned instructions tailored to each query instance.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

ocaml-bert

Transformer-based models for Natural Language Processing in OCaml

Language:OCamlLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pyresparser

A simple resume parser used for extracting information from resumes

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

strategyqa

The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

streaming

A Data Streaming Library for Efficient Neural Network Training

License:Apache-2.0Stargazers:0Issues:0Issues:0

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0