Princeton Natural Language Processing

Princeton Natural Language Processing's repositories

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language: Python · License: MIT · 10454 78 201

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python · License: MIT · 4226 121 51

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python · License: MIT · 3265 27 264

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?

Language: Python · License: MIT · 1238 22 73

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Language: Python · License: MIT · 984 19 30

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language: Python · License: MIT · 467 26 68

ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Language: Python · License: MIT · 392 8 17

AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Language: Python · 235 9 18

LESS

Preprint: LESS: Selecting Influential Data for Targeted Instruction Tuning

Language: Jupyter Notebook · License: MIT · 232 4 14

WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Language: Python · License: MIT · 213 13 22

intercode

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Language: Python · License: MIT · 171 9 15

TransformerPrograms

[NeurIPS 2023] Learning Transformer Programs

Language: Python · 150 3 4

CEPE

Preprint: Long-Context Language Modeling with Parallel Encodings

Language: Python · License: MIT · 105 5 2

QuRating

Selecting High-Quality Data for Training Language Models

Language: Python · 92 7 2

LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Language: Python · License: MIT · 80 5 2

NLProofS

[EMNLP 2022] Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443

Language: Python · License: MIT · 79 6 3

USACO

Can Language Models Solve Olympiad Programming?

Language: Python · 79 40

MQuAKE

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Language: Jupyter Notebook · License: MIT · 76 6 10

LM-Kernel-FT

A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643

Language: Python · License: MIT · 68 8 1

c-sts

[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity

Language: Python · 60 4 5

Collie

[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks

Language: Jupyter Notebook · License: MIT · 46 70

MABEL

[EMNLP 2022] MABEL: Attenuating Gender Bias using Textual Entailment Data https://arxiv.org/abs/2210.14975

Language: Python · License: MIT · 36 4 5

LM-Science-Tutor

Language: Python · 2500

PTP

Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073

Language: Python · License: MIT · 18 6 1

corpus-poisoning

[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156

Language: Python · License: MIT · 15 7 1

SocraticAI

Problem solving by engaging multiple AI agents in conversation with each other and the user.

Language: Python · License: MIT · 9 10

lwm

We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Language: Python · License: GPL-3.0 · 7 9 2

Heuristic-Core

The code accompanying the paper "The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models" - https://arxiv.org/abs/2403.03942

Language: Python · License: MIT · 6 40

il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in NetHack"

Language: Python · 3 50

MoQA

Language: Python · 3 30