Princeton Natural Language Processing (princeton-nlp)

Princeton Natural Language Processing

princeton-nlp

Geek Repo

Home Page:http://nlp.cs.princeton.edu

Github PK Tool:Github PK Tool

Princeton Natural Language Processing's repositories

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language:PythonLicense:MITStargazers:9871Issues:77Issues:150

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:4149Issues:121Issues:51

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonLicense:MITStargazers:3242Issues:27Issues:262

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Language:PythonLicense:MITStargazers:1156Issues:23Issues:55

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Language:PythonLicense:MITStargazers:976Issues:19Issues:30

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:442Issues:27Issues:64

ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Language:PythonLicense:MITStargazers:382Issues:8Issues:17

AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

LESS

Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:212Issues:4Issues:10

WebShop

[NeurIPS 2022] đź›’WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Language:PythonLicense:MITStargazers:204Issues:13Issues:21

intercode

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Language:PythonLicense:MITStargazers:166Issues:9Issues:15

TransformerPrograms

[NeurIPS 2023] Learning Transformer Programs

CEPE

Preprint: Long-Context Language Modeling with Parallel Encodings

Language:PythonLicense:MITStargazers:100Issues:5Issues:2

QuRating

Selecting High-Quality Data for Training Language Models

NLProofS

EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443

Language:PythonLicense:MITStargazers:78Issues:6Issues:3

LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Language:PythonLicense:MITStargazers:74Issues:5Issues:2

MQuAKE

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Language:Jupyter NotebookLicense:MITStargazers:73Issues:6Issues:10

USACO

Can Language Models Solve Olympiad Programming?

Language:PythonStargazers:72Issues:0Issues:0

LM-Kernel-FT

A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643

Language:PythonLicense:MITStargazers:68Issues:8Issues:1

c-sts

[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity

Collie

[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks

Language:Jupyter NotebookLicense:MITStargazers:44Issues:7Issues:0

MABEL

EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975

Language:PythonLicense:MITStargazers:36Issues:4Issues:5
Language:PythonStargazers:22Issues:0Issues:0

PTP

Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073

Language:PythonLicense:MITStargazers:18Issues:6Issues:1

corpus-poisoning

[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156

Language:PythonLicense:MITStargazers:14Issues:7Issues:1

SocraticAI

Problem solving by engaging multiple AI agents in conversation with each other and the user.

Language:PythonLicense:MITStargazers:9Issues:1Issues:0

lwm

We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Language:PythonLicense:GPL-3.0Stargazers:7Issues:9Issues:2

Heuristic-Core

The code accompanying the paper "The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models" - https://arxiv.org/abs/2403.03942

Language:PythonLicense:MITStargazers:5Issues:4Issues:0

il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in NetHack"

Language:PythonStargazers:3Issues:5Issues:0
Language:PythonStargazers:3Issues:3Issues:0