SWE-bench

SWE-bench

Organization data from Github https://github.com/SWE-bench

Organization for maintaining the SWE-bench/agent projects

Home Page:https://swebench.com/

GitHub:@SWE-bench

SWE-bench's repositories

SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Language:PythonLicense:MITStargazers:3509Issues:31Issues:240

experiments

Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.

sb-cli

Run SWE-bench evaluations remotely

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

swe-bench.github.io

Landing page + leaderboard for SWE-Bench benchmark

Stargazers:0Issues:0Issues:0

humanevalfix-results

Evaluation data + results for SWE-agent inference on HumanEvalFix task

Language:Jupyter NotebookStargazers:0Issues:1Issues:0