SWE-bench's repositories
experiments
Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task.
humanevalfix-results
Evaluation data and results for SWE-agent inference on the HumanEvalFix task.
swe-bench.github.io
Landing page and leaderboard for the SWE-bench benchmark.