Evaluate against the SWE-bench benchmark
kripper opened this issue Β· comments
Christopher Pereira commented
Duplicates
- I have searched the existing issues
Summary π‘
Evaluate against the SWE-bench benchmark:
https://github.com/princeton-nlp/SWE-bench
Examples π
No response
Motivation π¦
Compare with and learn from other similar open source projects.
github-actions commented
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.
Nicholas Tindle commented
Unstale