NL2Code's repositories
NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
Language:Python000
experiments
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
Language:Python000