open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Home Page: https://opencompass.org.cn/

[Feature] Add some examples in the documentation of how to sandbox the humaneval code execution

sgjohnson1981 opened this issue

Describe the feature

Would it be possible to add some quick examples/recommendations as to how to sandbox the humaneval code execution?
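To make the request concrete, here is a minimal sketch of the kind of example that could go in the docs. It is not OpenCompass's API: `run_untrusted` and `_limit_resources` are hypothetical names, and the limits are arbitrary. It only shows the general idea of running generated code in a separate, time-limited process with a throwaway working directory. On its own this is not a real sandbox, since the child can still read the filesystem and use the network; stronger isolation would presumably mean running the whole evaluation inside a container or VM with networking disabled.

```python
import os
import subprocess
import sys
import tempfile

try:
    import resource  # POSIX only; unavailable on Windows
except ImportError:
    resource = None


def _limit_resources(cpu_seconds=5, max_file_bytes=1 << 20):
    """Runs in the child just before exec: lowers CPU-time and file-size limits."""
    for kind, value in ((resource.RLIMIT_CPU, cpu_seconds),
                        (resource.RLIMIT_FSIZE, max_file_bytes)):
        _, hard = resource.getrlimit(kind)
        if hard != resource.RLIM_INFINITY:
            value = min(value, hard)  # an unprivileged process cannot raise the hard limit
        resource.setrlimit(kind, (value, value))


def run_untrusted(code, timeout=10.0):
    """Hypothetical helper: execute `code` in a child interpreter and report the outcome."""
    # Pass through only the variables the interpreter may need to start,
    # so API keys and similar secrets in the parent environment are not inherited.
    env = {k: os.environ[k]
           for k in ("PATH", "LD_LIBRARY_PATH", "SYSTEMROOT")
           if k in os.environ}
    with tempfile.TemporaryDirectory() as workdir:
        script = os.path.join(workdir, "candidate.py")
        with open(script, "w", encoding="utf-8") as fh:
            fh.write(code)
        try:
            proc = subprocess.run(
                [sys.executable, "-I", script],  # -I: isolated mode, ignores PYTHON* vars and user site
                cwd=workdir,                     # relative writes land in the temp directory
                env=env,
                capture_output=True,
                text=True,
                timeout=timeout,                 # wall-clock limit; the child is killed on expiry
                preexec_fn=_limit_resources if resource else None,
            )
        except subprocess.TimeoutExpired:
            return {"status": "timeout", "stdout": "", "stderr": ""}
    status = "passed" if proc.returncode == 0 else "failed"
    return {"status": status, "stdout": proc.stdout, "stderr": proc.stderr}


if __name__ == "__main__":
    # A HumanEval-style check: candidate solution followed by its test assertions.
    program = "def add(a, b):\n    return a + b\n\nassert add(2, 3) == 5\n"
    print(run_untrusted(program)["status"])
```

A HumanEval-style harness would call something like this once per generated completion, concatenating the prompt, the completion, and the test code into one program string. A failed assertion gives a non-zero exit code (reported as `failed`), and an infinite loop is reported as `timeout`.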

Will you implement it?

  • I would like to implement this feature and create a PR!