Make evaluate() run test cases concurrently
penguine-ip opened this issue · comments
Currently, metrics for each test case is ran concurrently, but not test cases in a test run.
The LLM Evaluation Framework
penguine-ip opened this issue · comments
Currently, metrics for each test case is ran concurrently, but not test cases in a test run.