Giters
simonw
/
llm-evals-plugin
Run evals using LLM
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
20
Watchers:
6
Issues:
10
Forks:
simonw/llm-evals-plugin Issues
Evaluating RAG outputs?
Updated
5 months ago
Researching evaluations
Updated
5 months ago
Run a subset of MMLU
Updated
6 months ago
Comments count
9
Ability to store evals in the database and run them from there too
Updated
6 months ago
Comments count
1
Design and document checks and plugin hook
Updated
6 months ago
Comments count
5
Design and implement better error reporting
Updated
6 months ago
Comments count
3
Design and implement parameterization mechanism
Updated
5 months ago
Comments count
7
Log results to database
Updated
6 months ago
Ship alpha to PyPI
Closed
6 months ago
Comments count
2
Initial design
Closed
6 months ago
Comments count
19