danesherbs / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

License:Other


Languages

Language:Python 79.3%Language:Jupyter Notebook 13.5%Language:HTML 5.2%Language:Shell 1.6%Language:JavaScript 0.3%Language:Dockerfile 0.0%Language:Makefile 0.0%