aastroza / structured-generation-benchmark

Structured Generation Evals

structured-generation-benchmark

To use Large Language Models (LLMs) effectively and reliably, it's essential to apply structured generation techniques. Getting outputs that conform to a regular expression, a JSON schema, or a Pydantic data model is key to building useful software.

But what's the real effect of using libraries like Outlines or Instructor to achieve that goal?

This repository collects evaluations to answer that question.
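To make "structured output" concrete, here is a minimal sketch of the kind of check such an evaluation might run on raw model text. It uses only the Python standard library rather than Outlines, Instructor, or Pydantic, and the date pattern, the `name`/`age` record shape, and the helper names are all hypothetical, chosen for illustration rather than taken from this repository.

```python
import json
import re

# Hypothetical target format: an ISO-style date such as 2024-05-01.
DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def matches_date(output: str) -> bool:
    """Return True if the whole output (ignoring outer whitespace) is a date."""
    return DATE_PATTERN.fullmatch(output.strip()) is not None


def is_valid_record(output: str) -> bool:
    """Return True if the output is a JSON object with a string 'name' and an integer 'age'."""
    try:
        data = json.loads(output)
    except json.JSONDecodeError:
        return False
    if not isinstance(data, dict):
        return False
    name, age = data.get("name"), data.get("age")
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(name, str) and isinstance(age, int) and not isinstance(age, bool)


def validity_rate(outputs: list[str]) -> float:
    """Fraction of outputs that parse as a valid record."""
    return sum(is_valid_record(o) for o in outputs) / len(outputs)
```

An unconstrained model often wraps the payload in extra prose, such as `Sure! Here is the JSON: {...}`, which fails both checks. Structured generation libraries aim to remove that failure mode, and a validity rate like the one above is one simple way to quantify how often it occurs.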

Function Calling

This evaluation measures the ability of the LLM to call functions.
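As a rough illustration of how function calling can be scored, the sketch below parses each model output as a JSON call and compares it with an expected call. The `{"name": ..., "arguments": {...}}` shape and the exact-match metric are assumptions made for this example, and the evaluation in this repository may use a different format or metric.

```python
import json


def parse_call(output: str):
    """Parse a call of the assumed form {"name": str, "arguments": dict}.

    Returns a (name, arguments) tuple, or None if the output is malformed.
    """
    try:
        data = json.loads(output)
    except json.JSONDecodeError:
        return None
    if not isinstance(data, dict):
        return None
    name, arguments = data.get("name"), data.get("arguments")
    if not isinstance(name, str) or not isinstance(arguments, dict):
        return None
    return name, arguments


def call_accuracy(outputs: list[str], expected: list[tuple]) -> float:
    """Fraction of outputs whose parsed call exactly matches the expected call."""
    correct = sum(parse_call(o) == e for o, e in zip(outputs, expected))
    return correct / len(expected)


# Hypothetical example data, not drawn from the repository's datasets.
outputs = [
    '{"name": "get_weather", "arguments": {"city": "Paris"}}',
    "get_weather(Paris)",  # not valid JSON, so it counts as a miss
]
expected = [
    ("get_weather", {"city": "Paris"}),
    ("get_weather", {"city": "Paris"}),
]
accuracy = call_accuracy(outputs, expected)
```

Exact matching is strict: a call with the right function but a differently formatted argument counts as wrong. A softer metric could score the function name and each argument separately.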

Datasets

Evaluation

Reports

About

License:Apache License 2.0


Languages

Jupyter Notebook 92.3%, Python 7.7%