prometheus-eval

Organization data from GitHub: https://github.com/prometheus-eval

Codebase for running inference with and training foundation models that specialize in evaluating other foundation models

Location: United States of America

Home Page: https://seungonekim.github.io/

GitHub: @prometheus-eval

prometheus-eval's repositories

prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Language: Python | License: Apache-2.0 | Stargazers: 1009 | Issues: 2 | Issues: 42

prometheus

[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.

Language: Python | License: MIT | Stargazers: 306 | Issues: 4 | Issues: 17

prometheus-vision

[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.

Language: Python | License: Apache-2.0 | Stargazers: 78 | Issues: 1 | Issues: 4

scaling-evaluation-compute

Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"

Stargazers: 12 | Issues: 0 | Issues: 0

.github

Organization README for prometheus-eval

Stargazers: 0 | Issues: 1 | Issues: 0

leaderboard

BiGGen-Bench Leaderboard

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

prometheus-eval.github.io

Documentation and blog posts for Prometheus

Stargazers: 0 | Issues: 1 | Issues: 0