LMSYS's repositories
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
vicuna-blog-eval
The code and data for the GPT-4-based benchmark in the Vicuna blog post.