SeungoneKim

Seungone Kim's repositories

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Apache-2.0000

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonApache-2.0000

SICK_Summarization

[COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization

Language:Python2500

just-eval

A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.

MIT000

UniIR

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"

Language:PythonMIT100

An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubric, Prometheus-Vision is a good alternative for human evaluation and GPT-4V evaluation.

Apache-2.0100

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Apache-2.0100

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT100

e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

Apache-2.0200

screenshot-to-code

Drop in a screenshot and convert it to clean HTML/Tailwind/JS code

MIT100

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Apache-2.0100

prometheus

[NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.

Language:PythonMIT100