Seungone Kim's repositories
SeungoneKim
All About Me!
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
reward-bench
RewardBench: the first evaluation tool for reward models.
SICK_Summarization
[COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
prometheus-vision
An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.
e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
screenshot-to-code
Drop in a screenshot and convert it to clean HTML/Tailwind/JS code
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
prometheus
[NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.
litellm
Use any LLM as a drop-in replacement for OpenAI. Use Azure, OpenAI, Cohere, Anthropic, Ollama, VLLM, Sagemaker, HuggingFace, Replicate (100+ LLMs)
FLASK
Official codebase for "FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets"
trl
Train transformer language models with reinforcement learning.
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
web-llm
Bringing large language models and chat to web browsers. Everything runs inside the browser with no server support.
LongForm
Instruction Tuning Dataset and Models for Long Text Generation with Corpus Extraction
llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
RL4LMs
A modular RL library to fine-tune language models to human preferences