sanwooo

followers

following

stars

Sanwoo Lee's starred repositories

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language:PythonMIT234500

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT661600

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.03659000

Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

dnn-mode-connectivity

Mode Connectivity and Fast Geometric Ensembles in PyTorch

Language:PythonBSD-2-Clause26500

model_merging

Language:Python6100

mats

Language:Python2400

iclr2024-model-merging

This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.

Language:Python1600

AdaMerging

AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.

Language:PythonMIT4500

BayesianOptimization

A Python implementation of global optimization with gaussian processes.

Language:PythonMIT782600

qasc

Repository for the Question Answering via Sentence Composition (QASC) dataset

Language:PythonApache-2.05100

loss-landscape

Code for visualizing the loss landscape of neural nets

Language:PythonMIT278800

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language:PythonMIT35900

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language:PythonMIT41200

model-stock

Model Stock: All we need is just a few fine-tuned models

Language:Jupyter Notebook8000

nlp-uncertainty-zoo

Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.

Language:Python4500

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.0458700

ties-merging

Language:PythonBSD-3-Clause13800

task_vectors

Editing Models with Task Arithmetic

Language:Python41300

MergeLM

Codebase for Merging Language Models (ICML 2024)

Language:Python74900

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Language:PythonMIT14500

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookApache-2.0146400

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonApache-2.037600

composer

Supercharge Your Model Training

Language:PythonApache-2.0513100

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.0961800

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.01602100

LLM-AES

[arXiv] Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs

Language:Python1600

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookMIT917600

ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language:PythonApache-2.0369200

awesome-uncertainty-deeplearning

This repository contains a collection of surveys, datasets, papers, and codes, for predictive uncertainty estimation in deep learning models.

MIT54300