Sanwoo Lee's starred repositories

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language: Python | License: MIT | Stargazers: 2345 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6616 | Issues: 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36590 | Issues: 0

Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

Stargazers: 135 | Issues: 0

dnn-mode-connectivity

Mode Connectivity and Fast Geometric Ensembles in PyTorch

Language: Python | License: BSD-2-Clause | Stargazers: 265 | Issues: 0
Language: Python | Stargazers: 61 | Issues: 0
Language: Python | Stargazers: 24 | Issues: 0

iclr2024-model-merging

This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.

Language: Python | Stargazers: 16 | Issues: 0

AdaMerging

AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.

Language: Python | License: MIT | Stargazers: 45 | Issues: 0

BayesianOptimization

A Python implementation of global optimization with Gaussian processes.

Language: Python | License: MIT | Stargazers: 7826 | Issues: 0

qasc

Repository for the Question Answering via Sentence Composition (QASC) dataset

Language: Python | License: Apache-2.0 | Stargazers: 51 | Issues: 0

loss-landscape

Code for visualizing the loss landscape of neural nets

Language: Python | License: MIT | Stargazers: 2788 | Issues: 0

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language: Python | License: MIT | Stargazers: 359 | Issues: 0

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language: Python | License: MIT | Stargazers: 412 | Issues: 0

model-stock

Model Stock: All we need is just a few fine-tuned models

Language: Jupyter Notebook | Stargazers: 80 | Issues: 0

nlp-uncertainty-zoo

Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.

Language: Python | Stargazers: 45 | Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4587 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 138 | Issues: 0

task_vectors

Editing Models with Task Arithmetic

Language: Python | Stargazers: 413 | Issues: 0

MergeLM

Codebase for Merging Language Models (ICML 2024)

Language: Python | Stargazers: 749 | Issues: 0

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Language: Python | License: MIT | Stargazers: 145 | Issues: 0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1464 | Issues: 0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language: Python | License: Apache-2.0 | Stargazers: 376 | Issues: 0

composer

Supercharge Your Model Training

Language: Python | License: Apache-2.0 | Stargazers: 5131 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 9618 | Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 16021 | Issues: 0

LLM-AES

[arXiv] Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs

Language: Python | Stargazers: 16 | Issues: 0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 9176 | Issues: 0

ms-swift

Use PEFT or full-parameter training to fine-tune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3692 | Issues: 0

awesome-uncertainty-deeplearning

This repository contains a collection of surveys, datasets, papers, and code for predictive uncertainty estimation in deep learning models.

License: MIT | Stargazers: 543 | Issues: 0