Yuancheng Xu (Yuancheng-Xu)

Company: University of Maryland, College Park

Home Page: https://yuancheng-xu.github.io

Yuancheng Xu's starred repositories

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4156 | Issues: 0

llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

Language: Python | Stargazers: 161 | Issues: 0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 1220 | Issues: 0

rewardedsoups

Rewarded soups official implementation

Language: HTML | Stargazers: 39 | Issues: 0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language: Python | Stargazers: 477 | Issues: 0

awesome-llm-human-preference-datasets

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

License: MIT | Stargazers: 274 | Issues: 0

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python | License: Apache-2.0 | Stargazers: 614 | Issues: 0

RLHF-Reward-Modeling

Recipes to train reward models for RLHF.

Language: Python | License: Apache-2.0 | Stargazers: 351 | Issues: 0
Language: Python | Stargazers: 24 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5703 | Issues: 0

language-model-arithmetic

Controlled Text Generation via Language Model Arithmetic

Language: Python | License: MIT | Stargazers: 182 | Issues: 0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language: Python | License: Apache-2.0 | Stargazers: 268 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 80 | Issues: 0

awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

License: Apache-2.0 | Stargazers: 89 | Issues: 0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language: Python | License: MIT | Stargazers: 277 | Issues: 0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers: 2743 | Issues: 0

VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Language: Python | Stargazers: 2732 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 19950 | Issues: 0

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Language: Jupyter Notebook | License: MIT | Stargazers: 205 | Issues: 0

ToolEmu

A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use

Language: Python | License: Apache-2.0 | Stargazers: 95 | Issues: 0

curiosity_redteam

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Language: Jupyter Notebook | License: MIT | Stargazers: 47 | Issues: 0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache-2.0 | Stargazers: 2988 | Issues: 0

chain-of-hindsight

Chain-of-Hindsight, A Scalable RLHF Method

Language: Python | License: Apache-2.0 | Stargazers: 210 | Issues: 0

LLMAgentPapers

Must-read Papers on LLM Agents.

Stargazers: 1403 | Issues: 0

opacus

Training PyTorch models with differential privacy

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1642 | Issues: 0

WAVES

Code for our paper "Benchmarking the Robustness of Image Watermarks"

Language: Python | Stargazers: 30 | Issues: 0

LLM-Agents-Papers

A repo listing papers related to LLM-based agents

Language: Python | Stargazers: 803 | Issues: 0

VLM-Poison.github.io

Project Website for the paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"

Language: JavaScript | Stargazers: 1 | Issues: 0

VLM-Poisoning

Code for the paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"

Language: Python | Stargazers: 16 | Issues: 0

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Language: JavaScript | Stargazers: 1452 | Issues: 0