Harryis Wang (Harry-mic)

Harry-mic

Geek Repo

Company:Tsinghua

Github PK Tool:Github PK Tool

Harryis Wang's starred repositories

Visual-Adversarial-Examples-Jailbreak-Large-Language-Models

Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models

Language:PythonStargazers:132Issues:0Issues:0
Language:PythonStargazers:24Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1713Issues:0Issues:0

textgrad

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:961Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:277Issues:0Issues:0
Language:PythonLicense:MITStargazers:13Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4178Issues:0Issues:0

ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Language:PythonStargazers:126Issues:0Issues:0

LLM-Extrapolation

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Language:PythonStargazers:50Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:22193Issues:0Issues:0

sdft

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Language:ShellStargazers:56Issues:0Issues:0

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language:PythonLicense:MITStargazers:1266Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1603Issues:0Issues:0

CUT

Source code of "Reasons to Reject? Aligning Language Models with Judgments"

Language:PythonLicense:Apache-2.0Stargazers:52Issues:0Issues:0

reid-strong-baseline

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

License:MITStargazers:1Issues:0Issues:0

models

Models and examples built with TensorFlow

License:NOASSERTIONStargazers:1Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:1Issues:0Issues:0

TREvaL

Reasonable Reward Evaluation of Large Language Models

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1293Issues:0Issues:0

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

Language:PythonLicense:GPL-3.0Stargazers:1100Issues:0Issues:0

Finetune_LLAMA

简单易懂的LLaMA微调指南。

Language:PythonStargazers:309Issues:0Issues:0

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language:PythonLicense:NOASSERTIONStargazers:332Issues:0Issues:0

CycAug

[NeurIPS 2023] CycAug implementation from paper 'Learning Better with Less: Effective Augmentation for Sample-Efficient Visual RL'.

Language:PythonStargazers:3Issues:0Issues:0

LLM-Agent-Paper-Digest

papers related to LLM-agent that published on top conferences

Stargazers:291Issues:0Issues:0

promptbench

A unified evaluation framework for large language models

Language:PythonLicense:MITStargazers:2264Issues:0Issues:0

RL-ViGen

This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"

Language:PythonLicense:MITStargazers:80Issues:0Issues:0

DA-in-visualRL

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

Stargazers:68Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1223Issues:0Issues:0

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language:PythonLicense:MITStargazers:29Issues:0Issues:0

DI-adventure

Decision Intelligence Adventure for Beginners

Language:PythonLicense:Apache-2.0Stargazers:66Issues:0Issues:0