Haoran Ye (henry-yeh)

henry-yeh

Geek Repo

Company:Peking University

Location:Nanjing

Home Page:https://henry-yeh.github.io

Github PK Tool:Github PK Tool


Organizations
ai4co
Value4AI

Haoran Ye's starred repositories

illustration-collection

The collection of my research papers' illustrations.

Language:JavaScriptStargazers:14Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:15992Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29120Issues:0Issues:0

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language:PythonLicense:NOASSERTIONStargazers:332Issues:0Issues:0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1500Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1229Issues:0Issues:0

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language:PythonLicense:Apache-2.0Stargazers:8164Issues:0Issues:0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language:PythonLicense:Apache-2.0Stargazers:832Issues:0Issues:0

LLM-Optimizers-Papers

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

Stargazers:190Issues:0Issues:0

DREAMPlace

Deep learning toolkit-enabled VLSI placement

License:BSD-3-ClauseStargazers:1Issues:0Issues:0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonLicense:MITStargazers:2002Issues:0Issues:0

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:28961Issues:0Issues:0

DIFUSCO

Code of NeurIPS paper: arxiv.org/abs/2302.08224

Language:PythonLicense:MITStargazers:139Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:15792Issues:0Issues:0

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Language:PythonLicense:NOASSERTIONStargazers:580Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5740Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:95Issues:0Issues:0

Omni-VRP

[ICML 2023] "Towards Omni-generalizable Neural Methods for Vehicle Routing Problems"

Language:PythonLicense:MITStargazers:33Issues:0Issues:0
Language:HTMLStargazers:4Issues:0Issues:0

bcitoolbox

A package for Bayesian causal infe

Language:PythonStargazers:7Issues:0Issues:0

MetaBox

MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement Learning (https://arxiv.org/abs/2310.08252)

Language:PythonLicense:BSD-3-ClauseStargazers:50Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6Issues:0Issues:0

RecLicense

An open source license recommendation tool.

Language:PythonLicense:NOASSERTIONStargazers:11Issues:0Issues:0

rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Language:PythonLicense:MITStargazers:330Issues:0Issues:0

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Language:VueLicense:MITStargazers:5337Issues:0Issues:0

GNNPapers

Must-read papers on graph neural networks (GNN)

Stargazers:15709Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

DREAMPlace

Deep learning toolkit-enabled VLSI placement

Language:C++License:BSD-3-ClauseStargazers:640Issues:0Issues:0

awesome-ml4co

Awesome machine learning for combinatorial optimization papers.

Language:PythonStargazers:1543Issues:0Issues:0

The-Art-of-Linear-Algebra-zh-CN

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", 线性代数的艺术中文版, 欢迎PR.

Language:PostScriptLicense:CC0-1.0Stargazers:3757Issues:0Issues:0