liushunyu

Shunyu Liu's starred repositories

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:Python11223 163 65

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7331 138 15

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

Apache-2.06581 110 12

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Language:JavaScript2768 7 3

textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonMIT2151 21 89

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonMIT1952 115 18

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonMIT1939 6 61

theseus

A library for differentiable nonlinear optimization

Language:PythonMIT1854 32 195

Paper-Picture-Writing-Code

MLNLP: Paper Picture Writing Code

Language:TeX1112 18 1

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookApache-2.0813 9 19

CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Language:Jupyter NotebookApache-2.0532 3 23

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.

Language:PythonGPL-3.0398 7 62

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookMIT322 6 28

Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL)，including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

MIT301 12 3

Odyssey

Odyssey: Empowering Minecraft Agents with Open-World Skills

Language:PythonMIT299 4 4

Analytic-continual-learning

This repository will be posting analytic continual learning series, including Analytic Class-Incremental Learning (ACIL), Gaussian Kernel Embedded Analytic Learning (GKEAL), Dual-Stream Analytic Learning (DS-AL), etc.

Language:PythonMIT224 4 10

llm_benchmarks

Language:Python18500

fusion_bench

FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion

Language:PythonMIT117 4 11

learn_how_to_research

This is a repository for learning how to conduct academic research.

MIT81 10

Analytic-federated-learning

This repo will be continually updating analytic federated learning methods.

Language:Jupyter Notebook50 20

MAM

[IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map

Language:PythonApache-2.012 10