Shunyu Liu (liushunyu)

liushunyu

User data from Github https://github.com/liushunyu

Company:Nanyang Technological University

Location:Singapore

Home Page:https://liushunyu.github.io/

GitHub:@liushunyu

Shunyu Liu's starred repositories

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:2151Issues:21Issues:89

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1952Issues:115Issues:18

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonLicense:MITStargazers:1939Issues:6Issues:61

theseus

A library for differentiable nonlinear optimization

Language:PythonLicense:MITStargazers:1854Issues:32Issues:195

Paper-Picture-Writing-Code

MLNLP: Paper Picture Writing Code

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:813Issues:9Issues:19

CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:532Issues:3Issues:23

VectorizedMultiAgentSimulator

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.

Language:PythonLicense:GPL-3.0Stargazers:398Issues:7Issues:62

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookLicense:MITStargazers:322Issues:6Issues:28

Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

Odyssey

Odyssey: Empowering Minecraft Agents with Open-World Skills

Language:PythonLicense:MITStargazers:299Issues:4Issues:4

Analytic-continual-learning

This repository will be posting analytic continual learning series, including Analytic Class-Incremental Learning (ACIL), Gaussian Kernel Embedded Analytic Learning (GKEAL), Dual-Stream Analytic Learning (DS-AL), etc.

Language:PythonLicense:MITStargazers:224Issues:4Issues:10
Language:PythonStargazers:185Issues:0Issues:0

fusion_bench

FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion

Language:PythonLicense:MITStargazers:117Issues:4Issues:11

learn_how_to_research

This is a repository for learning how to conduct academic research.

License:MITStargazers:81Issues:1Issues:0

Analytic-federated-learning

This repo will be continually updating analytic federated learning methods.

Language:Jupyter NotebookStargazers:50Issues:2Issues:0

MAM

[IEEE Transactions on Power Systems] Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map

Language:PythonLicense:Apache-2.0Stargazers:12Issues:1Issues:0

Transformer-Doctor

Transformer Doctor: Diagnosing and Treating Vision Transformers

Language:PythonStargazers:10Issues:3Issues:0

STAR-TKDE

The official implementation of 'Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation'.

Language:PythonStargazers:10Issues:2Issues:0

Vision-Mamba-Mender

Vision Mamba Mender

Language:PythonStargazers:8Issues:2Issues:0

TPA-for-AVC

PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks (SIGKDD 2024)

Language:PythonStargazers:6Issues:2Issues:0
Language:PythonStargazers:4Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0

EmmaGNN

The official code for EmmaGNN

Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0
Stargazers:2Issues:0Issues:0

GIP-Framework

The realization of kdd 2024 accepted paper "Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks"

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0