Shangding Gu (chauncygu)

chauncygu

User data from Github https://github.com/chauncygu

Location:Berkeley, USA

GitHub:@chauncygu


Organizations
SafeRL-Lab

Shangding Gu's repositories

Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Language:Jupyter NotebookStargazers:698Issues:11Issues:1

Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language:PythonLicense:NOASSERTIONStargazers:165Issues:2Issues:10

Safe-Multi-Agent-Mujoco

Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.

Language:PythonLicense:MITStargazers:60Issues:1Issues:3

Safe-Multi-Agent-Isaac-Gym

Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.

Language:PythonLicense:Apache-2.0Stargazers:59Issues:4Issues:1

Safe-Multi-Agent-Robosuite

Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.

Language:PythonLicense:MITStargazers:18Issues:1Issues:1

Safe-Policy-Optimization-Serial-Version

This is a benchmark repository for safe reinforcement learning algorithms

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

Stargazers:1Issues:0Issues:0

Minigrid-work-python3.9

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ChatGPTAPIFree

A simple and open-source proxy API that allows you to access OpenAI's ChatGPT API for free!

Language:JavaScriptLicense:CC0-1.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

DB-Football

A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

DexterousHands

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Grounding_LLMs_with_online_RL_work

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Minecraft-work-python3.8

Simple Minecraft-inspired program using Python and Pyglet

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mtenv

MultiTask Environments for Reinforcement Learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

License:MITStargazers:0Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

README

README文件语法解读,即Github Flavored Markdown语法介绍

License:UnlicenseStargazers:0Issues:0Issues:0

rl_on_manifold

Robot Reinforcement Learning on the Constraint Manifold

Language:PythonStargazers:0Issues:0Issues:0

safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

semikong

First Open-Source Industry-Specific Model for Semiconductors

License:Apache-2.0Stargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TimeChamber-rl

A Massively Parallel Large Scale Self-Play Framework

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tree-of-thought-llm

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:0Issues:0Issues:0