chauncygu

Shangding Gu's repositories

Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Language: Jupyter Notebook; 698 11 1

Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language: Python; License: NOASSERTION; 165 2 10

Safe-Multi-Agent-Mujoco

Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.

Language: Python; License: MIT; 60 1 3

Safe-Multi-Agent-Isaac-Gym

Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.

Language: Python; License: Apache-2.0; 59 4 1

Safe-Multi-Agent-Robosuite

Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.

Language: Python; License: MIT; 18 1 1

Safe-Policy-Optimization-Serial-Version

This is a benchmark repository for safe reinforcement learning algorithms.

Language: Python; License: Apache-2.0; 200

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

100

Minigrid-work-python3.9

Simple and easily configurable grid world environments for reinforcement learning

Language: Python; License: NOASSERTION; 100

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook; License: Apache-2.0; 000

ChatGPTAPIFree

A simple and open-source proxy API that allows you to access OpenAI's ChatGPT API for free!

Language: JavaScript; License: CC0-1.0; 000

chauncygu

010

DB-Football

A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.

Language: Python; License: NOASSERTION; 000

DexterousHands

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym.

Language: Python; License: Apache-2.0; 000

fix-TEMPERA

Language: Python; 000

Grounding_LLMs_with_online_RL_work

We perform functional grounding of LLMs' knowledge in BabyAI-Text.

Language: Python; License: MIT; 000

MFL

000

Minecraft-work-python3.8

Simple Minecraft-inspired program using Python and Pyglet

Language: Python; License: MIT; 000

mtenv

MultiTask Environments for Reinforcement Learning.

Language: Python; License: MIT; 000

omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language: Python; License: Apache-2.0; 000

openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

License: MIT; 000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python; License: Apache-2.0; 000

README

An explanation of README file syntax, i.e., an introduction to GitHub Flavored Markdown syntax.

License: Unlicense; 000

rl_on_manifold

Robot Reinforcement Learning on the Constraint Manifold

Language: Python; 000

safety-gymnasium

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Language: Python; License: Apache-2.0; 000

sample-efficient-rl

010

semikong

First Open-Source Industry-Specific Model for Semiconductors

License: Apache-2.0; 000

tianshou

An elegant PyTorch deep reinforcement learning library.

Language: Python; License: MIT; 000

TimeChamber-rl

A Massively Parallel Large Scale Self-Play Framework

Language: Python; License: MIT; 000

tree-of-thought-llm

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python; License: MIT; 000

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python; 000