Jinbo He's starred repositories

DI-engine

OpenDILab Decision AI Engine

Language: Python | License: Apache-2.0 | Stargazers: 2652 | Issues: 0

MAIC

The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".

Language: Python | License: Apache-2.0 | Stargazers: 46 | Issues: 0

A2PO-ICLR2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Language: Python | License: MIT | Stargazers: 19 | Issues: 0

HARL

Official implementation of HARL algorithms based on PyTorch.

Language: Python | Stargazers: 376 | Issues: 0
Language: Python | Stargazers: 287 | Issues: 0

ExplorerPatcher

ExplorerPatcher Chinese L10n - Restore an efficient working environment on Windows 11

Language: C | License: GPL-2.0 | Stargazers: 1758 | Issues: 0

TIT_open_source

The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"

Language: Python | Stargazers: 50 | Issues: 0

morl-baselines

Multi-Objective Reinforcement Learning algorithm implementations.

Language: Python | License: MIT | Stargazers: 236 | Issues: 0

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

Language: Python | License: NOASSERTION | Stargazers: 15547 | Issues: 0

ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Language: Python | License: GPL-3.0 | Stargazers: 14897 | Issues: 0

light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Language: Python | Stargazers: 408 | Issues: 0

NoteWidget

Markdown add-in for Microsoft Office OneNote

Language: C# | License: Apache-2.0 | Stargazers: 150 | Issues: 0

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language: Python | License: Apache-2.0 | Stargazers: 866 | Issues: 0

Hands-on-RL

https://hrl.boyuai.com/

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2050 | Issues: 0

MADDPG_torch

Code for MADDPG using PyTorch

Language: Python | Stargazers: 156 | Issues: 0

Vehicular-Trajectories-Processing-for-Didi-Open-Data

Vehicular trajectories processing for Didi GAIA Open Data Set

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 33 | Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 8140 | Issues: 0

awesome-reinforcement-learning-lib

GitHub's code repository is all you need

Stargazers: 295 | Issues: 0

maddpg-pettingzoo-pytorch

Implementation of MADDPG using PettingZoo and PyTorch

Language: Python | Stargazers: 77 | Issues: 0

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language: Python | License: NOASSERTION | Stargazers: 2409 | Issues: 0

warp-drive

Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)

Language: Python | License: BSD-3-Clause | Stargazers: 437 | Issues: 0

pytorch-lightning-template

An easy, quick-to-adapt PyTorch-Lightning template. It is a simple wrapper template: with minor changes to your original PyTorch code, you can adapt it to Lightning. You can port your existing PyTorch code much more easily with this template while keeping the freedom to edit every function. It is also friendly to large projects, and there is no need to rewrite your config in Hydra.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1233 | Issues: 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 15436 | Issues: 0

maddpg-pytorch

PyTorch implementation of MADDPG (Lowe et al., 2017)

Language: Python | License: MIT | Stargazers: 536 | Issues: 0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language: Python | License: MIT | Stargazers: 1549 | Issues: 0

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language: Python | License: MIT | Stargazers: 2238 | Issues: 0

rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Language: Python | License: MIT | Stargazers: 279 | Issues: 0

g-helper

Lightweight Armoury Crate alternative for Asus laptops and ROG Ally. Control tool for ROG Zephyrus G14, G15, G16, M16, Flow X13, Flow X16, TUF, Strix, Scar and other models

Language: C# | License: GPL-3.0 | Stargazers: 5367 | Issues: 0

rl

A modular, primitive-first, Python-first PyTorch library for Reinforcement Learning.

Language: Python | License: MIT | Stargazers: 1946 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 31562 | Issues: 0