Xihuai Wang (xihuai18)

xihuai18

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai, China

Home Page:https://xihuai18.github.io/

Github PK Tool:Github PK Tool

Xihuai Wang's repositories

A2PO-ICLR2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Language:PythonLicense:MITStargazers:25Issues:1Issues:2
Language:PythonLicense:NOASSERTIONStargazers:16Issues:2Issues:2

awesome-RL-generalization

A list of papers regarding generalization in (deep) reinforcement learning

AlphaZero-for-Othello

(Re)-Implementation of alphazero for othello using pytorch

Language:PythonLicense:MITStargazers:4Issues:1Issues:1

Common-Cooperative-Multi-Agent-Environments

Commonly-used Cooperative Multi-agent Environments Installation, Convenient Wrappers, and VectorEnv Implementation with PettingZoo (and Gymnasium) Compatibility.

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

MARL-Comm

Basic MARL algorithms with Communication

Language:PythonStargazers:3Issues:1Issues:0

Operating-System-Project

Project for Operating System Course, Semester 2018 Spring.

Language:HTMLLicense:MITStargazers:3Issues:1Issues:0

RL-Proofs

Some fundamental proofs in Reinforcement Learning.

License:MITStargazers:2Issues:2Issues:0

GFootball-Gymnasium-Pettingzoo

Google Research Football with gymnasium support.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

Go-Distributed-Storage-Service

A distributed storage service developed by Golang.

Language:GoLicense:MITStargazers:1Issues:1Issues:0

Image-Processing-in-CUDA

Implementation of Image Processing Method

Language:CudaLicense:MITStargazers:1Issues:1Issues:0

MaMuJoCo-PettingZoo

MaMuJoCo from https://github.com/Farama-Foundation/Gymnasium-Robotics with Convenient Wrappers and Utilities.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

pysc2

StarCraft II Learning Environment

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

RadixSort-Cuda

RadixSort using CUDA

Language:CudaStargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:VerilogLicense:MITStargazers:1Issues:1Issues:0

Virtual-Routing

Simulating self-organized routing & centralized routing

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

cleanMAPG

High-quality single file implementation of Multi-agent Policy Gradient algorithms with research-friendly features (MAPPO, A2PO).

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

openbilibili-go-common

听说这是来自 https://github.com/openbilibili/go-common/ 的 “哔哩哔哩 bilibili 网站后台工程 源码”,不过咱也不知道这是啥。

Stargazers:0Issues:0Issues:0

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

License:MITStargazers:0Issues:0Issues:0

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Recommendation-System-Based-on-MPI-OpenMP

A distributed and parallel recommendation system

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

Reinforcement-Learning-Notes

Reinforcement Learning Notes for Reinforcement Learning: An Introduction (2nd Edition) and David Silver's Reinforcement Learning Course in UCL

Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0