xihuai18

Xihuai Wang's repositories

A2PO-ICLR2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

Language:PythonMIT25 1 2

arxiv-sanity-x

Language:PythonNOASSERTION16 2 2

awesome-RL-generalization

A list of papers regarding generalization in (deep) reinforcement learning

10 10

AlphaZero-for-Othello

(Re)-Implementation of alphazero for othello using pytorch

Language:PythonMIT4 1 1

Common-Cooperative-Multi-Agent-Environments

Commonly-used Cooperative Multi-agent Environments Installation, Convenient Wrappers, and VectorEnv Implementation with PettingZoo (and Gymnasium) Compatibility.

Language:PythonMIT300

MARL-Comm

Basic MARL algorithms with Communication

Language:Python3 10

Operating-System-Project

Project for Operating System Course, Semester 2018 Spring.

Language:Assembly3 1 1

xihuai18.github.io

Language:HTMLMIT3 10

RL-Proofs

Some fundamental proofs in Reinforcement Learning.

MIT2 20

Computer-Organization-And-Design-Review

1 10

GFootball-Gymnasium-Pettingzoo

Google Research Football with gymnasium support.

Language:PythonApache-2.0100

Go-Distributed-Storage-Service

A distributed storage service developed by Golang.

Language:GoMIT1 10

Image-Processing-in-CUDA

Implementation of Image Processing Method

Language:CudaMIT1 10

MaMuJoCo-PettingZoo

MaMuJoCo from https://github.com/Farama-Foundation/Gymnasium-Robotics with Convenient Wrappers and Utilities.

Language:PythonMIT1 10

pysc2

StarCraft II Learning Environment

Language:PythonApache-2.0100

RadixSort-Cuda

RadixSort using CUDA

Language:Cuda1 10

SMAC-PettingZoo

Language:PythonMIT1 10

Stream-multiprocessor-design

Language:VerilogMIT1 10

Virtual-Routing

Simulating self-organized routing & centralized routing

Language:PythonMIT1 20

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Apache-2.0000

C-S-and-P2P-demo

Language:PythonMIT020

cleanMAPG

High-quality single file implementation of Multi-agent Policy Gradient algorithms with research-friendly features (MAPPO, A2PO).

Language:PythonNOASSERTION000

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT000

Multi-threaded-Queue

Language:C++000

openbilibili-go-common

听说这是来自 https://github.com/openbilibili/go-common/ 的 “哔哩哔哩 bilibili 网站后台工程源码”，不过咱也不知道这是啥。

000

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

MIT000

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonNOASSERTION000

Recommendation-System-Based-on-MPI-OpenMP

A distributed and parallel recommendation system

Language:C++Apache-2.0000

Reinforcement-Learning-Notes

Reinforcement Learning Notes for Reinforcement Learning: An Introduction (2nd Edition) and David Silver's Reinforcement Learning Course in UCL

Language:Jupyter Notebook010

xihuai18

010