HXT (tjuHaoXiaotian)

Company: tju student

HXT's repositories

pymarl3

We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.

Language: Python | License: Apache-2.0 | Stargazers: 106 | Issues: 3 | Issues: 9

GASIL

Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems

ICML-2020-MSBCB

Code for the ICML 2020 paper "Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising"

Qfamily_for_MatrixGame

We provide a very simple implementation of typical value decomposition methods for solving single-state matrix games.

Language: Python | License: MIT | Stargazers: 14 | Issues: 2 | Issues: 0

MA-MuZero

MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.

InterestDemo

A simple machine learning front-end mini program (with visualization)

Language: JavaScript | Stargazers: 1 | Issues: 2 | Issues: 0

NRLPapers

Must-read papers on network representation learning (NRL) / network embedding (NE)

Language: TeX | Stargazers: 1 | Issues: 2 | Issues: 0

pymarl_alpha

Alpha code release for Python Multi-Agent Reinforcement Learning framework

Language: Python | Stargazers: 1 | Issues: 2 | Issues: 0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

easy-tf-log

Easy TensorFlow logging for quick prototypes

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

leetcode-master

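LeetCode study guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages, so that studying algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀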

Stargazers: 0 | Issues: 1 | Issues: 0

ma-gym

A collection of multi-agent environments based on OpenAI Gym.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Machine-Learning-Notes

Course notes for the Whiteboard Derivation series (first edition)

Stargazers: 0 | Issues: 1 | Issues: 0

MAgent

A Platform for Many-agent Reinforcement Learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Paper-Writing-Tips

Paper Writing Tips

Stargazers: 0 | Issues: 1 | Issues: 0

PettingZoo

Gym for multi-agent reinforcement learning

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

PIC

PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

smarts_track2

Track 2 code for the SMARTS competition at NeurIPS 2022

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

the-gan-zoo

A list of all named GANs!

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0