Beast code in Giters

燦哲's repositories

Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Language: Python

GraphConUCB

Language: Shell

Differentially Private Temporal Difference Learning (DPTD)

Language: Shell

Some mathematics on my journey of machine learning.

Source code for the AAAI 2021 paper "Near-Optimal MNL Bandits Under Risk Criteria"

Language: Python

Language: TeX

FTRL-PBM

Language: Python

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language: Python, License: MIT

A TensorFlow implementation of RippleNet

Language: Python, License: MIT

Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)

Language: Python, License: MIT

Summer training project: CAPTCHA recognition

Language: Python, License: MIT

GLaDOS AutoCheckIn: scheduled automatic check-in

License: BSD-3-Clause