燦哲's repositories
Graph-Conversational-UCB
GraphConUCB
Differentially-Private-TD-Learning
Differentially Private Temporal Difference Learning (DPTD)
In-Mathematics-We-Trust
Some mathematics on my journey of machine learning.
aaai2021
Source code for the AAAI 2021 paper "Near-Optimal MNL Bandits Under Risk Criteria"
Language: Python
Language: TeX
BOBW-Bandits-under-PBM
FTRL-PBM
off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Language: Python · License: MIT
RippleNet
A TensorFlow implementation of RippleNet
Language: Python · License: MIT
SARNet
Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)
Language: Python · License: MIT
SummerProject
Summer training project: CAPTCHA recognition
Language: Python · License: MIT
GLaDOS-CheckIn
GLaDOS AutoCheckIn: scheduled automatic check-in
License: BSD-3-Clause