Joy Jiang (Joy1112)

Joy1112

Geek Repo

Company:Tsinghua University

Location:Beijing, China

Github PK Tool:Github PK Tool


Organizations
thu-rllab

Joy Jiang's repositories

Language:PythonStargazers:1Issues:0Issues:0

pymarl-football

RIIT: Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

vanilla-docker-images

Build a vanilla docker image with Anaconda3 & Pytorch (cuda version) installed.

Language:DockerfileStargazers:1Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

football

Check out the new game server:

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fucking-algorithm

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Language:MarkdownStargazers:0Issues:0Issues:0

gr2

Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MARL-Doc

The Documentation for the MARL PLATFORM

Language:HTMLStargazers:0Issues:0Issues:0

ml-protobuf

Protocol Buffers for machine learning projects, supporting Numpy & Pytorch.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MusicBox

:blush: :musical_note: MusicPlayer 一站式收听多平台音乐(网易云, 虾米, QQ)的跨平台音乐播放器,尽情享受吧~:sparkles:

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mx-DeepIM

Deep Iterative Matching for 6D Pose Estimation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PPO-clip-and-PPO-penalty-on-Atari-Domain

Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty

Language:PythonStargazers:0Issues:0Issues:0

PPO-Gluon

Implementation of PPO in Gluon / MXnet

Language:PythonStargazers:0Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-mobilenet-v2

A PyTorch implementation of MobileNet V2 architecture and pretrained model.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

License:Apache-2.0Stargazers:0Issues:0Issues:0

UCRL_implementation

Various implementations and modification of algorithm around UCRL.

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0