Beast code in Giters

Joy Jiang's repositories

nt_transformer

Language:Python100

pymarl-football

RIIT: Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonApache-2.0100

vanilla-docker-images

Build a vanilla docker image with Anaconda3 & Pytorch (cuda version) installed.

Language:Dockerfile100

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT000

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonApache-2.0000

Collaborative-Filtering-Classical-Algorithm

Language:Python000

DRL_HW

Language:Python000

football

Check out the new game server:

Language:PythonApache-2.0000

fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

Language:Markdown000

gr2

Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning

Language:PythonMIT000

MARL-Doc

The Documentation for the MARL PLATFORM

Language:HTML000

ml-protobuf

Protocol Buffers for machine learning projects, supporting Numpy & Pytorch.

Language:PythonApache-2.0000

MusicBox

:blush: :musical_note: MusicPlayer 一站式收听多平台音乐(网易云, 虾米, QQ)的跨平台音乐播放器，尽情享受吧~:sparkles:

Language:PythonMIT000

mx-DeepIM

Deep Iterative Matching for 6D Pose Estimation

Language:PythonApache-2.0000

PPO-clip-and-PPO-penalty-on-Atari-Domain

Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty

Language:Python000

PPO-Gluon

Implementation of PPO in Gluon / MXnet

Language:Python000

pymarl

Python Multi-Agent Reinforcement Learning framework

Apache-2.0000

pytorch-mobilenet-v2

A PyTorch implementation of MobileNet V2 architecture and pretrained model.

Language:PythonApache-2.0000

Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials

Language:PythonMIT000

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Apache-2.0000

UCRL_implementation

Various implementations and modification of algorithm around UCRL.

Language:Python000

url_benchmark_fork

Language:PythonMIT000