Beast code in Giters

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Language:Python100900

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonMIT158600

off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language:PythonMIT37900

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT5237800

Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language:PythonNOASSERTION13400

TRPO-in-MARL

Language:PythonMIT17800

DRL-Networking

Research on incentive mechanism design in mobile crowdsensing and mobile edge computing by deep reinforcement learning approaches.

Language:Python11100

DC-DRL

Language:Python1500

v2ray

VPS搭建VPN教程2019-V2ray教程

4400

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language:PythonApache-2.058400

dfac

[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning

Language:PythonApache-2.02900

evolution-strategies-starter

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Language:PythonMIT154900

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonApache-2.0179100

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonMIT419500

deeprl_network

multi-agent deep reinforcement learning for networked system control.

Language:Python36800

Paper-with-Code-of-Wireless-communication-Based-on-DL

无线与深度学习结合的论文代码整理/Paper-with-Code-of-Wireless-communication-Based-on-DL

178700

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:Python138200