lhc0512's starred repositories
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
CS-Xmind-Note
计算机专业课(408)思维导图和笔记:计算机组成原理(第五版 王爱英),数据结构(王道),计算机网络(第七版 谢希仁),操作系统(第四版 汤小丹)
DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
DRL-Networking
Research on incentive mechanism design in mobile crowdsensing and mobile edge computing by deep reinforcement learning approaches.
evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
deeprl_network
multi-agent deep reinforcement learning for networked system control.
Paper-with-Code-of-Wireless-communication-Based-on-DL
无线与深度学习结合的论文代码整理/Paper-with-Code-of-Wireless-communication-Based-on-DL
MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II