lhc0512

Company: China

Location: Guangzhou

lhc0512's starred repositories

d2l-java

The Java implementation of Dive into Deep Learning (D2L.ai)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 168 | Issues: 0

tablesaw

Java dataframe and visualization library

Language: Java | License: Apache-2.0 | Stargazers: 3497 | Issues: 0

DRL

Deep Reinforcement Learning

License: NOASSERTION | Stargazers: 3103 | Issues: 0

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Language: Python | License: NOASSERTION | Stargazers: 593 | Issues: 0

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language: Python | License: MIT | Stargazers: 6407 | Issues: 0

djl

An Engine-Agnostic Deep Learning Framework in Java

Language: Java | License: Apache-2.0 | Stargazers: 4023 | Issues: 0

MSRGCN

Official implementation of MSR-GCN (ICCV2021 paper)

Language: Python | Stargazers: 61 | Issues: 0

minimalRL

Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)

Language: Python | License: MIT | Stargazers: 2813 | Issues: 0

EDAC

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Language: Python | License: MIT | Stargazers: 67 | Issues: 0

LeetCode

My C++ Code for LeetCode OJ

Language: C++ | Stargazers: 1313 | Issues: 0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language: Python | License: Apache-2.0 | Stargazers: 439 | Issues: 0

CS-Xmind-Note

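Mind maps and notes for core computer science courses (408 exam): Computer Organization (5th edition, Wang Aiying), Data Structures (Wangdao), Computer Networks (7th edition, Xie Xiren), Operating Systems (4th edition, Tang Xiaodan)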

License: Apache-2.0 | Stargazers: 8876 | Issues: 0

CS-Notes

:books: Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, system design

Stargazers: 174011 | Issues: 0

DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Language: Python | Stargazers: 1009 | Issues: 0

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python | License: MIT | Stargazers: 1586 | Issues: 0

off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language: Python | License: MIT | Stargazers: 379 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 52378 | Issues: 0

Multi-Agent-Constrained-Policy-Optimisation

Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).

Language: Python | License: NOASSERTION | Stargazers: 134 | Issues: 0
Language: Python | License: MIT | Stargazers: 178 | Issues: 0

DRL-Networking

Research on incentive mechanism design in mobile crowdsensing and mobile edge computing by deep reinforcement learning approaches.

Language: Python | Stargazers: 111 | Issues: 0
Language: Python | Stargazers: 15 | Issues: 0

v2ray

Tutorial on setting up a VPN on a VPS (2019): V2ray tutorial

Stargazers: 44 | Issues: 0

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language: Python | License: Apache-2.0 | Stargazers: 584 | Issues: 0

dfac

[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning

Language: Python | License: Apache-2.0 | Stargazers: 29 | Issues: 0

evolution-strategies-starter

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Language: Python | License: MIT | Stargazers: 1549 | Issues: 0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language: Python | License: Apache-2.0 | Stargazers: 1791 | Issues: 0

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language: Python | License: MIT | Stargazers: 4195 | Issues: 0

deeprl_network

Multi-agent deep reinforcement learning for networked system control.

Language: Python | Stargazers: 368 | Issues: 0

Paper-with-Code-of-Wireless-communication-Based-on-DL

A curated collection of papers with code that combine wireless communication and deep learning / Paper-with-Code-of-Wireless-communication-Based-on-DL

Stargazers: 1787 | Issues: 0

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language: Python | Stargazers: 1382 | Issues: 0