SaneLYX

SaneLYX

Geek Repo

Location:China

Github PK Tool:Github PK Tool

SaneLYX's repositories

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MASA-QMIX

Paper《Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning》

Stargazers:0Issues:0Issues:0

on-policy

Paper《The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games》

License:MITStargazers:0Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

License:Apache-2.0Stargazers:0Issues:0Issues:0

Multi-Agent-Transformer

Paper《Multi-Agent Reinforcement Learning is a Sequence Modeling Problem》

Stargazers:0Issues:0Issues:0

TRPO-in-MARL

Paper《Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning》

License:MITStargazers:0Issues:0Issues:0

TSP_Att-GCRN-MCTS

Paper《Generalize a Small Pre-trained Model to Arbitrarily Large TSP Instances》

License:MITStargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

License:Apache-2.0Stargazers:0Issues:0Issues:0

or-tools

Google's Operations Research tools:

License:Apache-2.0Stargazers:0Issues:0Issues:0

keras

Deep Learning for humans

License:Apache-2.0Stargazers:0Issues:0Issues:0

easy-rl

强化学习中文教程(蘑菇书),在线阅读地址:https://datawhalechina.github.io/easy-rl/

License:NOASSERTIONStargazers:0Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:0Issues:0Issues:0

End-to-end-DRL-for-FJSP

Paper《A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem》

Stargazers:0Issues:0Issues:0

pumpkin-book

《机器学习》(西瓜书)公式详解

License:NOASSERTIONStargazers:0Issues:0Issues:0

ElegantRL

Cloud-native Deep Reinforcement Learning. 🔥

License:NOASSERTIONStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

License:MITStargazers:0Issues:0Issues:0

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60个国家的400所大学用于教学。

License:Apache-2.0Stargazers:0Issues:0Issues:0

lihang-code

《统计学习方法》的代码实现

Stargazers:0Issues:0Issues:0

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Stargazers:0Issues:0Issues:0

L2D

Paper《Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning》

Stargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

License:MITStargazers:0Issues:0Issues:0

nndl.github.io

《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning

Stargazers:0Issues:0Issues:0

RoutingProblemGANN

Paper《Solve routing problems with a residual edge-graph attention neural network》

Stargazers:0Issues:0Issues:0

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

License:MITStargazers:0Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

License:MITStargazers:0Issues:0Issues:0

Reinforcement-Implementation

Implementation of benchmark RL algorithms

Stargazers:0Issues:0Issues:0

fjsp-drl

Paper《Flexible Job Shop Scheduling via Graph Neural Network and Deep Reinforcement Learning》

License:Apache-2.0Stargazers:0Issues:0Issues:0

scikit-opt

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman)

License:MITStargazers:0Issues:0Issues:0

noisy-mappo

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

License:MITStargazers:0Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

License:MITStargazers:0Issues:0Issues:0