AndyYue1893's repositories

COVID-19-SEIR-LSTM

本项目实现2019新型冠状病毒肺炎预测,分别采用经典传染病动力学模型SEIR和LSTM神经网络实现,通过控制模型参数来改变干预程度,体现防控的意义。

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonLicense:MITStargazers:48Issues:1Issues:0

pwc

Papers with code. Sorted by stars. Updated weekly.

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:5Issues:1Issues:0

DeepRL-1

Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone

License:MITStargazers:3Issues:1Issues:0

Safe-Reinforcement-Learning-Baseline

The repository is for safe reinforcement learning baselines.

Language:Jupyter NotebookStargazers:2Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language:PythonStargazers:1Issues:1Issues:0

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

ElegantRL

Lightweight, stable, efficient PyTorch implement of reinforcement learning. I want to call this PyTorch implement as "3-Python-file-RL".

Language:PythonLicense:NOASSERTIONStargazers:1Issues:1Issues:0

ml-agents

Unity Machine Learning Agents Toolkit

License:Apache-2.0Stargazers:1Issues:0Issues:0

SQDDPG

This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.

Language:PythonStargazers:1Issues:1Issues:0

wqmix

Code for Weighted QMIX

Stargazers:1Issues:0Issues:0

adeptRL

Reinforcement learning framework to accelerate research

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

arxiv.py

Python wrapper for the arXiv API

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CloseAirCombat

An environment based on JSBSIM aimed at one-to-one close air combat.

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

License:MITStargazers:0Issues:0Issues:0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Language:PythonStargazers:0Issues:1Issues:0

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Reinforcement-Implementation

Implementation of benchmark RL algorithms

Language:PythonStargazers:0Issues:1Issues:0

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

RL-Implementation

simple code to reinforcement learning

Stargazers:0Issues:0Issues:0

smac

SMAC: The StarCraft Multi-Agent Challenge

License:MITStargazers:0Issues:0Issues:0

StarCraft

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Stargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0