Blank Shuo's repositories

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

Stargazers: 0 · Issues: 0

BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

CQL

Code for conservative Q-learning

Language: Python · Stargazers: 0 · Issues: 0

CQL-1

PyTorch implementation of the offline reinforcement learning algorithm CQL. Includes DQN-CQL for discrete action spaces and SAC-CQL for continuous action spaces.

Language: Python · Stargazers: 0 · Issues: 0

CQL-2

Conservative Q-Learning on top of SAC

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

d3rlpy

An offline deep reinforcement learning library

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

D4RL

A benchmark for offline reinforcement learning.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

DI-engine

OpenDILab Decision AI Engine

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

Offline-Online-RL

Code for OFFLINE-ONLINE REINFORCEMENT LEARNING: EXTENDING BATCH AND ONLINE RL

Language: Python · Stargazers: 0 · Issues: 0

offline_rl

PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.

Language: Python · Stargazers: 0 · Issues: 0

Papers-of-Offline-RL

Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling, and on conventional offline RL)

Stargazers: 0 · Issues: 0

pytorch-soft-actor-critic

PyTorch implementation of Soft Actor-Critic

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook · License: MIT · Stargazers: 0 · Issues: 0

rl-plotter

✨ A plotter for reinforcement learning (RL)

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

RL-Unplugged-tfds-

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 0 · Issues: 0

rlkit

Collection of reinforcement learning algorithms

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

sac

Soft Actor-Critic

Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 0

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

TD3_BC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

td3_bc_jax

Direct port of TD3_BC to JAX using Haiku and Optax.

Language: Python · Stargazers: 0 · Issues: 0

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

YOLOAir

🔥🔥🔥 YOLOv7, YOLOv5, YOLOv4, Transformer, YOLOX, YOLOR, YOLOv3 and Improved-YOLOv5... Supports improving the backbone, head, loss, IoU, NMS, and other modules

Language: Python · License: GPL-3.0 · Stargazers: 0 · Issues: 0