Blank Shuo's repositories

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Language: Python | License: MIT

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language: Python | License: Apache-2.0

CQL

Code for conservative Q-learning

Language: Python

CQL-1

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Language: Python

CQL-2

Conservative Q Learning on top of SAC

Language: Python | License: MIT

d3rlpy

An offline deep reinforcement learning library

Language: Python | License: MIT

D4RL

A benchmark for offline reinforcement learning.

Language: Python | License: Apache-2.0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python | License: MIT

DI-engine

OpenDILab Decision AI Engine

Language: Python | License: Apache-2.0

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language: Python | License: Apache-2.0

Offline-Online-RL

Code for OFFLINE-ONLINE REINFORCEMENT LEARNING: EXTENDING BATCH AND ONLINE RL

Language: Python

offline_rl

PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.

Language: Python

Papers-of-Offline-RL

Related papers for offline reinforcement learning (mainly focused on representation and sequence modeling, plus conventional offline RL)

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

Language: Python | License: MIT

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook | License: MIT

rl-plotter

A plotter for reinforcement learning (RL)

Language: Python | License: MIT

RL-Unplugged-tfds-

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | License: Apache-2.0

rlkit

Collection of reinforcement learning algorithms

Language: Python | License: MIT

sac

Soft Actor-Critic

Language: Python | License: NOASSERTION

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Language: Python | License: MIT

TD3_BC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Language: Python | License: MIT

td3_bc_jax

Direct port of TD3_BC to JAX using Haiku and optax.

Language: Python

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language: Python | License: MIT

YOLOAir

🔥🔥🔥 YOLOv7, YOLOv5, YOLOv4, Transformer, YOLOX, YOLOR, YOLOv3 and Improved-YOLOv5... Supports improving the backbone, head, loss, IoU, NMS and other modules

Language: Python | License: GPL-3.0