R-Ceph

never's repositories

Hierarchical-Actor-Critic-HAC-PyTorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

Language:PythonMIT100

baselines

Baselines for Neural MMO -- new users should treat this repo as a starter project

Language:PythonMIT000

bd_rd_psro

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Language:Python000

BOReL

Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.

000

CARL

https://carl.readthedocs.io/en/latest/

Language:PythonApache-2.0000

Griddly

A grid-world game engine for game AI research

Language:C++MIT000

InsertionAI

Residual Reinforcement Learning used for insertion

000

It is my belief that you the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.

000

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Language:PythonMIT000

Kaggle_Lux_AI_2021

Language:Jupyter NotebookMIT000

language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

Language:Jupyter NotebookMIT000

LSD

Lipschitz-constrained Unsupervised Skill Discovery

Language:PythonMIT000

ML-For-Beginners

12 weeks, 24 lessons, classic Machine Learning for all

MIT000

Off2OnRL

000

OpenAI-Reinforcement-Learning

000

pearl_reproduce

Meta RL codebase for Unstable Baselines

Language:Python000

PLAS

Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]

Language:PythonMIT000

Pytorch-CoDy

Language:Python000

QRec

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

GPL-3.0000

raps

[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives

MIT000

ray-neural-mmo

Ray framework with neural-mmo compatibility hacks

Language:PythonApache-2.0000

REDQ

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Language:PythonMIT000

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Language:Jupyter NotebookMIT000

RL-Process-Design

Deep reinforcement learning for design of chemical engineering processes

000

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

MIT000

TiKick

Learning-based agent for Google Research Football (足球游戏智能体)

Language:PythonApache-2.0000

R-Ceph

never's repositories

CAGrad

Hierarchical-Actor-Critic-HAC-PyTorch

baselines

bd_rd_psro

BOReL

CARL

Competition_3v3snakes

Griddly

hfr

implicit_q_learning

InsertionAI

interviews.ai

invalid-action-masking

Kaggle_Lux_AI_2021

language-planner

LSD

ML-For-Beginners

Off2OnRL

OpenAI-Reinforcement-Learning

pearl_reproduce

PLAS

Pytorch-CoDy

QRec

raps

ray-neural-mmo

REDQ

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

RL-Process-Design

tdmpc

TiKick