never's repositories
Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
baselines
Baselines for Neural MMO -- new users should treat this repo as a starter project
bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
BOReL
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.
CARL
https://carl.readthedocs.io/en/latest/
Griddly
A grid-world game engine for game AI research
InsertionAI
Residual Reinforcement Learning used for insertion
interviews.ai
It is my belief that you the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
LSD
Lipschitz-constrained Unsupervised Skill Discovery
ML-For-Beginners
12 weeks, 24 lessons, classic Machine Learning for all
pearl_reproduce
Meta RL codebase for Unstable Baselines
PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
QRec
QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)
raps
[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives
ray-neural-mmo
Ray framework with neural-mmo compatibility hacks
REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
RL-Process-Design
Deep reinforcement learning for design of chemical engineering processes
tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
TiKick
Learning-based agent for Google Research Football (足球游戏智能体)