never's repositories

Language:PythonStargazers:1Issues:0Issues:0

Hierarchical-Actor-Critic-HAC-PyTorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

baselines

Baselines for Neural MMO -- new users should treat this repo as a starter project

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bd_rd_psro

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Language:PythonStargazers:0Issues:0Issues:0

BOReL

Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.

Stargazers:0Issues:0Issues:0

CARL

https://carl.readthedocs.io/en/latest/

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Griddly

A grid-world game engine for game AI research

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

InsertionAI

Residual Reinforcement Learning used for insertion

Stargazers:0Issues:0Issues:0

interviews.ai

It is my belief that you the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.

Stargazers:0Issues:0Issues:0

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

LSD

Lipschitz-constrained Unsupervised Skill Discovery

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ML-For-Beginners

12 weeks, 24 lessons, classic Machine Learning for all

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pearl_reproduce

Meta RL codebase for Unstable Baselines

Language:PythonStargazers:0Issues:0Issues:0

PLAS

Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

QRec

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

License:GPL-3.0Stargazers:0Issues:0Issues:0

raps

[NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives

License:MITStargazers:0Issues:0Issues:0

ray-neural-mmo

Ray framework with neural-mmo compatibility hacks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

REDQ

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

RL-Process-Design

Deep reinforcement learning for design of chemical engineering processes

Stargazers:0Issues:0Issues:0

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

License:MITStargazers:0Issues:0Issues:0

TiKick

Learning-based agent for Google Research Football (足球游戏智能体)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0