Johnny He (sweetice)

sweetice

Geek Repo

Location:Tuebingen, Germany

Home Page:sweetice.github.io

Github PK Tool:Github PK Tool

Johnny He's starred repositories

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonLicense:Apache-2.0Stargazers:3781Issues:129Issues:413

LibMTL

A PyTorch Library for Multi-Task Learning

Language:PythonLicense:MITStargazers:1984Issues:18Issues:79

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language:JavaLicense:MITStargazers:1776Issues:29Issues:121

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonLicense:Apache-2.0Stargazers:1505Issues:60Issues:31

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language:PythonLicense:GPL-3.0Stargazers:800Issues:23Issues:23

curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Language:PythonLicense:MITStargazers:571Issues:11Issues:26

drq

DrQ: Data regularized Q

Language:Jupyter NotebookLicense:MITStargazers:405Issues:13Issues:26

drqv2

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Language:PythonLicense:MITStargazers:349Issues:9Issues:26

realworldrl_suite

Real-World RL Benchmark Suite

Language:PythonLicense:Apache-2.0Stargazers:346Issues:14Issues:4

TD3_BC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Language:PythonLicense:MITStargazers:318Issues:4Issues:4

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

Language:PythonLicense:MITStargazers:249Issues:7Issues:7

wrench

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Language:PythonLicense:Apache-2.0Stargazers:218Issues:6Issues:27

Evolutionary-Reinforcement-Learning

Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" published at NeurIPS 2018

REDQ

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Language:PythonLicense:MITStargazers:148Issues:5Issues:7

sunrise

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:98Issues:6Issues:10

mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

Language:PythonLicense:GPL-3.0Stargazers:70Issues:2Issues:10

DA-in-visualRL

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

generalized_dt

Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)

Language:PythonStargazers:65Issues:0Issues:4
Language:Jupyter NotebookLicense:MITStargazers:47Issues:4Issues:0

deep-successor-features-for-transfer

A reusable framework for successor features for transfer in deep reinforcement learning using keras.

Language:PythonLicense:NOASSERTIONStargazers:39Issues:3Issues:0

OffCon3

đź“´ OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)

Language:PythonLicense:MITStargazers:24Issues:1Issues:1

neural-approx-ss-lfi

Codes for ICLR 21 paper: Neural Approximate Sufficient Statistics for Implicit Models

Language:Jupyter NotebookStargazers:19Issues:2Issues:0

KWNG

A Pytorch implementation of the KWNG estimator

Language:PythonLicense:BSD-3-ClauseStargazers:14Issues:2Issues:0

LeagueSandbox-RL-Learning

Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinforcement learning within v4.20 League of Legends.

Language:C#License:AGPL-3.0Stargazers:10Issues:2Issues:2

WNPG

implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies

Language:PythonStargazers:10Issues:1Issues:0

ppg

Phasic Policy Gradient

adaptive_estimators

Code for ICLR 2019 paper "Adaptive Estimators Show Information Compression in Deep Neural Networks" (https://openreview.net/forum?id=SkeZisA5t7)

Language:PythonLicense:MITStargazers:5Issues:1Issues:0