Yingru Li's repositories

muzero-cpp

A C++ PyTorch implementation of MuZero

Language: C++ | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 0

Distributed-Multi-Label-Continual-Learning

A distributed training framework for continual and incremental learning on multi-label, multi-class image tasks

Language: Python | Stargazers: 0 | Issues: 6 | Issues: 0
Language: Shell | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Shell | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 3 | Issues: 0

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 2 | Issues: 0

graphbackup

Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

hustthesis

An Unofficial Thesis Template in LaTeX for Huazhong University of Science and Technology

Language: TeX | License: LPPL-1.3c | Stargazers: 0 | Issues: 2 | Issues: 0

HyperAgent

The official code repo for HyperAgent: A Simple, Scalable, Efficient and Provable Reinforcement Learning Framework for Complex Environments, ICML 2024.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

Information_Directed_Sampling

Implementation of Russo and Van Roy's work on Information Directed Sampling (2017)

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

LangevinDQN

Code for the Langevin DQN agent

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

logistic_bandit

Logistic Bandit experiments. Official code for the paper "Jointly Efficient and Optimal Algorithms for Logistic Bandits".

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

model-based-muesli

Muesli implementation based on the MuZero implementation by JimOhman (https://github.com/JimOhman/model-based-rl)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

MuZero-Tensor-Batch-MCTS

An approach to implementing MCTS with tensors. This implementation can process a batch of observations on the GPU.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

OB2I

Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in NetHack/MiniHack environments.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 2 | Issues: 0

optimistic-init

Accompanying code for "Optimistic Initialization for Exploration in Continuous Control"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

rlberry

An easy-to-use reinforcement learning library for research and education.

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

rltf

Reinforcement Learning implementations and research prototyping in TensorFlow

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

sigmazero

Generalizing DeepMind's MuZero algorithm to stochastic environments

Stargazers: 0 | Issues: 0 | Issues: 0
Language: TeX | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

vae-anomaly-detector

Experiments on unsupervised anomaly detection using a variational autoencoder. The variational autoencoder is implemented in PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0