Yingru Li's repositories
muzero-cpp
A C++ pytorch implementation of MuZero
Distributed-Multi-Label-Continual-Learning
This is a distributed training framework for continual and incremental learning for multi-label multi-class image tasks
graphbackup
Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
hustthesis
:notebook_with_decorative_cover: An Unofficial Thesis Template in LaTeX for Huazhong University of Science and Technology
HyperAgent
The official code repo for HyperAgent: A Simple, Scalable, Efficient and Provable Reinforcement Learning Framework for Complex Environments, ICML 2024.
Information_Directed_Sampling
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
LangevinDQN
Code for the Langevin DQN agent
logistic_bandit
Logistic Bandit experiments. Official code for the paper "Jointly Efficient and Optimal Algorithms for Logistic Bandits".
model-based-muesli
muesli implementation based on muzero implementation from JimOhman (https://github.com/JimOhman/model-based-rl)
MuZero-Tensor-Batch-MCTS
An idea to implement MCTS by tensors. This implementation is able to process a batch of observations on GPU.
optimistic-init
Accompanying code for "Optimistic Initialization for Exploration in Continuous Control"
sigmazero
Generalizing DeepMind's MuZero algorithm on stochastic environments
vae-anomaly-detector
Experiments on unsupervised anomaly detection using variational autoencoder. The variational autoencoder is implemented in Pytorch.