Toru Hishinuma's repositories
Language:Jupyter Notebook000
Language:Python000
BOReL
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.
Language:Python000
mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Language:PythonMIT000
Language:TeX000
oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
MIT000
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
MIT000
pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:PythonMIT000
RepBM
Representation Balancing MDPs for Off-Policy Policy Evaluation
MIT000
soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
Language:PythonMIT000
Language:Python000
Language:Jupyter Notebook000
varibad
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
NOASSERTION000