Takuya Hiraoka's repositories
Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
Efficient-SRGC-RL-with-a-High-RR-and-Regularization
Source files to replicate experiments in my Arxiv 2023 paper.
Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
Which-Experiences-Are-Influential-for-RL-Agents
Source files to replicate experiments in my ArXiv 2024 paper.
d3rlpy
An offline deep reinforcement learning library
d4rl
A benchmark for offline reinforcement learning.
deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
ElegantRL
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
mbrl-lib
Library for Model Based RL
Meta-Model-Based-Meta-Policy-Optimization
Source files to replicate experiments in my ACML 2021 paper.
metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
mujoco-maze
Simple maze environments using mujoco-py
oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
pianoplayer
Automatic fingering generator for piano scores
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
rltorch
A simple framework for distributed reinforcement learning in PyTorch.
robopianist
🎹 🤖 A benchmark for high-dimensional robot control.
soft-actor-critic.pytorch
A PyTorch implementation of Soft Actor-Critic(SAC).
ToolBench
An open platform for training, serving, and evaluating large language model for tool learning.