RuohLiuq's repositories
morl-baselines
Implementations of Multi-Objective Reinforcement Learning algorithms.
PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm
A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preference space in a given domain.
PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), a scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR), and Generative Adversarial Imitation Learning (GAIL).
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Includes a fast Fisher vector product for TRPO.
rl4uc
Reinforcement learning for unit commitment
sustaingym
Reinforcement Learning Environments for Sustainable Energy Systems