Xiaoteng Ma's repositories
pytorch_car_caring
Reinforcement Learning for Gym CarRacing-v0 with PyTorch
simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ray-maddpg
MADDPG implementation with Ray
PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" (https://arxiv.org/pdf/1706.10059.pdf).
rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an OpenAI Gym environment)
rlpyt
Reinforcement Learning in PyTorch
self-play-pong
RoboSchool Pong in Self-Play Mode
VEM
Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022, https://arxiv.org/abs/2110.09796)