xtma - Giters

Xiaoteng Ma's repositories

pytorch_car_caring

Reinforcement Learning for Gym CarRacing-v0 with PyTorch

Language:Python147 5 3

dsac

Distributional Soft Actor Critic

Language:PythonMIT47 1 5

simple-pytorch-rl

Reinforcement Learning Methods with PyTorch

Language:Python37 1 3

apo

Average-Reward Reinforcement Learning with Trust Region Methods

Language:PythonMIT4 10

msvpo

The official implementation of "Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning"

Language:Python1 10

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT1 10

ray-maddpg

MADDPG implementation with Ray

Language:PythonMIT1 20

vimrc

The ultimate Vim configuration: vimrc

Language:Vim ScriptMIT100

xtma.github.io

Language:SCSSMIT1 10

PGPortfolio

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Language:PythonGPL-3.0010

rl-portfolio-management

Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)

Language:Jupyter Notebook010

rlpyt

Reinforcement Learning in PyTorch

Language:PythonMIT000

self-play-pong

RoboSchool Pony in Self-Play Mode

Language:Python010

VEM

Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)

Language:Python000