Xiaoteng Ma (xtma)

xtma

Geek Repo

Company:Tsinghua University

Home Page:https://xtma.github.io/

Github PK Tool:Github PK Tool

Xiaoteng Ma's repositories

pytorch_car_caring

Reinforcement Learning for Gym CarRacing-v0 with PyTorch

dsac

Distributional Soft Actor Critic

Language:PythonLicense:MITStargazers:46Issues:1Issues:5

simple-pytorch-rl

Reinforcement Learning Methods with PyTorch

apo

Average-Reward Reinforcement Learning with Trust Region Methods

Language:PythonLicense:MITStargazers:4Issues:1Issues:0

msvpo

The official implementation of "Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning"

Language:PythonStargazers:1Issues:1Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

ray-maddpg

MADDPG implementation with Ray

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

vimrc

The ultimate Vim configuration: vimrc

Language:Vim scriptLicense:MITStargazers:1Issues:0Issues:0
Language:SCSSLicense:MITStargazers:1Issues:1Issues:0

PGPortfolio

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

rl-portfolio-management

Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

rlpyt

Reinforcement Learning in PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

self-play-pong

RoboSchool Pony in Self-Play Mode

Language:PythonStargazers:0Issues:1Issues:0

VEM

Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)

Language:PythonStargazers:0Issues:0Issues:0