liuqi8827

Visual-Explanation-in-Deep-Reinforcement-Learning

This project visualizes the knowledge of an agent trained with deep reinforcement learning (paper will be published) using Backpropagation, Guided Backpropagation, Grad-CAM and Guided Grad-CAM. It shows why the agent performs a given action, that is, which pixels had the biggest influence on the agent's decision.

Deep-CFR

Scalable Implementation of Deep CFR and Single Deep CFR

MIT

RL-Double-Q-learning

A project comparing regular and double Q-learning reinforcement learning algorithms on different grid-world environments

why-clipping-accelerates

A PyTorch implementation of the LSTM experiments in the paper "Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity"

BSD-3-Clause

SV-RL

[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning

MIT

Meta-MDP-Reproduction

Code for reproduction of "A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning", submitted for the replication track of the NeurIPS 2019 Reproducibility Challenge.

DR-PG

Code for the paper "From Importance Sampling to Doubly Robust Policy Gradient"

Apache-2.0

rlpy

A PyTorch implementation of RL algorithms. It currently includes TRPO, ClipPPO, A2C, GAIL and ADCV.

multiagent-competition

Code for the paper "Emergent Complexity via Multi-agent Competition"

darknet_ros

YOLO ROS: Real-Time Object Detection for ROS

BSD-3-Clause

optimaltransport.github.io

Web site of the Computational Optimal Transport book

hand_eye_calibration

Python tools to perform time-synchronization and hand-eye calibration.

BSD-3-Clause

DoubleReinforcementLearningMDP

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

MIT

seven8827's repositories

world_models

tleague_projpage

TLeague

understanding-rl-vision

The-Mean-Squared-Error-of-Double-Q-Learning

AEGD

gym-recording

episodic-curiosity

distribution-is-all-you-need

safety-starter-agents

reinforcement-learning

copg

House3D

tabular-methods

reinforcement-learning-an-introduction-2

gradient_descent_viz

Visual-Explanation-in-Deep-Reinforcement-Learning

Deep-CFR

RL-Double-Q-learning

why-clipping-accelerates

SV-RL

Meta-MDP-Reproduction

DR-PG

rlpy

multiagent-competition

darknet_ros

optimaltransport.github.io

hand_eye_calibration

DoubleReinforcementLearningMDP

pytorch-a3c