seven8827 (liuqi8827)

liuqi8827

Geek Repo

Company:Harbin Institute of Technology

Location:Shenzhen, China

Github PK Tool:Github PK Tool

seven8827's repositories

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

understanding-rl-vision

Code for the paper "Understanding RL Vision"

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

AEGD

Adaptive gradient descent with energy

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

gym-recording

Add-on package to gym, to record sequences of actions, observations, and rewards

Stargazers:0Issues:0Issues:0

episodic-curiosity

Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability

License:Apache-2.0Stargazers:0Issues:0Issues:0

distribution-is-all-you-need

The basic distribution probability Tutorial for Deep Learning Researchers

License:MITStargazers:0Issues:0Issues:0

safety-starter-agents

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

License:MITStargazers:0Issues:0Issues:0

reinforcement-learning

GridWorld solved with VI, PI, SARSA, Expected SARSA, SARSA Lambda, Q learning, Double Q learning.

Stargazers:0Issues:0Issues:0

copg

This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.

License:MITStargazers:0Issues:0Issues:0

House3D

a Realistic and Rich 3D Environment

License:Apache-2.0Stargazers:0Issues:0Issues:0

tabular-methods

Tabular methods for reinforcement learning

License:MITStargazers:0Issues:0Issues:0

reinforcement-learning-an-introduction-2

My solutions to Sutton & Barto - Reinforcement Learning

Stargazers:0Issues:0Issues:0

gradient_descent_viz

interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI

Stargazers:0Issues:0Issues:0

Visual-Explanation-in-Deep-Reinforcement-Learning

This project visualizes the knowledge of an agent trained by Deep Reinforcement Learning (paper will be published) using Backpropagation, Guided Backpropagation, GradCam and Guided gradCam. It shows why the agent is performing the action. Which pixels had the biggest influence on the decision of the agent.

Stargazers:0Issues:0Issues:0

Deep-CFR

Scalable Implementation of Deep CFR and Single Deep CFR

License:MITStargazers:0Issues:0Issues:0

RL-Double-Q-learning

A project comparing regular and double Q-learning reinforcement learning algorithms on different grid-world environments

Stargazers:0Issues:0Issues:0

why-clipping-accelerates

A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

SV-RL

[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning

License:MITStargazers:0Issues:0Issues:0

Meta-MDP-Reproduction

Code for reproduction of "A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning", submitted for the replication track of the NeurIPS 2019 Reproducibility Challenge.

Stargazers:0Issues:0Issues:0

DR-PG

Code for the paper "From Importance Sampling to Doubly Robust Policy Gradient"

License:Apache-2.0Stargazers:0Issues:0Issues:0

rlpy

A pytorch-version implementation of RL algorithms. Now it collects TRPO, ClipPPO, A2C, GAIL and ADCV.

Stargazers:0Issues:0Issues:0

multiagent-competition

Code for the paper "Emergent Complexity via Multi-agent Competition"

Stargazers:0Issues:0Issues:0

darknet_ros

YOLO ROS: Real-Time Object Detection for ROS

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

optimaltransport.github.io

Web site of the Computational Optimal Transport book

Stargazers:0Issues:0Issues:0

hand_eye_calibration

Python tools to perform time-synchronization and hand-eye calibration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

License:MITStargazers:0Issues:0Issues:0