JHB's starred repositories

WU-UCT

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

Language: Python | License: MIT | Stargazers: 99 | Issues: 0

AC_CDQ

Action Candidate based Clipped Double Q-learning (Accepted by AAAI 2021)

Language: Python | License: MIT | Stargazers: 5 | Issues: 0

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python | License: MIT | Stargazers: 1526 | Issues: 0

3D-PointCloud

Papers and datasets about point clouds.

Language: Python | Stargazers: 2200 | Issues: 0

locating-objects-without-bboxes

PyTorch code for "Locating objects without bounding boxes" - Loss function and trained models

Language: Python | License: NOASSERTION | Stargazers: 249 | Issues: 0

deepgaze

Computer Vision library for human-computer interaction. It implements Head Pose and Gaze Direction Estimation Using Convolutional Neural Networks, Skin Detection through Backprojection, Motion Detection and Tracking, Saliency Map.

Language: Python | License: MIT | Stargazers: 1772 | Issues: 0

chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Language: Python | License: MIT | Stargazers: 1156 | Issues: 0

PathPlanning

Commonly used path planning algorithms with animations.

Language: Python | License: MIT | Stargazers: 7551 | Issues: 0

PointFlowRenderer

Code for rendering the point cloud figures in the paper "PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows"

Language: Python | Stargazers: 302 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 85 | Issues: 0

FoldingNet

Organized code for the paper "FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation" (CVPR 2018).

Language: Python | Stargazers: 10 | Issues: 0

FQF

FQF (Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for Atari games. It learns to play by predicting the return distribution in the form of a fully parameterized quantile function.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 40 | Issues: 0

DeepRL_PyTorch

Deep reinforcement learning code for study. Currently only these algorithms are included: DQN, C51, QR-DQN, IQN, QUOTA.

Language: Python | License: Apache-2.0 | Stargazers: 193 | Issues: 0

njustPhDRoad

From thesis proposal to leaving campus: the road to a PhD at Nanjing University of Science and Technology

Language: TeX | Stargazers: 118 | Issues: 0

dcem

The Differentiable Cross-Entropy Method

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 124 | Issues: 0

atari-representation-learning

Code for "Unsupervised State Representation Learning in Atari"

Language: Python | License: MIT | Stargazers: 233 | Issues: 0

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook | License: MIT | Stargazers: 472 | Issues: 0

gradcem

Code for the paper "Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization"

Language: Python | Stargazers: 67 | Issues: 0

dreamer-pytorch

Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.

Language: Python | License: MIT | Stargazers: 265 | Issues: 0

world-models

Reimplementation of World Models (Ha and Schmidhuber, 2018) in PyTorch

Language: Python | License: MIT | Stargazers: 549 | Issues: 0

self-imitation-learning

ICML 2018 Self-Imitation Learning

Language: Python | License: MIT | Stargazers: 273 | Issues: 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | Stargazers: 3500 | Issues: 0
Language: Python | Stargazers: 28 | Issues: 0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language: Python | License: MIT | Stargazers: 7523 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 2 | Issues: 0

pysot

SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Language: Python | License: Apache-2.0 | Stargazers: 4386 | Issues: 0
Language: Python | License: MIT | Stargazers: 87 | Issues: 0

Variational_Discriminator_Bottleneck

Implementation (with some experimentation) of the paper "Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow" (arXiv: https://arxiv.org/pdf/1810.00821.pdf)

Language: Python | License: MIT | Stargazers: 153 | Issues: 0

VIB-pytorch

PyTorch implementation of Deep Variational Information Bottleneck

Language: Python | Stargazers: 169 | Issues: 0

Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Language: Python | License: MIT | Stargazers: 86 | Issues: 0