JHB's starred repositories
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
3D-PointCloud
Papers and Datasets about Point Cloud.
locating-objects-without-bboxes
PyTorch code for "Locating objects without bounding boxes" - Loss function and trained models
PathPlanning
Common used path planning algorithms with animations.
PointFlowRenderer
Code for rendering the point cloud figures in paper: "PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows"
FoldingNet
Organized code for the paper "FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation" (CVPR 2018).
FQF
FQF(Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for Atari games, which can learn to play Atari games automatically by predicting return distribution in the form of a fully parameterized quantile function.
DeepRL_PyTorch
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
njustPhDRoad
从开题到离校——南京理工大学博士毕业之路
atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
dreamer-pytorch
Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.
world-models
Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch
self-imitation-learning
ICML 2018 Self-Imitation Learning
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Variational_Discriminator_Bottleneck
Implementation (with some experimentation) of the paper titled "VARIATIONAL DISCRIMINATOR BOTTLENECK: IMPROVING IMITATION LEARNING, INVERSE RL, AND GANS BY CONSTRAINING INFORMATION FLOW" (arxiv -> https://arxiv.org/pdf/1810.00821.pdf)
VIB-pytorch
Pytorch implementation of Deep Variational Information Bottleneck