deeplearnerJHB

JHB's starred repositories

WU-UCT

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

Language: Python | License: MIT | 9900

AC_CDQ

Action Candidate based Clipped Double Q-learning (Accepted by AAAI 2021)

Language: Python | License: MIT | 500

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python | License: MIT | 152600

3D-PointCloud

Papers and Datasets about Point Cloud.

Language: Python | 220000

locating-objects-without-bboxes

PyTorch code for "Locating objects without bounding boxes" - Loss function and trained models

Language: Python | License: NOASSERTION | 24900

Computer Vision library for human-computer interaction. It implements Head Pose and Gaze Direction Estimation Using Convolutional Neural Networks, Skin Detection through Backprojection, Motion Detection and Tracking, Saliency Map.

Language: Python | License: MIT | 177200

chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Language: Python | License: MIT | 115600

PathPlanning

Commonly used path planning algorithms with animations.

Language: Python | License: MIT | 755100

PointFlowRenderer

Code for rendering the point cloud figures in the paper "PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows"

Language: Python | 30200

dm_hard_eight

Language: Python | License: Apache-2.0 | 8500

FoldingNet

Organized code for the paper "FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation" (CVPR 2018).

Language: Python | 1000

FQF

FQF (Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for Atari games. It learns to play by predicting the return distribution in the form of a fully parameterized quantile function.

Language: Jupyter Notebook | License: NOASSERTION | 4000

DeepRL_PyTorch

Deep reinforcement learning code for study. Currently it covers only the following algorithms: DQN, C51, QR-DQN, IQN, QUOTA.

Language: Python | License: Apache-2.0 | 19300

njustPhDRoad

From thesis proposal to leaving campus: the road to a PhD at Nanjing University of Science and Technology

Language: TeX | 11800

dcem

The Differentiable Cross-Entropy Method

Language: Jupyter Notebook | License: NOASSERTION | 12400

atari-representation-learning

Code for "Unsupervised State Representation Learning in Atari"

Language: Python | License: MIT | 23300

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook | License: MIT | 47200

gradcem

Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization

Language: Python | 6700

dreamer-pytorch

Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.

Language: Python | License: MIT | 26500

world-models

Reimplementation of World Models (Ha and Schmidhuber 2018) in PyTorch

Language: Python | License: MIT | 54900

self-imitation-learning

ICML 2018 Self-Imitation Learning

Language: Python | License: MIT | 27300

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | 350000