havenoname's repositories
awesome-knowledge-distillation
Awesome Knowledge Distillation
baselines-rudder
RUDDER for Atari games with delayed rewards in the OpenAI Baselines package
cvpr15deepcompare
Code and models for "Learning to Compare Image Patches via Convolutional Neural Networks"
DropoutUncertaintyExps
Experiments used in "Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning"
exemplar_models
Exemplar models for approximate density estimation
FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
Free_Energy_experiments
An exploration of the free energy principle using deep reinforcement learning
Hierarchical-Deep-Reinforcement-Learning
Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"
hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Hierarchical-DQN
Simple implementation of the model presented in "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"
large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
Neural-Episodic-Control
Implementation of DeepMind's Neural Episodic Control
noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
pixel_exploration
PyTorch implementation of Count-Based Exploration with Neural Density Models
pytorch-rl
Deep Reinforcement Learning with PyTorch & Visdom
Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
temporal_abstraction
Option-Critic with subgoal discovery by spectral decomposition of the successor features matrix or by clustering in successor features space.
UncertaintyNN
Implementation and evaluation of different approaches to get uncertainty in neural networks