Johnny He's repositories
learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
reinforcement-learning-algorithms
This repository contains most of classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, A3C, PPO, TRPO. (More algorithms are still in progress)
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
VirtualTaobao
Virtual-Taobao simulators with OpenAI Gym interface
feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
glow
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
gym-super-mario-bros
An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES
ItChat
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
learning-to-communicate
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Lihang
Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]
loss-landscape
Code for visualizing the loss landscape of neural nets
machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (1000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(1000+页)和视频链接
models
Models and examples built with TensorFlow
noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Recommenders
Recommender Systems
RL-Gallery
A gallery for reinforcement learning, including frameworks, tutorials, papers, implementations, applications, etc.
rlkit
Collection of reinforcement learning algorithms
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains.
spinningup
An educational resource to help anyone learn deep reinforcement learning.
stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
Super-Mario-Bros-RL
This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super Mario Bros