sweetice

Johnny He's repositories

learning-to-communicate-pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Language:PythonApache-2.03 20

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Language:Jupyter Notebook3 30

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonMIT2 30

reinforcement-learning-algorithms

This repository contains most of classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, A3C, PPO, TRPO. (More algorithms are still in progress)

Language:PythonMIT2 30

Algorithm_Interview_Notes-Chinese

2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记

Language:Python1 20

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT1 20

VirtualTaobao

Virtual-Taobao simulators with OpenAI Gym interface

Language:Python100

feudal-montezuma

Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge

Language:PythonMIT000

glow

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

Language:PythonMIT000

go-explore

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Language:PythonNOASSERTION000

gym-super-mario-bros

An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES

Language:PythonNOASSERTION000

ItChat

A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信，三十行即可自定义个人号机器人。

Language:PythonMIT000

learning-to-communicate

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Language:LuaApache-2.0000

Lihang

Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]

Language:Python000

loss-landscape

Code for visualizing the loss landscape of neural nets

Language:PythonMIT000

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (1000+ slides) 我不间断更新的机器学习，概率模型和深度学习的讲义(1000+页)和视频链接

Language:Jupyter Notebook000

models

Models and examples built with TensorFlow

Language:PythonApache-2.0000

noreward-rl

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Language:PythonNOASSERTION020

pytorch-noreward-rl

pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction

Language:PythonMIT000

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:Python000