Johnny He (sweetice)

sweetice

Geek Repo

Location:Tuebingen, Germany

Home Page:sweetice.github.io

Github PK Tool:Github PK Tool

Johnny He's repositories

learning-to-communicate-pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:0

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Language:Jupyter NotebookStargazers:3Issues:3Issues:0

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonLicense:MITStargazers:2Issues:3Issues:0

reinforcement-learning-algorithms

This repository contains most of classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, A3C, PPO, TRPO. (More algorithms are still in progress)

Language:PythonLicense:MITStargazers:2Issues:3Issues:0

Algorithm_Interview_Notes-Chinese

2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记

Language:PythonStargazers:1Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

VirtualTaobao

Virtual-Taobao simulators with OpenAI Gym interface

Language:PythonStargazers:1Issues:0Issues:0

feudal-montezuma

Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

glow

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

go-explore

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

gym-super-mario-bros

An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ItChat

A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

learning-to-communicate

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Language:LuaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Lihang

Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]

Language:PythonStargazers:0Issues:0Issues:0

loss-landscape

Code for visualizing the loss landscape of neural nets

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (1000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(1000+页)和视频链接

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

noreward-rl

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

pytorch-noreward-rl

pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:PythonStargazers:0Issues:0Issues:0

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Language:PythonStargazers:0Issues:0Issues:0

Recommenders

Recommender Systems

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

RL-Gallery

A gallery for reinforcement learning, including frameworks, tutorials, papers, implementations, applications, etc.

License:MITStargazers:0Issues:0Issues:0

rlkit

Collection of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

stanford-cs-229-machine-learning

VIP cheatsheets for Stanford's CS 229 Machine Learning

License:MITStargazers:0Issues:2Issues:0

Super-Mario-Bros-RL

This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super Mario Bros

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

TD3

PyTorch implementation of TD3 and DDPG for OpenAI gym tasks

Language:PythonStargazers:0Issues:3Issues:0

tushare

TuShare is a utility for crawling historical data of China stocks

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0