Hongyao Tang (bluecontra)

bluecontra

Geek Repo

Company:Tianjin University

Location:Beijing, China

Home Page:https://bluecontra.github.io/

Github PK Tool:Github PK Tool

Hongyao Tang's repositories

AAAI2021-VDFP

Source code and raw data of learning curves for AAAI 2021 paper - 《Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction》

pymarl_alpha

Alpha code release for Python Multi-Agent Reinforcement Learning framework

Language:PythonStargazers:1Issues:0Issues:0

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Stargazers:0Issues:0Issues:0

Bayesian-Neural-Networks

Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more

License:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

CommNet-BiCnet

CommNet and BiCnet implementation in tensorflow

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ddrl

Deep Developmental Reinforcement Learning

Language:C++License:MITStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Stargazers:0Issues:0Issues:0

deterministic-variational-inference

Sample code for running deterministic variational inference to train Bayesian neural networks

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gym-minigrid

Minimalistic gridworld environment for OpenAI Gym

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

icnn

Input Convex Neural Networks

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MATC_Env

Multi-agent Trash Collecting domains used in research paper 《Hierarchical Deep Multiagent Reinforcement Learning》 (arXiv:1809.09332)

Stargazers:0Issues:0Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MPHRL

Model Primitive Hierarchical Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

P3O

P3O paper code

Stargazers:0Issues:0Issues:0

planet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SteinGAN

code for steinGAN - Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning

License:MITStargazers:0Issues:0Issues:0

TD3

PyTorch implementation of TD3 and DDPG for OpenAI gym tasks

Language:PythonStargazers:0Issues:0Issues:0

transformer-tensorflow

Implementation of Transformer Model in Tensorflow

Language:PythonStargazers:0Issues:2Issues:0

tsallis_actor_critic_mujoco

Implementation of Tsallis Actor Critic method

Language:Jupyter NotebookStargazers:0Issues:1Issues:0