bluecontra

Tianjin University

Beijing, China

https://bluecontra.github.io/

Hongyao Tang's repositories

AAAI2021-VDFP

Source code and raw learning-curve data for the AAAI 2021 paper "Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction"

Language: Python | 2 3 1

pymarl_alpha

Alpha code release for Python Multi-Agent Reinforcement Learning framework

Language: Python | 1 0 0

Awesome-pytorch-list

A comprehensive list of PyTorch-related content on GitHub, such as models, implementations, helper libraries, and tutorials.

0 0 0

Bayesian-Neural-Networks

PyTorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, and more

MIT | 0 0 0

bluecontra.github.io

Language: HTML | 0 2 0

CommNet-BiCnet

CommNet and BiCnet implementations in TensorFlow

Language: Python | 0 0 0

count_based_exploration_sr

Language: Python | MIT | 0 0 0

ddrl

Deep Developmental Reinforcement Learning

Language: C++ | MIT | 0 0 0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

0 0 0

deterministic-variational-inference

Sample code for running deterministic variational inference to train Bayesian neural networks

Language: Jupyter Notebook | MIT | 0 0 0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

MIT | 0 0 0

DRL

Language: Python | MIT | 0 0 0

gym-minigrid

Minimalistic gridworld environment for OpenAI Gym

Language: Python | BSD-3-Clause | 0 0 0

icnn

Input Convex Neural Networks

Apache-2.0 | 0 0 0

large-scale-curiosity

0 0 0

MATC_Env

Multi-agent Trash Collecting domains used in the research paper "Hierarchical Deep Multiagent Reinforcement Learning" (arXiv:1809.09332)

0 0 0

models

Models and examples built with TensorFlow

Language: Python | Apache-2.0 | 0 0 0

MPHRL

Model Primitive Hierarchical Reinforcement Learning

Language: Python | MIT | 0 2 0

P3O

P3O paper code

0 0 0

planet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language: Python | Apache-2.0 | 0 0 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | MIT | 0 0 0

random-network-distillation

Language: Python | 0 0 0

revisiting-ppo

MIT | 0 0 0

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Language: Python | Apache-2.0 | 0 0 0

SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch.

Language: Python | MIT | 0 0 0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language: Python | MIT | 0 0 0

SteinGAN

Code for SteinGAN: Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning

MIT | 0 0 0

TD3

PyTorch implementation of TD3 and DDPG for OpenAI Gym tasks

Language: Python | 0 0 0

transformer-tensorflow

Implementation of the Transformer model in TensorFlow

Language: Python | 0 2 0

tsallis_actor_critic_mujoco

Implementation of the Tsallis Actor Critic method

Language: Jupyter Notebook | 0 1 0