muupan

Yasuhiro Fujita's repositories

deep-reinforcement-learning-papers

A list of papers and resources dedicated to deep reinforcement learning

834 113 1

async-rl

Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)

Language:PythonMIT401 29 28

dqn-in-the-caffe

An implementation of Deep Q-Network using Caffe

Language:C++MIT213 16 20

deep-ensemble-uncertainty

An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)

Language:Jupyter Notebook34 3 1

predictron

WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer

Language:Python11 60

chainer-cocob

COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer

Language:Python6 40

chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Language:PythonMIT2 20

chainer-entropy-adam

Chainer-based implementation of Entropy-Adam https://arxiv.org/abs/1611.01838

Language:Python1 30

chainer-eve

An Eve optimizer implementation in Chainer

Language:Python1 30

chainer-oplu

Orthogonal Permuatation Linear Unit (OPLU) https://arxiv.org/abs/1604.02313v3

Language:Python1 40

chainer-weight-normalization

Weight normalization https://arxiv.org/abs/1602.07868

Language:Python1 2 1

gym_torcs

Language:C++MIT1 30

rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

Language:PythonMIT1 30

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++NOASSERTION020

chainer

A flexible framework of neural networks for deep learning

Language:PythonMIT000

chainer-santa

An experimental implementation of Santa for Chainer

Language:Python000

chainer-yogi

An unofficial implementation of Yogi optimizer in Chainer. See https://papers.nips.cc/paper/8186-adaptive-methods-for-nonconvex-optimization

Language:Python020

cupy

NumPy-like API accelerated with CUDA

Language:PythonNOASSERTION000

gvgai

This is the framework for the General Video Game Competition - http://www.gvgai.net/

Language:JavaNOASSERTION020

LC_NGSIM

lane change trajectories extracted from NGSIM

Language:MatlabMIT020

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language:PythonNOASSERTION020

muupan.github.io

Language:HTML020

NGSIM.jl

A Julia package for handling the Next Generation Simulation (NGSIM) traffic dataset

Language:Jupyter NotebookNOASSERTION020

pfrl

PFRL: a PyTorch-based deep reinforcement learning library

Language:PythonMIT020

pybrain

Language:PythonBSD-3-Clause020

pytorch-optimizer

torch-optimizer -- collection of optimizers for Pytorch

Language:PythonApache-2.0010

resume

My resume

03 1

self-normalizing-networks

Chainer implementation of Self-Normalizing Networks (SNN)

Language:Python020

slimevolleygym

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Apache-2.0000

ViZDoom

Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.

Language:C++000