Yasuhiro Fujita (muupan)

Company: @pfnet

Home Page: https://github.com/muupan/resume


Organizations
chainer
pfnet

Yasuhiro Fujita's repositories

deep-reinforcement-learning-papers

A list of papers and resources dedicated to deep reinforcement learning

async-rl

Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)

Language: Python | License: MIT | Stargazers: 401 | Issues: 29 | Issues: 28

dqn-in-the-caffe

An implementation of Deep Q-Network using Caffe

Language: C++ | License: MIT | Stargazers: 213 | Issues: 16 | Issues: 20

deep-ensemble-uncertainty

An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)

Language: Jupyter Notebook | Stargazers: 34 | Issues: 3 | Issues: 1

predictron

WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer

Language: Python | Stargazers: 11 | Issues: 6 | Issues: 0

chainer-cocob

COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer

Language: Python | Stargazers: 6 | Issues: 4 | Issues: 0

chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Language: Python | License: MIT | Stargazers: 2 | Issues: 2 | Issues: 0

chainer-entropy-adam

Chainer-based implementation of Entropy-Adam https://arxiv.org/abs/1611.01838

Language: Python | Stargazers: 1 | Issues: 3 | Issues: 0

chainer-eve

An Eve optimizer implementation in Chainer

Language: Python | Stargazers: 1 | Issues: 3 | Issues: 0

chainer-oplu

Orthogonal Permutation Linear Unit (OPLU) https://arxiv.org/abs/1604.02313v3

Language: Python | Stargazers: 1 | Issues: 4 | Issues: 0

chainer-weight-normalization

Weight normalization https://arxiv.org/abs/1602.07868

Language: C++ | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0

rl-teacher

Code for Deep RL from Human Preferences [Christiano et al.], plus a webapp for collecting human feedback

Language: Python | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

chainer

A flexible framework of neural networks for deep learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

chainer-santa

An experimental implementation of Santa for Chainer

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

chainer-yogi

An unofficial implementation of Yogi optimizer in Chainer. See https://papers.nips.cc/paper/8186-adaptive-methods-for-nonconvex-optimization

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

cupy

NumPy-like API accelerated with CUDA

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

gvgai

This is the framework for the General Video Game Competition - http://www.gvgai.net/

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

LC_NGSIM

Lane change trajectories extracted from NGSIM

Language: Matlab | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0

NGSIM.jl

A Julia package for handling the Next Generation Simulation (NGSIM) traffic dataset

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

pfrl

PFRL: a PyTorch-based deep reinforcement learning library

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 2 | Issues: 0

pytorch-optimizer

torch-optimizer -- collection of optimizers for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

resume

My resume

Stargazers: 0 | Issues: 3 | Issues: 1

self-normalizing-networks

Chainer implementation of Self-Normalizing Networks (SNN)

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

slimevolleygym

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ViZDoom

Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0