Takuya Hiraoka (TakuyaHiraoka)

TakuyaHiraoka

Geek Repo

Location:Tokyo-3, Japan

Home Page:https://takuyahiraoka.github.io

Github PK Tool:Github PK Tool

Takuya Hiraoka's repositories

tensor2robot

Distributed machine learning infrastructure for large-scale robotics research

License:Apache-2.0Stargazers:0Issues:0Issues:0

mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

Stargazers:0Issues:0Issues:0

Learning-Robust-Options-by-Conditional-Value-at-Risk-Optimization

Source files to replicate experiments in my NeurIPS 2019 paper.

Language:PythonLicense:MITStargazers:10Issues:0Issues:0
Stargazers:0Issues:0Issues:0

snail-pytorch

Implementation of "A Simple Neural Attentive Meta-Learner" (SNAIL, https://arxiv.org/pdf/1707.03141.pdf) in PyTorch

Stargazers:0Issues:0Issues:0

learning_to_adapt

Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning

Language:PythonStargazers:0Issues:0Issues:0

PPOC

Proximal Policy Option-Critic

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

gym-extensions

This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

ConcreteDropout

Code for Concrete Dropout as presented in https://arxiv.org/abs/1705.07832

License:MITStargazers:0Issues:0Issues:0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

naacl18-multitask_argument_mining

Code for the paper "Multi-Task Learning for Argumentation Mining in Low-Resource Settings"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gamepad

A Learning Environment for Theorem Proving

Language:CoqLicense:Apache-2.0Stargazers:0Issues:0Issues:0

marseille

Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Grounded-Language-Learning-in-Pytorch

Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Reinforcement-Learning-in-Multi-Party-Trading-Dialog

Source files to replicate experiments in my SigDial 2015 and JSAI papers.

Language:PythonStargazers:8Issues:0Issues:0

Dialogue-State-Tracking-using-LSTM

Source files to replicate experiments in my IWSDS 2016 paper.

Language:PythonStargazers:23Issues:0Issues:0

robustRL

Robust policy search algorithms which train on model ensembles

Language:PythonStargazers:0Issues:0Issues:0

Multi-Agent-Reinforcement-Learning-in-Stochastic-Games

Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.

Language:PythonStargazers:70Issues:0Issues:0

Active-Learning-for-Example-based-Dialog-Systems

Source files to replicate experiments in my IWSDS 2016 paper.

Language:PythonStargazers:9Issues:0Issues:0