hideki105 (Hideki105)

Hideki105

Geek Repo

Location:Tokyo

Github PK Tool:Github PK Tool

hideki105's repositories

bregman-proximal-dc-algorithm

Bregman Proximal type algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Language:PythonStargazers:0Issues:0Issues:0

deep-learning-from-scratch-4

ゼロから作るDeep Learning④強化学習編

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Diffusion-Models-pytorch

Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

doro

Distributional and Outlier Robust Optimization (ICML 2021)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gail-pytorch

PyTorch implementation of GAIL and PPO reinforcement learning algorithms

Language:PythonStargazers:0Issues:0Issues:0

GAIL_PPO

Generative Adversarial Imitation Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Inverse_Reinforcement_Learning

逆強化学習のサンプル

License:MITStargazers:0Issues:0Issues:0

linear-programming

主双対内点法による線形計画法

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mathematical-engineering

数理工学の講義資料

Stargazers:0Issues:0Issues:0

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

License:MITStargazers:0Issues:0Issues:0

Regularization-via-Transportation

Source files for our regularization paper!

License:MITStargazers:0Issues:0Issues:0

riemannian-optimization

リーマン多様体上の最適化

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

robust-optimization

ロバスト最適化

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

robustOT

Robust Optimal Transport code

Stargazers:0Issues:0Issues:0

semidefinite_programming

主双対内点法による半正定値計画

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sinkhorn-imitation

Code for reproducing the experiment results of the paper Imitation Learning with Sinkhorn Distances.

Stargazers:0Issues:0Issues:0