hideki105's repositories
bregman-proximal-dc-algorithm
Bregman proximal-type algorithms
CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
deep-learning-from-scratch-4
Deep Learning from Scratch 4: Reinforcement Learning (ゼロから作るDeep Learning④ 強化学習編)
Diffusion-Models-pytorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
doro
Distributional and Outlier Robust Optimization (ICML 2021)
gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
GAIL_PPO
Generative Adversarial Imitation Learning
Inverse_Reinforcement_Learning
Inverse reinforcement learning samples
linear-programming
Linear programming via the primal-dual interior-point method
mathematical-engineering
Lecture materials for mathematical engineering
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Regularization-via-Transportation
Source files for our regularization paper!
riemannian-optimization
Optimization on Riemannian manifolds
robust-optimization
Robust optimization
robustOT
Robust Optimal Transport code
semidefinite_programming
Semidefinite programming via the primal-dual interior-point method
sinkhorn-imitation
Code for reproducing the experiment results of the paper Imitation Learning with Sinkhorn Distances.