hideki105's repositories
mathematical-engineering
数理工学の講義資料
Inverse_Reinforcement_Learning
逆強化学習のサンプル
botorch
Bayesian optimization in PyTorch
bregman-proximal-dc-algorithm
Bregman Proximal type algorithms
cet
CET: Counterfactual Explanation Tree [AISTATS-22]
CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
dace
DACE: Distribution-Aware Counterfactual Explanation [IJCAI-20]
deep-learning-from-scratch-4
ゼロから作るDeep Learning④強化学習編
Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
doro
Distributional and Outlier Robust Optimization (ICML 2021)
gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
linear-programming
主双対内点法による線形計画法
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
graduate_exam
京都大学数学系の院試の問題と解答です
GraduateSchoolEntranceExamination
東京大学大学院情報理工学系研究科入試問題過去問解答など
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
manifold-optimization-book
『多様体上の最適化理論』サポートページ
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
mogp
Mixture of Gaussian Processes Model for Sparse Longitudinal Data
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
python_simple_mppi
Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.
riemannian-optimization
リーマン多様体上の最適化
robust-optimization
ロバスト最適化
robustOT
Robust Optimal Transport code
sam
SAM: Sharpness-Aware Minimization (PyTorch)
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.