hideki105's repositories

mathematical-engineering

数理工学の講義資料

Stargazers:3Issues:0Issues:0

Inverse_Reinforcement_Learning

逆強化学習のサンプル

License:MITStargazers:1Issues:0Issues:0

botorch

Bayesian optimization in PyTorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

bregman-proximal-dc-algorithm

Bregman Proximal type algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cet

CET: Counterfactual Explanation Tree [AISTATS-22]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Language:PythonStargazers:0Issues:0Issues:0

dace

DACE: Distribution-Aware Counterfactual Explanation [IJCAI-20]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deep-learning-from-scratch-4

ゼロから作るDeep Learning④強化学習編

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Diffusion-Models-pytorch

Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

doro

Distributional and Outlier Robust Optimization (ICML 2021)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gail-pytorch

PyTorch implementation of GAIL and PPO reinforcement learning algorithms

Language:PythonStargazers:0Issues:0Issues:0

linear-programming

主双対内点法による線形計画法

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

License:NOASSERTIONStargazers:0Issues:0Issues:0

gpytorch

A highly efficient implementation of Gaussian Processes in PyTorch

License:MITStargazers:0Issues:0Issues:0

graduate_exam

京都大学数学系の院試の問題と解答です

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

GraduateSchoolEntranceExamination

東京大学大学院情報理工学系研究科入試問題過去問解答など

Language:TeXStargazers:0Issues:0Issues:0

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

License:MITStargazers:0Issues:0Issues:0

manifold-optimization-book

『多様体上の最適化理論』サポートページ

Stargazers:0Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

License:MITStargazers:0Issues:0Issues:0

mogp

Mixture of Gaussian Processes Model for Sparse Longitudinal Data

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License:MITStargazers:0Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

License:MITStargazers:0Issues:0Issues:0

python_simple_mppi

Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.

License:NOASSERTIONStargazers:0Issues:0Issues:0

riemannian-optimization

リーマン多様体上の最適化

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

robust-optimization

ロバスト最適化

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

robustOT

Robust Optimal Transport code

Stargazers:0Issues:0Issues:0

sam

SAM: Sharpness-Aware Minimization (PyTorch)

License:MITStargazers:0Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

License:MITStargazers:0Issues:0Issues:0