Mitsuhiko Nakamoto (nakamotoo)

nakamotoo

Geek Repo

Location:UC Berkeley

Github PK Tool:Github PK Tool

Mitsuhiko Nakamoto's repositories

Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

ecg-pytorch-sample

A PyTorch implementation for training deep learning models for 12-lead ECGs (2D-CNN, 1D-CNN, Transformer)

dopamine

forked from https://github.com/google/dopamine

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:HTMLLicense:MITStargazers:1Issues:1Issues:0

3S_Hardware_Design

Homework of Verilog HDL

Language:VerilogStargazers:0Issues:1Issues:0

batch_rl

Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

echo-3D-sample

A PyTorch implementation for training 3D-CNN models for cardiac echocardiography.

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

eeic-opencv-opengl

A course project using OpenCV and OpenGL.

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

perceptron-pos-tagging

This is a work of Part-of-Speech tagging using averaged perceptron.

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

D4RL

A collection of reference environments for offline reinforcement learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

D4RL-fork

D4RL-fork

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

eeic-ai-experiment

eeic人工知能演習2020 第二ターム

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

eeic_experiment_I

Socket Programming in C

Language:CStargazers:0Issues:1Issues:0

genaug

main augmentation script for real world robot dataset.

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

interbotix_ros_toolboxes

Support-level ROS Packages for Interbotix Robots

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

lifelong_rl

Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Reset-Free Lifelong Learning with Skill-Space Planning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

MCQ

Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mj_envs

A collection of MuJoCo based environments.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

online-dt

Online Decision Transformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

reversi-othelo-ai

A gam AI for playing Reversi (Othelo). Implemented the mini-max algorithm in C++.

Language:C++Stargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TwitterUI

I made a Twitter home UI as a training of HTML/CSS.

Language:HTMLStargazers:0Issues:1Issues:0