Irvine (wadx2019)

wadx2019

Geek Repo

Company:ShanghaiTech University

Location:Shanghai, China

Github PK Tool:Github PK Tool

Irvine's repositories

rpo

Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"

Language:PythonLicense:MITStargazers:21Issues:2Issues:2

Neural-Bandit

A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Exploration) and neuralTS(Neural Thompson sampling)

Language:PythonStargazers:5Issues:1Issues:0

homoode

Official implementation for the NeurIPS 2023 paper: "Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation"

Language:PythonLicense:MITStargazers:3Issues:1Issues:0
Language:MATLABLicense:MITStargazers:1Issues:1Issues:0

homoode-webpage

HomoODE official page

Language:CSSStargazers:0Issues:1Issues:0

Pymol-script-repo

Collected scripts for Pymol

Language:PythonStargazers:0Issues:0Issues:0

rpo-webpage

RPO official page

Language:CSSStargazers:0Issues:0Issues:0

wadx2019.github.io

Homopage of Shutong Ding

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

awesome-virtual-try-on

A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

CQL

Code for conservative Q-learning

Language:PythonStargazers:0Issues:0Issues:0

cs231n-2

cs231n assignments sovled by https://ghli.org

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DC3

DC3: A Learning Method for Optimization with Hard Constraints

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LaTeXLive

LateX公式编辑器-妈叔出品

License:Apache-2.0Stargazers:0Issues:0Issues:0

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

metadrive

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PYPOWER

Port of MATPOWER to Python

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Safe-Policy-Optimization

This is a benchmark repository for safe reinforcement learning algorithms

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

svs

Re-implementation of Communication-Efficient Distributed Covariance Sketch, with Application to Distributed PCA

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0

V2G-Predictor

A RL implementation of a V2G system

Language:PythonStargazers:0Issues:0Issues:0