wadx2019

ShanghaiTech University

Shanghai, China

Irvine's repositories

rpo

Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"

Language: Python, License: MIT, 21 2 2

Neural-Bandit

A collection of PyTorch implementations of neural bandit algorithms, including neuralUCB (Neural Contextual Bandits with UCB-based Exploration) and neuralTS (Neural Thompson Sampling)

Language: Python, 5 1 0

homoode

Official implementation for the NeurIPS 2023 paper: "Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation"

Language: Python, License: MIT, 3 1 0

homotopy

Language: MATLAB, License: MIT, 1 1 0

homoode-webpage

HomoODE official page

Language: CSS, 0 1 0

Pymol-script-repo

Collected scripts for Pymol

Language: Python, 0 0 0

rpo-webpage

RPO official page

Language: CSS, 0 0 0

wadx2019.github.io

Homepage of Shutong Ding

Language: SCSS, License: MIT, 0 0 0

alicia

0 1 0

awesome-virtual-try-on

A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.

0 0 0

BigN

Language: Python, License: MIT, 0 1 0

Competition_3v3snakes

Language: JavaScript, License: MIT, 0 0 0

CQL

Code for conservative Q-learning

Language: Python, 0 0 0

cs231n-2

cs231n assignments solved by https://ghli.org

Language: Jupyter Notebook, 0 0 0

DC3

DC3: A Learning Method for Optimization with Hard Constraints

Language: Python, License: Apache-2.0, 0 0 0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language: Python, License: MIT, 0 0 0

LaTeXLive

LaTeX formula editor, made by 妈叔

License: Apache-2.0, 0 0 0

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language: Python, License: NOASSERTION, 0 0 0

metadrive

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

Language: Python, License: Apache-2.0, 0 0 0

N3C-MAB

Language: Python, License: MIT, 0 1 0

PYPOWER

Port of MATPOWER to Python

Language: Python, License: NOASSERTION, 0 0 0

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

Language: Python, License: MIT, 0 0 0

res

0 1 0

RL-EVCP

Language: Jupyter Notebook, 0 0 0

RL-Paper-notes

0 0 0

RL_OPF

Language: Python, 0 0 0

Safe-Policy-Optimization

This is a benchmark repository for safe reinforcement learning algorithms

Language: Python, License: Apache-2.0, 0 0 0

svs

Re-implementation of Communication-Efficient Distributed Covariance Sketch, with Application to Distributed PCA

Language: Python, License: MIT, 0 1 0

unitree_ros

License: BSD-3-Clause, 0 0 0

V2G-Predictor

An RL implementation of a V2G (vehicle-to-grid) system

Language: Python, 0 0 0