Yangli0505

followers

following

stars

China

yangli's starred repositories

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT11911 91 362

optuna

A hyperparameter optimization framework

Language:PythonMIT10626 117 1669

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonMIT1280 18 340

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Language:PythonMIT1129 32 86

examples

Example deep learning projects that use wandb's features.

Language:Jupyter Notebook1118 16 84

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Language:Python872 26 20

RLcode

Language:Python854 2 8

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookApache-2.0753 11 17

rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Language:Jupyter NotebookMIT603 11 13

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Language:PythonMIT464 15 139

Carla-ppo

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.

Language:PythonMIT227 4 25

rl-colab-notebooks

Colab notebooks part of the documentation of Stable Baselines reinforcement learning library

Language:Jupyter NotebookMIT202 6 6

Reinforcement-Learning-Pytorch-Cartpole

Simple Cartpole example writed with pytorch.

Language:PythonMIT165 9 3

stable-baselines-zh

Stable Baselines官方文档中文版

Language:Python93 10

hierarchical-deep-RL

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation

Language:Lua86 14 1

HRL-Rec

"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580

Language:PythonApache-2.052 5 6

WCSAC

Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"

Language:PythonMIT50 4 5

rlrd

PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)

Language:PythonMIT38 6 1

DrawFigureForPaper

Some python scripts for drawing figures in scientific papers

Language:Python26 10

eagerx_tutorials

Tutorials on how to use EAGERx

Language:Jupyter NotebookApache-2.016 1 2

Reward-shaping-to-improve-the-performance-of-DRL-in-inventory-management

Link to paper: https://www.ssrn.com/abstract=3804655

Language:Python1300

drqn

Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert

Language:PythonMIT8 10

gym-industrial

A fork of the Industrial Benchmark, refactored and packaged for PyPI

Language:PythonApache-2.04 10

jvgemert.github.io

Language:HTML4 20

action-balance-exploration

Language:PythonGPL-3.04 20

IAOP

Language:C++3 3 1

RL-POMDP-MEM

Memory-based approaches to Reinforcement learning for POMDPs

Language:Jupyter Notebook3 20

highway-env

A minimalist environment for decision-making in autonomous driving

Language:PythonMIT100

thiagopbueno.github.io

About me page!

Language:CSS100

sac_discrete

SAC discrete action space

Language:Python100