yangli (Yangli0505)

Yangli0505

Geek Repo

Location:China

Github PK Tool:Github PK Tool

yangli's starred repositories

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:11911Issues:91Issues:362

optuna

A hyperparameter optimization framework

Language:PythonLicense:MITStargazers:10626Issues:117Issues:1669

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonLicense:MITStargazers:1280Issues:18Issues:340

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Language:PythonLicense:MITStargazers:1129Issues:32Issues:86

examples

Example deep learning projects that use wandb's features.

Language:Jupyter NotebookStargazers:1118Issues:16Issues:84

random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:753Issues:11Issues:17

rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Language:Jupyter NotebookLicense:MITStargazers:603Issues:11Issues:13

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Language:PythonLicense:MITStargazers:464Issues:15Issues:139

Carla-ppo

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.

Language:PythonLicense:MITStargazers:227Issues:4Issues:25

rl-colab-notebooks

Colab notebooks part of the documentation of Stable Baselines reinforcement learning library

Language:Jupyter NotebookLicense:MITStargazers:202Issues:6Issues:6

Reinforcement-Learning-Pytorch-Cartpole

Simple Cartpole example writed with pytorch.

Language:PythonLicense:MITStargazers:165Issues:9Issues:3

stable-baselines-zh

Stable Baselines官方文档中文版

Language:PythonStargazers:93Issues:1Issues:0

hierarchical-deep-RL

Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation

HRL-Rec

"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580

Language:PythonLicense:Apache-2.0Stargazers:52Issues:5Issues:6

WCSAC

Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"

Language:PythonLicense:MITStargazers:50Issues:4Issues:5

rlrd

PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)

Language:PythonLicense:MITStargazers:38Issues:6Issues:1

DrawFigureForPaper

Some python scripts for drawing figures in scientific papers

Language:PythonStargazers:26Issues:1Issues:0

eagerx_tutorials

Tutorials on how to use EAGERx

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:16Issues:1Issues:2

Reward-shaping-to-improve-the-performance-of-DRL-in-inventory-management

Link to paper: https://www.ssrn.com/abstract=3804655

Language:PythonStargazers:13Issues:0Issues:0

drqn

Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert

Language:PythonLicense:MITStargazers:8Issues:1Issues:0

gym-industrial

A fork of the Industrial Benchmark, refactored and packaged for PyPI

Language:PythonLicense:Apache-2.0Stargazers:4Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:4Issues:2Issues:0

RL-POMDP-MEM

Memory-based approaches to Reinforcement learning for POMDPs

Language:Jupyter NotebookStargazers:3Issues:2Issues:0

highway-env

A minimalist environment for decision-making in autonomous driving

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:CSSStargazers:1Issues:0Issues:0

sac_discrete

SAC discrete action space

Language:PythonStargazers:1Issues:0Issues:0