Linjie Xu's repositories

MCEP

Source code for our paper: Mildly Constrained Evaluation Policy for Offline Reinforcement Learning

Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0

Stratega

Documentation: https://stratega.readthedocs.io/en/latest/

Language: C++ | Stargazers: 1 | Issues: 1 | Issues: 0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

D4RL

A collection of reference environments for offline reinforcement learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

egg-blog

Post out small science

Language: SCSS | Stargazers: 0 | Issues: 1 | Issues: 0

impact-driven-exploration

impact-driven-exploration

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

edp

[NeurIPS 2023] Efficient Diffusion Policy

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

epymarl

An extension of the PyMARL codebase that includes additional algorithms and environment support

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

homomorphic_policy_gradient

Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

IDQL

Repo for Implicit Diffusion Q-Learning

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LaMo-2023

Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

latentplan

Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023): https://arxiv.org/abs/2208.10291

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm

A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preference space in a given domain.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pymarl

Python Multi-Agent Reinforcement Learning framework

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

rlkit

Collection of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

spr

Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SynthER

Synthetic Experience Replay

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TD7

Author's PyTorch implementation of TD7 for online and offline RL

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

VE-principle-for-model-based-RL

Repository for the ML Reproducibility Challenge 2020, covering the NeurIPS paper "The Value Equivalence Principle for Model-Based Reinforcement Learning"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

XQL

Extreme Q-Learning: Max Entropy RL without Entropy

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0