Beast code in Giters

Haiyan Yin's starred repositories

lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Language:PythonMIT16700

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

Apache-2.059100

gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.

Language:PythonMIT17900

muzero-general

MuZero

Language:PythonMIT239700

FQF-and-Extensions

PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Dueling Networks, and parallelization.

Language:Jupyter NotebookMIT2500

rljax

A collection of RL algorithms written in JAX.

Language:PythonMIT9000

envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

Language:C++Apache-2.0102700

verify_rl_torch

Language:PythonMIT100

discovering-reinforcement-learning-algorithms

A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.

Language:PythonApache-2.02000

spr

Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"

Language:PythonMIT15600

DistributedRL-Pytorch-Ray

Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)

Language:PythonMIT2900

dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Language:PythonApache-2.043000

moolib

A library for distributed ML training with PyTorch

Language:C++MIT36500

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Language:Jupyter NotebookMIT14600

warpgrad

Meta-Learning with Warped Gradient Descent

Language:PythonApache-2.09000

torchbeast

A PyTorch Platform for Distributed RL

Language:PythonApache-2.073500

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language:PythonMIT75500

adeptRL

Reinforcement learning framework to accelerate research

Language:PythonGPL-3.020100

minecraft_ai

Language:Python4400

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT810300

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0403900

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

Apache-2.073900

awesome-continual-learning

A repository to keep track of literature on catastrophic forgetting

3600

online-continual-learning

A collection of online continual learning paper implementations and tricks for computer vision in PyTorch, including our ASER(AAAI-21), SCR(CVPR21-W) and an online continual learning survey (Neurocomputing).

Language:Python36000