Haiyan Yin (haiyanyin)

haiyanyin

Geek Repo

Company:Nanyang Technological University

Location:Singapore

Github PK Tool:Github PK Tool

Haiyan Yin's starred repositories

lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Language:PythonLicense:MITStargazers:167Issues:0Issues:0

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

License:Apache-2.0Stargazers:591Issues:0Issues:0

gym-cooking

gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.

Language:PythonLicense:MITStargazers:179Issues:0Issues:0
Language:PythonLicense:MITStargazers:2397Issues:0Issues:0

FQF-and-Extensions

PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Dueling Networks, and parallelization.

Language:Jupyter NotebookLicense:MITStargazers:25Issues:0Issues:0

rljax

A collection of RL algorithms written in JAX.

Language:PythonLicense:MITStargazers:90Issues:0Issues:0

envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

Language:C++License:Apache-2.0Stargazers:1027Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

discovering-reinforcement-learning-algorithms

A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.

Language:PythonLicense:Apache-2.0Stargazers:20Issues:0Issues:0

spr

Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"

Language:PythonLicense:MITStargazers:156Issues:0Issues:0

DistributedRL-Pytorch-Ray

Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)

Language:PythonLicense:MITStargazers:29Issues:0Issues:0

dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Language:PythonLicense:Apache-2.0Stargazers:430Issues:0Issues:0

moolib

A library for distributed ML training with PyTorch

Language:C++License:MITStargazers:365Issues:0Issues:0

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Language:Jupyter NotebookLicense:MITStargazers:146Issues:0Issues:0

warpgrad

Meta-Learning with Warped Gradient Descent

Language:PythonLicense:Apache-2.0Stargazers:90Issues:0Issues:0

torchbeast

A PyTorch Platform for Distributed RL

Language:PythonLicense:Apache-2.0Stargazers:735Issues:0Issues:0

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language:PythonLicense:MITStargazers:755Issues:0Issues:0

adeptRL

Reinforcement learning framework to accelerate research

Language:PythonLicense:GPL-3.0Stargazers:201Issues:0Issues:0
Language:PythonStargazers:44Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8103Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4039Issues:0Issues:0

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

License:Apache-2.0Stargazers:739Issues:0Issues:0

awesome-continual-learning

A repository to keep track of literature on catastrophic forgetting

Stargazers:36Issues:0Issues:0

online-continual-learning

A collection of online continual learning paper implementations and tricks for computer vision in PyTorch, including our ASER(AAAI-21), SCR(CVPR21-W) and an online continual learning survey (Neurocomputing).

Language:PythonStargazers:360Issues:0Issues:0

babywalk

PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"

Language:PythonLicense:MITStargazers:40Issues:0Issues:0

merlin

(NeurIPS 2020) Meta-Consolidation for Continual Learning

Language:PythonStargazers:36Issues:0Issues:0

matching-networks-pytorch

Matching Networks for one shot learning

Language:PythonStargazers:228Issues:0Issues:0

PlaNet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language:PythonLicense:MITStargazers:354Issues:0Issues:0

slac.pytorch

PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).

Language:PythonLicense:MITStargazers:84Issues:0Issues:0

deep_bisim4control

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Language:PythonLicense:NOASSERTIONStargazers:135Issues:0Issues:0