Stone Tao's starred repositories

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:80673Issues:1740Issues:43489

resume.github.com

Resumes generated using the GitHub informations

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4964Issues:35Issues:176

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:3438Issues:83Issues:264

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonLicense:Apache-2.0Stargazers:1486Issues:61Issues:31

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1164Issues:26Issues:57

envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

Language:C++License:Apache-2.0Stargazers:1049Issues:23Issues:137
Language:PythonLicense:Apache-2.0Stargazers:747Issues:18Issues:45

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:726Issues:11Issues:15

ManiSkill

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Language:PythonLicense:Apache-2.0Stargazers:594Issues:17Issues:191
Language:PythonLicense:Apache-2.0Stargazers:521Issues:19Issues:44

salina

a Lightweight library for sequential learning agents, including reinforcement learning

Language:PythonLicense:MITStargazers:426Issues:13Issues:12

lleaves

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ā‰„10x.

Language:PythonLicense:MITStargazers:330Issues:10Issues:42

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Language:PythonLicense:MITStargazers:313Issues:6Issues:18

glum

High performance Python GLMs with all the features!

Language:PythonLicense:BSD-3-ClauseStargazers:292Issues:15Issues:295

DeepRLInTheWorld

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

ARM

Q-attention (within the ARM system) and coarse-to-fine Q-attention (within C2F-ARM system).

Language:PythonLicense:NOASSERTIONStargazers:152Issues:7Issues:14

scikit.js

JavaScript package for predictive data analysis and machine learning

Language:TypeScriptLicense:MITStargazers:126Issues:6Issues:37

tabmat

Efficient matrix representations for working with tabular data

Language:PythonLicense:BSD-3-ClauseStargazers:106Issues:14Issues:83

metamorph

Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Language:Jupyter NotebookLicense:MITStargazers:98Issues:3Issues:1

alm

Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective

Language:PythonLicense:MITStargazers:77Issues:4Issues:0
Language:PythonLicense:Apache-2.0Stargazers:76Issues:5Issues:27

rl3d

[RA-L 2023 & IROS 2023] Visual Reinforcement Learning with Self-Supervised 3D Representations

Language:PythonLicense:MITStargazers:75Issues:5Issues:3

sapai

Super auto pets engine built with reinforment learning training in mind

Language:PythonLicense:MITStargazers:65Issues:5Issues:56

entity-factored-rl

Source code for the paper "Policy Architectures for Compositional Generalization in Control"

Language:PythonLicense:NOASSERTIONStargazers:29Issues:8Issues:4

acm-ai-workshops

Repository of AI resources from workshops hosted by ACM AI at UCSD šŸ§ 

Language:Jupyter NotebookStargazers:21Issues:8Issues:12

jax-bandits

bandit algorithms in jax

Language:PythonLicense:MITStargazers:2Issues:2Issues:0