fuyw's repositories

RepL4RL

Representation Learning for RL

jrlzoo

A collection of RL baselines in Jax.

Language:PythonStargazers:2Issues:1Issues:0

CARL

Benchmarking RL generalization in an interpretable way.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dreamerv3

Mastering Diverse Domains through World Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

drl-memory-gym

Challenging Memory-based Deep Reinforcement Learning Agents

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

easytrader

提供同花顺客户端/国金/华泰客户端/雪球的基金、股票自动程序化交易以及自动打新,支持跟踪 joinquant /ricequant 模拟交易 和 实盘雪球组合, 量化交易组件

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

fastbook

The fastai book, published as Jupyter Notebooks

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Hsuanwu

Long-Term Evolution Project of Reinforcement Learning

Language:C++License:MITStargazers:0Issues:0Issues:0

hyper-nn

Easy Hypernetworks in Pytorch and Jax

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

HyQ

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

lleaves

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LogicStack-LeetCode

公众号「宫水三叶的刷题日记」刷穿 LeetCode 系列文章源码

License:Apache-2.0Stargazers:0Issues:0Issues:0

mammoth

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NECSA

Official implementation of Neural Episodic Control with State Abstraction

Language:PythonStargazers:0Issues:0Issues:0

outer-value-function-meta-rl

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

pgx

A collection of highly-parallel RL game environments written in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

plan2explore

Repository for the paper "Planning to Explore via Self-Supervised World Models"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pml-book

"Probabilistic Machine Learning" - a book series by Kevin Murphy

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

pyprobml

Python code for "Probabilistic Machine learning" book by Kevin Murphy

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TabPFN

Official implementation of the TabPFN and the tabpfn package.

Language:PythonStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language:PythonLicense:MITStargazers:0Issues:0Issues:0