
fuyw's repositories

RepL4RL

Representation Learning for RL

110 70

jrlzoo

A collection of RL baselines in JAX.

Language: Python | 8 2 2

rlmc

Language: Python | 2 10

CARL

Benchmarking RL generalization in an interpretable way.

Language: Python | License: Apache-2.0 | 0 0 0

dreamerv3

Mastering Diverse Domains through World Models

Language: Python | License: MIT | 0 0 0

drl-memory-gym

Challenging Memory-based Deep Reinforcement Learning Agents

Language: Python | License: MIT | 0 0 0

easytrader

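Automated programmatic trading of funds and stocks through the Tonghuashun (同花顺) client, the Guojin (国金) and Huatai (华泰) clients, and Xueqiu (雪球), plus automatic IPO subscription. Supports following joinquant/ricequant simulated trading and live Xueqiu portfolios. A quantitative trading component.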

Language: Python | License: MIT | 0 0 0

fastbook

The fastai book, published as Jupyter Notebooks

Language: Jupyter Notebook | License: NOASSERTION | 0 0 0

Hsuanwu

Long-Term Evolution Project of Reinforcement Learning

Language: C++ | License: MIT | 0 0 0

hyper-nn

Easy Hypernetworks in PyTorch and JAX

Language: Jupyter Notebook | License: MIT | 0 0 0

HyQ

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

Language: Python | 0 0 0

inac_pytorch

Language: Python | 0 0 0

IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Language: Python | License: MIT | 0 0 0

lge

Language: Jupyter Notebook | License: MIT | 0 0 0

lleaves

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Language: Python | License: MIT | 0 0 0

LogicStack-LeetCode

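Source code for the "Grinding Through LeetCode" article series from the WeChat official account 「宫水三叶的刷题日记」 (Gong Shui San Ye's problem-solving diary)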

License: Apache-2.0 | 0 0 0

mammoth

An Extendible (General) Continual Learning Framework based on PyTorch - official codebase of Dark Experience for General Continual Learning

Language: Python | License: MIT | 0 0 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | 0 0 0

NECSA

Official implementation of Neural Episodic Control with State Abstraction

Language: Python | 0 0 0

outer-value-function-meta-rl

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

Language: Jupyter Notebook | 0 0 0

pgx

A collection of highly-parallel RL game environments written in JAX

Language: Python | License: Apache-2.0 | 0 0 0

plan2explore

Repository for the paper "Planning to Explore via Self-Supervised World Models"

Language: Python | License: Apache-2.0 | 0 0 0

pml-book

"Probabilistic Machine Learning" - a book series by Kevin Murphy

Language: Jupyter Notebook | License: MIT | 0 0 0

pyprobml

Python code for the "Probabilistic Machine Learning" book by Kevin Murphy

Language: Jupyter Notebook | License: MIT | 0 0 0

rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Language: Python | License: MIT | 0 0 0

rlpd

Language: Python | License: MIT | 0 0 0

TabPFN

Official implementation of the TabPFN and the tabpfn package.

Language: Python | 0 0 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | 0 0 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | 0 0 0

v-d4rl

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Language: Python | License: MIT | 0 0 0