wbrenton

wbrenton's repositories

nanax

Minimal implementations of various deep learning architectures and training procedures in JAX (Flax)

Language:Python100

cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Language:PythonNOASSERTION000

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION000

dm-haiku

JAX-based neural network library

Language:PythonApache-2.0000

ent-reg-marl

Language:Python000

jax_sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Language:PythonApache-2.0000

lm-human-preference-details

[lm-human-preference-details](https://github.com/vwxyzjn/lm-human-preference-details) ported to JAX

Language:PythonMIT000

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0000

openrlbenchmark

Language:PythonMIT000

quick_tpu

Git clone this repo to develop JAX on TPU

Language:Shell010