wbrenton's repositories
cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
Language:PythonNOASSERTION000
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:PythonNOASSERTION000
dm-haiku
JAX-based neural network library
Language:PythonApache-2.0000
Language:Python000
jax_sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Language:PythonApache-2.0000
lm-human-preference-details
[lm-human-preference-details](https://github.com/vwxyzjn/lm-human-preference-details) ported to JAX
Language:PythonMIT000
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++Apache-2.0000
Language:PythonMIT000