wbrenton's repositories

nanax

Minimal implementations of various deep learning architectures and training procedures in JAX (Flax)

Language:PythonStargazers:1Issues:0Issues:0

cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

dm-haiku

JAX-based neural network library

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

jax_sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lm-human-preference-details

[lm-human-preference-details](https://github.com/vwxyzjn/lm-human-preference-details) ported to JAX

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

quick_tpu

Git clone this repo to develop JAX on TPU

Language:ShellStargazers:0Issues:1Issues:0