Jens Tuyls's starred repositories
flash-attention
Fast and memory-efficient exact attention
intelligent-go-explore
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
wandb-offline-sync-hook
A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!
diff_history
[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
il-scaling-in-games
Official code repo of "Scaling Laws for Imitation Learning in NetHack"
sample-factory
High throughput synchronous and asynchronous reinforcement learning
causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
controllable_agent
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
quasimetric-rl
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper