jens321

Jens Tuyls's starred repositories

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python · License: MIT · 1192700

PufferLib

Simplifying reinforcement learning for complex game environments

Language: Python · License: MIT · 61500

gymnax

RL Environments in JAX 🌍

Language: Python · License: Apache-2.0 · 57800

purejaxrl

Really Fast End-to-End Jax RL Implementations

Language: Python · License: Apache-2.0 · 62000

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python · License: NOASSERTION · 490800

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language: Python · License: MIT · 68700

intelligent-go-explore

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Language: Inform 7 · License: MIT · 3900

xlstm

Official repository of the xLSTM.

Language: Python · License: AGPL-3.0 · 97500

pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Language: Python · License: MIT · 28200

wandb-offline-sync-hook

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Language: Python · License: MIT · 4300

nle

The NetHack Learning Environment

Language: C · License: NOASSERTION · 3200

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language: Python · License: MIT · 15700

diff_history

[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Language: Python · License: MIT · 1700

il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in NetHack"

Language: Python · 300

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language: Python · License: MIT · 400

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language: Cuda · License: BSD-3-Clause · 22500

lwm

We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Language: Python · License: GPL-3.0 · 900

HIQL

HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)

Language: Python · License: MIT · 6600

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Language: Python · License: NOASSERTION · 3700

procthor

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Language: Python · License: Apache-2.0 · 24300

controllable_agent

The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.

Language: Python · License: MIT · 5500

VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Language: Python · License: MIT · 71500

hihack

[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)

Language: Python · License: MIT · 900

quasimetric-rl

Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023

Language: Python · License: MIT · 3900

acme

A library of reinforcement learning components and agents

Language: Python · License: Apache-2.0 · 342900

broken_neural_scaling_laws

Code Release for "Broken Neural Scaling Laws" (BNSL) paper

Language: Python · 5600

VIMABench

Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Language: Python · License: MIT · 24700