jens321

Jens Tuyls's starred repositories

flash-attention

Fast and memory-efficient exact attention

Language: Python | BSD-3-Clause | 1280400

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python | MIT | 1222400

PufferLib

Simplifying reinforcement learning for complex game environments

Language: Python | MIT | 75800

gymnax

RL Environments in JAX 🌍

Language: Python | Apache-2.0 | 58800

purejaxrl

Really Fast End-to-End Jax RL Implementations

Language: Python | Apache-2.0 | 65200

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | NOASSERTION | 503600

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language: Python | MIT | 73400

intelligent-go-explore

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Language: Inform 7 | MIT | 3900

xlstm

Official repository of the xLSTM.

Language: Python | AGPL-3.0 | 109500

pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Language: Python | MIT | 28700

wandb-offline-sync-hook

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Language: Python | MIT | 4600

nle

The NetHack Learning Environment

Language: C | NOASSERTION | 3700

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language: Python | MIT | 17900

diff_history

[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Language: Python | MIT | 1700

il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in NetHack"

Language: Python | 400

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language: Python | MIT | 400

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language: Cuda | BSD-3-Clause | 24900

lwm

We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Language: Python | GPL-3.0 | 900

HIQL

HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)

Language: Python | MIT | 6700

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Language: Python | NOASSERTION | 3700

procthor

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Language: Python | Apache-2.0 | 24800

The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.

Language: Python | MIT | 5500

VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Language: Python | MIT | 73200

hihack

[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)

Language: Python | MIT | 900

quasimetric-rl

Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023

Language: Python | MIT | 3900

acme

A library of reinforcement learning components and agents

Language: Python | Apache-2.0 | 344700

broken_neural_scaling_laws

Code Release for "Broken Neural Scaling Laws" (BNSL) paper

Language: Python | 5600