Jens Tuyls's starred repositories

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:11927Issues:0Issues:0
Language:PythonLicense:MITStargazers:39Issues:0Issues:0

PufferLib

Simplifying reinforcement learning for complex game environments

Language:PythonLicense:MITStargazers:615Issues:0Issues:0

gymnax

RL Environments in JAX 🌍

Language:PythonLicense:Apache-2.0Stargazers:578Issues:0Issues:0

purejaxrl

Really Fast End-to-End Jax RL Implementations

Language:PythonLicense:Apache-2.0Stargazers:620Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4908Issues:0Issues:0

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language:PythonLicense:MITStargazers:687Issues:0Issues:0

intelligent-go-explore

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Language:Inform 7License:MITStargazers:39Issues:0Issues:0

xlstm

Official repository of the xLSTM.

Language:PythonLicense:AGPL-3.0Stargazers:975Issues:0Issues:0

pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Language:PythonLicense:MITStargazers:282Issues:0Issues:0

wandb-offline-sync-hook

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Language:PythonLicense:MITStargazers:43Issues:0Issues:0

nle

The NetHack Learning Environment

Language:CLicense:NOASSERTIONStargazers:32Issues:0Issues:0
Language:PythonStargazers:10Issues:0Issues:0

Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Language:PythonLicense:MITStargazers:157Issues:0Issues:0

diff_history

[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Language:PythonLicense:MITStargazers:17Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:23Issues:0Issues:0

il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in NetHack"

Language:PythonStargazers:3Issues:0Issues:0

sample-factory

High throughput synchronous and asynchronous reinforcement learning

Language:PythonLicense:MITStargazers:4Issues:0Issues:0

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language:CudaLicense:BSD-3-ClauseStargazers:225Issues:0Issues:0

lwm

We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Language:PythonLicense:GPL-3.0Stargazers:9Issues:0Issues:0

HIQL

HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)

Language:PythonLicense:MITStargazers:66Issues:0Issues:0

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Language:PythonLicense:NOASSERTIONStargazers:37Issues:0Issues:0

procthor

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Language:PythonLicense:Apache-2.0Stargazers:243Issues:0Issues:0

controllable_agent

The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.

Language:PythonLicense:MITStargazers:55Issues:0Issues:0

VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Language:PythonLicense:MITStargazers:715Issues:0Issues:0

hihack

[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

quasimetric-rl

Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:3429Issues:0Issues:0

broken_neural_scaling_laws

Code Release for "Broken Neural Scaling Laws" (BNSL) paper

Language:PythonStargazers:56Issues:0Issues:0

VIMABench

Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Language:PythonLicense:MITStargazers:247Issues:0Issues:0