backpropper

followers

following

stars

MILA

London, United Kingdom

https://www.guabhinav.com

Abhinav Gupta's repositories

acme

A library of reinforcement learning components and agents

Language:PythonApache-2.0020

alphafold

Open source code for AlphaFold.

Apache-2.0000

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonApache-2.0010

coinrun

Code for the paper "Quantifying Transfer in Reinforcement Learning"

MIT000

deep-rl-class

This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.

010

dm-haiku

JAX-based neural network library

Apache-2.0000

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Apache-2.0000

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Apache-2.0000

enn

Language:PythonApache-2.0010

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonApache-2.0010

football

Check out the new game server:

Apache-2.0000

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Apache-2.0000

idaac

MIT000

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Apache-2.0000

KAT

Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"

Language:Python000

lab2d

A customisable 2D platform for agent-based AI research

Language:C++Apache-2.0010

mctx

Monte Carlo tree search in JAX

Language:PythonApache-2.0010

meltingpot

Apache-2.0000

mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Language:CApache-2.0010

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

MIT000

nle

The NetHack Learning Environment

Language:CNOASSERTION010

objax

Language:PythonApache-2.0010

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonApache-2.0010

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonMIT010

rlax

Language:PythonApache-2.0010

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookApache-2.0010

rlpyt

Reinforcement Learning in PyTorch

Language:PythonMIT010

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

MIT000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0010

xmanager

A platform for managing machine learning experiments

Apache-2.0000