Abhinav Gupta (backpropper)

backpropper

Geek Repo

Company:MILA

Location:London, United Kingdom

Home Page:https://www.guabhinav.com

Twitter:@backpropper

Github PK Tool:Github PK Tool

Abhinav Gupta's repositories

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

alphafold

Open source code for AlphaFold.

License:Apache-2.0Stargazers:0Issues:0Issues:0

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

coinrun

Code for the paper "Quantifying Transfer in Reinforcement Learning"

License:MITStargazers:0Issues:0Issues:0

deep-rl-class

This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.

Stargazers:0Issues:1Issues:0

dm-haiku

JAX-based neural network library

License:Apache-2.0Stargazers:0Issues:0Issues:0

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

License:Apache-2.0Stargazers:0Issues:0Issues:0

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

football

Check out the new game server:

License:Apache-2.0Stargazers:0Issues:0Issues:0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

License:Apache-2.0Stargazers:0Issues:0Issues:0

KAT

Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"

Language:PythonStargazers:0Issues:0Issues:0

lab2d

A customisable 2D platform for agent-based AI research

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

mctx

Monte Carlo tree search in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Language:CLicense:Apache-2.0Stargazers:0Issues:1Issues:0

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

License:MITStargazers:0Issues:0Issues:0

nle

The NetHack Learning Environment

Language:CLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

rlpyt

Reinforcement Learning in PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

License:MITStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

xmanager

A platform for managing machine learning experiments

License:Apache-2.0Stargazers:0Issues:0Issues:0