manila95

Mila - Quebec AI Institute

Montreal

https://mani.github.io

Organizations

K-DAG

Kaustubh Mani's starred repositories

awesome-rl-envs

Multi-Agent-Reinforcement-Learning-Environment

Hello, I pushed some Python environments for multi-agent reinforcement learning.

Language: Python · 661 9 7

pgx

♟️ Vectorized RL game environments in JAX

Language: Python · Apache-2.0 · 371 8 239

gradient-descent-the-ultimate-optimizer

Code for our NeurIPS 2022 paper

Language: Python · MIT · 362 50

awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

Apache-2.0 · 355 60

modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Language: Jupyter Notebook · NOASSERTION · 211 11 9

Deep_RL_with_pytorch

A PyTorch tutorial for DRL (Deep Reinforcement Learning)

Language: Jupyter Notebook · 202 7 1

mc_gradients

Language: Jupyter Notebook · Apache-2.0 · 157 11 2

OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch

Language: Python · Apache-2.0 · 153 4 18

navlie

A state estimation package for Lie groups!

Language: Python · MIT · 147 4 27

CARL

Benchmarking RL generalization in an interpretable way.

Language: Python · Apache-2.0 · 123 11 47

CQL

Conservative Q Learning on top of SAC

Language: Python · MIT · 116 5 7

CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Language: Python · 110 4 7

PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA

Language: Python · 70 2 4

exploring_exploration

This repository contains code for our work "An Exploration of Embodied Visual Exploration".

Language: Python · NOASSERTION · 62 8 2

python-psignifit

Python clone of psignifit providing basic functionality

Language: Python · 54 8 36

JaxCQL

Conservative Q-learning in JAX

Language: Python · MIT · 47 3 4

PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Language: Python · 40 2 2

safe_rl

Implementations of SAILR, PDO, and CSC

Language: Python · MIT · 29 1 5

mbppol

Code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm", accepted at NeurIPS 2022.

Language: Python · MIT · 24 2 1

TOP

Implementation of Tactical Optimistic and Pessimistic value estimation

Language: Python · 23 1 3

pointMass

pointMass pybullet RL environment for simple experiments

Language: Python · 20 40

ReLMM

Codebase for ReLMM

Language: Python · NOASSERTION · 19 2 4

MFNLC

[IROS '22] Model-free Neural Lyapunov Control

Language: Python · 1602

Safe-panda-gym

OpenAI Gym Franka Emika Panda robot environment based on PyBullet.

Language: Python · MIT · 1100

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language: Python · MIT · 800

mesa-safe-rl

Language: Python · 7 10

random-network-distillation-pytorch

Implementation of random network distillation. Paper: https://arxiv.org/abs/1810.12894

Language: Python · 6 2 1

sqrl

Language: Python · 1 10

hydra_mnist

Example usage of Hydra on HPC clusters using Singularity or venv

Language: Python · MIT · 1 40