Kaustubh Mani (manila95)

manila95

Geek Repo

Company:Mila - Quebec AI Institute

Location:Montreal

Home Page:https://mani.github.io

Github PK Tool:Github PK Tool


Organizations
K-DAG

Kaustubh Mani's starred repositories

Multi-Agent-Reinforcement-Learning-Environment

Hello, I pushed some python environments for Multi Agent Reinforcement Learning.

pgx

♟️ Vectorized RL game environments in JAX

Language:PythonLicense:Apache-2.0Stargazers:371Issues:8Issues:239

gradient-descent-the-ultimate-optimizer

Code for our NeurIPS 2022 paper

Language:PythonLicense:MITStargazers:362Issues:5Issues:0

awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

License:Apache-2.0Stargazers:355Issues:6Issues:0

modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:211Issues:11Issues:9

Deep_RL_with_pytorch

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Language:Jupyter NotebookStargazers:202Issues:7Issues:1
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:157Issues:11Issues:2

OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:153Issues:4Issues:18

navlie

A state estimation package for Lie groups!

Language:PythonLicense:MITStargazers:147Issues:4Issues:27

CARL

Benchmarking RL generalization in an interpretable way.

Language:PythonLicense:Apache-2.0Stargazers:123Issues:11Issues:47

CQL

Conservative Q Learning on top of SAC

Language:PythonLicense:MITStargazers:116Issues:5Issues:7

CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

exploring_exploration

This paper contains code for our work "An Exploration of Embodied Visual Exploration".

Language:PythonLicense:NOASSERTIONStargazers:62Issues:8Issues:2

python-psignifit

Python clone of psignifit providing basic functionality

JaxCQL

Conservative Q learning in Jax

Language:PythonLicense:MITStargazers:47Issues:3Issues:4

PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

safe_rl

Implementations of SAILR, PDO, and CSC

Language:PythonLicense:MITStargazers:29Issues:1Issues:5

mbppol

This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm" accepted at NeurIPS 2022.

Language:PythonLicense:MITStargazers:24Issues:2Issues:1

TOP

Implementation of Tactical Optimistic and Pessimistic value estimation

pointMass

pointMass pybullet RL environment for simple experiments

Language:PythonStargazers:20Issues:4Issues:0

ReLMM

Codebase for ReLMM

Language:PythonLicense:NOASSERTIONStargazers:19Issues:2Issues:4

MFNLC

[IROS 22'] Model-free Neural Lyapunov Control

Language:PythonStargazers:16Issues:0Issues:2

Safe-panda-gym

OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.

Language:PythonLicense:MITStargazers:11Issues:0Issues:0

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language:PythonLicense:MITStargazers:8Issues:0Issues:0
Language:PythonStargazers:7Issues:1Issues:0

random-network-distillation-pytorch

Implementation of random network distillation. paper link: https://arxiv.org/abs/1810.12894

Language:PythonStargazers:1Issues:1Issues:0

hydra_mnist

Example usage of Hydra for HPC clusters using Singularity or Venv

Language:PythonLicense:MITStargazers:1Issues:4Issues:0