koulanurag

Anurag Koul's repositories

ma-gym

A collection of multi agent environments based on OpenAI gym.

Language:PythonApache-2.0544 7 28

muzero-pytorch

Pytorch Implementation of MuZero

Language:PythonMIT324 21 6

minimal-marl

Minimal implementation of multi-agent reinforcement learning algorithms

Language:PythonMIT46 3 4

mmn

Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks

Language:Python46 5 3

visTorch

Interacting with Latent Space of AutoEncoder

Language:Python21 3 1

conformal

Conformal prediction is a framework for providing accuracy guarantees on the predictions of a base predictor

Language:PythonNOASSERTION10 5 1

dream-and-search

Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"

Language:Python10 40

gym-cartpole-continuous

CartPole env. with continuous action space

Language:PythonMIT7 30

gym_x

Gym environments for capture properties of hidden states(hx) of recurrent networks.

Language:Python5 40

marl-pytorch

Pytorch Implementations of Multi Agent Reinforcement Learning(marl) algorithms

Language:Python5 30

deep-conformal

Applying Conformal Prediction over Deep Neural Nets

Language:Python4 30

opcc

Benchmark for "Offline Policy Comparison with Confidence"

Language:PythonApache-2.03 10

policybazaar

A collection of multi-quality policies for continuous control tasks.

Language:PythonApache-2.03 40

variable-td3

Learning n-step actions for control tasks

Language:PythonMIT2 50

maze-world

Random maze environments with different size and complexity for reinforcement learning research.

Language:PythonApache-2.0100

opcc-baselines

Baselines for "Offline Policy Comparison with Confidence"

Language:PythonApache-2.01 10

pfa

Policy Fusion Architecture (PFA): We investigate policy gradient approaches for reward decomposition in reinforcement Learning

Language:Python1 30

tensorboard2seaborn

Plot Tensorflow Summary Event in a Beautiful Way 🌈

Language:PythonNOASSERTION1 20

vpn

PyTorch implementation of Value Prediction Network (VPN) :construction: :construction_worker:

Language:Python1 40

pid-pendulum

PID controller for open-ai gym's Pendulum.

Language:Jupyter Notebook040

abp

A library to create adaptive programs (abp) via Reinforcement Learning

Language:PythonMIT050

bmi

BMI Dashboard using NodeJs

Language:JavaScript030

card-arrangement-game

Card Arrangement Game to introduce statistical notions in fun way :game_die: :black_joker: :slot_machine:

Language:CSSNOASSERTION030

chatter-nodejs

Trying to make a chat channel similar to IRC. (Inspired by usage of slack)

Language:JavaScript030

d4rl

A benchmark for offline reinforcement learning.

Language:PythonApache-2.0020

device-config-app

Primary purpose of app is to configure Echo Sounders.

Language:CSS030

gym-sokoban

Sokoban environment for OpenAI Gym

Language:PythonMIT010

sokoban-bazaar

Language:PythonMIT000

sweatram_mean

SweatRam's Dashboard using a Mean Stack

Language:HTML030

tweet-node

This project is to analyse real time tweets

Language:CSS030