Anurag Koul (koulanurag)

koulanurag

Geek Repo

Company:Microsoft

Location:New York, New York

Home Page:https://koulanurag.dev

Twitter:@koulanurag

Github PK Tool:Github PK Tool

Anurag Koul's repositories

ma-gym

A collection of multi agent environments based on OpenAI gym.

Language:PythonLicense:Apache-2.0Stargazers:519Issues:7Issues:28

muzero-pytorch

Pytorch Implementation of MuZero

Language:PythonLicense:MITStargazers:321Issues:21Issues:6

mmn

Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks

minimal-marl

Minimal implementation of multi-agent reinforcement learning algorithms

Language:PythonLicense:MITStargazers:44Issues:3Issues:4

visTorch

Interacting with Latent Space of AutoEncoder

dream-and-search

Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"

Language:PythonStargazers:10Issues:4Issues:0

conformal

Conformal prediction is a framework for providing accuracy guarantees on the predictions of a base predictor

Language:PythonLicense:NOASSERTIONStargazers:9Issues:5Issues:1

gym-cartpole-continuous

CartPole env. with continuous action space

Language:PythonLicense:MITStargazers:7Issues:3Issues:0

gym_x

Gym environments for capture properties of hidden states(hx) of recurrent networks.

Language:PythonStargazers:5Issues:4Issues:0

marl-pytorch

Pytorch Implementations of Multi Agent Reinforcement Learning(marl) algorithms

Language:PythonStargazers:5Issues:3Issues:0

deep-conformal

Applying Conformal Prediction over Deep Neural Nets

Language:PythonStargazers:3Issues:3Issues:0

opcc

Benchmark for "Offline Policy Comparison with Confidence"

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0

policybazaar

A collection of multi-quality policies for continuous control tasks.

Language:PythonLicense:Apache-2.0Stargazers:3Issues:4Issues:0

variable-td3

Learning n-step actions for control tasks

Language:PythonLicense:MITStargazers:2Issues:5Issues:0

maze-world

Random maze environments with different size and complexity for reinforcement learning research.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

opcc-baselines

Baselines for "Offline Policy Comparison with Confidence"

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0

pfa

Policy Fusion Architecture (PFA): We investigate policy gradient approaches for reward decomposition in reinforcement Learning

Language:PythonStargazers:1Issues:3Issues:0

tensorboard2seaborn

Plot Tensorflow Summary Event in a Beautiful Way 🌈

Language:PythonLicense:NOASSERTIONStargazers:1Issues:2Issues:0

vpn

PyTorch implementation of Value Prediction Network (VPN) :construction: :construction_worker:

Language:PythonStargazers:1Issues:4Issues:0

pid-pendulum

PID controller for open-ai gym's Pendulum.

Language:Jupyter NotebookStargazers:0Issues:4Issues:0

abp

A library to create adaptive programs (abp) via Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:5Issues:0

bmi

BMI Dashboard using NodeJs

Language:JavaScriptStargazers:0Issues:3Issues:0

card-arrangement-game

Card Arrangement Game to introduce statistical notions in fun way :game_die: :black_joker: :slot_machine:

Language:CSSLicense:NOASSERTIONStargazers:0Issues:3Issues:0

chatter-nodejs

Trying to make a chat channel similar to IRC. (Inspired by usage of slack)

Language:JavaScriptStargazers:0Issues:3Issues:0

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

data-science-coursera

Tracking progress in data science course on Coursera

Stargazers:0Issues:3Issues:0

device-config-app

Primary purpose of app is to configure Echo Sounders.

Language:CSSStargazers:0Issues:3Issues:0

gym-sokoban

Sokoban environment for OpenAI Gym

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

sweatram_mean

SweatRam's Dashboard using a Mean Stack

Language:HTMLStargazers:0Issues:3Issues:0

tweet-node

This project is to analyse real time tweets

Language:CSSStargazers:0Issues:3Issues:0