Ashique Rupam Mahmood's repositories

totd-rndmdp-experiments

Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)

Language: Python | Stargazers: 8 | Issues: 4 | Issues: 0

nonstationary-experiments

A bunch of stationary and nonstationary experiments comparing LMS, RLS, IDBD and Autostep.

Language: Python | Stargazers: 7 | Issues: 2 | Issues: 0

wislstd-experiments

Random walk experiments on WIS-LSTD by Mahmood, van Hasselt, and Sutton (NIPS 2014)

Language: Python | Stargazers: 2 | Issues: 2 | Issues: 0

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

DetectAndTrack

The implementation of an algorithm presented in the CVPR18 paper: "Detect-and-Track: Efficient Pose Estimation in Videos"

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 2 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

notebooks-1

An attempt to formalize my thoughts. A pythonic approach to mental housekeeping

Language: Jupyter Notebook | Stargazers: 1 | Issues: 2 | Issues: 0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: C++ | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0

rnnlm

Tomas Mikolov's Recurrent Neural Networks Language Modeling Toolkit from http://www.rnnlm.org, with tagged historical releases.

Language: C++ | License: BSD-3-Clause | Stargazers: 1 | Issues: 2 | Issues: 0

scipy

Scipy library main repository

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

usage-td-experiments

Random MDP experiments on the usage-based step-size adaptation idea by Mahmood and Sutton (2015)

Language: Python | Stargazers: 1 | Issues: 2 | Issues: 0

wis-td-experiments

Random MDP experiments on the WIS-based off-policy algorithms by Mahmood and Sutton (2015)

Language: Python | Stargazers: 1 | Issues: 2 | Issues: 0

653f20assignment2

Assignment 2, CMPUT 653, Fall 2020

Language: Python | Stargazers: 0 | Issues: 4 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

repn-learning

Taapas and Rupam's work during Summer 2020

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 4 | Issues: 1

SenseAct

SenseAct: A computational framework for real-world robot learning tasks

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0