Ashique Rupam Mahmood's repositories
totd-rndmdp-experiments
Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)
nonstationary-experiments
A bunch of stationary and nonstationary experiments comparing LMS, RLS, IDBD and Autostep.
wislstd-experiments
Random walk experiments on WIS-LSTD by Mahmood, van Hasselt, Sutton (2014, nips)
DetectAndTrack
The implementation of an algorithm presented in the CVPR18 paper: "Detect-and-Track: Efficient Pose Estimation in Videos"
notebooks-1
An attempt to formalize my thoughts. A pythonic approach to mental housekeeping
usage-td-experiments
Random MDP experiments on the usage-based step-size adaptation idea by Mahmood and Sutton (2015)
wis-td-experiments
Random MDP experiments on the WIS-based off-policy algorithms by Mahmood and Sutton (2015)
653f20assignment2
Assignment 2, CMPUT 653, Fall 2020
repn-learning
Taapas and Rupam's work during Summer 2020