Ashique Rupam Mahmood's repositories

totd-rndmdp-experiments

Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)

Language: Python | 8 40

nonstationary-experiments

A bunch of stationary and nonstationary experiments comparing LMS, RLS, IDBD and Autostep.

Language: Python | 7 20

Create-Serial-Port-Packet-Processor

Language: C | License: MIT | 3 20

wislstd-experiments

Random walk experiments on WIS-LSTD by Mahmood, van Hasselt, and Sutton (NIPS 2014)

Language: Python | 2 20

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language: Python | License: MIT | 1 20

DetectAndTrack

The implementation of an algorithm presented in the CVPR18 paper: "Detect-and-Track: Efficient Pose Estimation in Videos"

Language: Python | License: Apache-2.0 | 1 20

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | 1 20

notebooks-1

An attempt to formalize my thoughts. A Pythonic approach to mental housekeeping.

Language: Jupyter Notebook | 1 20

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: C++ | License: NOASSERTION | 1 10

rnnlm

Tomas Mikolov's Recurrent Neural Networks Language Modeling Toolkit from http://www.rnnlm.org, with tagged historical releases.

Language: C++ | License: BSD-3-Clause | 1 20

scipy

SciPy library main repository

Language: Python | License: NOASSERTION | 1 20

usage-td-experiments

Random MDP experiments on the usage-based step-size adaptation idea by Mahmood and Sutton (2015)

Language: Python | 1 20

wis-td-experiments

Random MDP experiments on the WIS-based off-policy algorithms by Mahmood and Sutton (2015)

Language: Python | 1 20

653f20assignment2

Assignment 2, CMPUT 653, Fall 2020

Language: Python | 040

deepdream

Language: Python | License: NOASSERTION | 020

repn-learning

Taapas and Rupam's work during Summer 2020

Language: Python | License: Apache-2.0 | 04 1

SenseAct

SenseAct: A computational framework for real-world robot learning tasks

Language: Python | License: NOASSERTION | 020