Sridhar Thiagarajan's repositories
Convex-Optimization-Solver
Generic Solver (Primal Interior Point Method)
One-Shot-Imitation-Learning
Imitation Learning using context embedding
Markov-Chain-Monte-Carlo--Gibbs-Sampling
MCMC Method : Gibbs Sampling from 2D Gaussian
Monte-Carlo-Tree-Search
Monte Carlo Tree Search for receding horizon control
Deterministic-Policy-Gradient-Methods
C++ Implementation of Deterministic Policy Gradient Algorithms (ICML 2014, Silver Et al.) using Tile Coding
Dimensionality-Reduced-Reinforcement-Learning-for-Assistive-Robots
Reproducing AAAI 2016 Paper : Dimensionality Reduced Reinforcement Learning for Assistive Robots
Stochastic-Policy-Gradient-Methods
Monte-Carlo Policy Gradient, Stochastic Policy Gradient and Numerical Gradient Policy Gradient
Eligibility-Traces-RL
Performance Comparison of various Eligibility Traces on Maze Task
Integer_Programming_CVXPY
Integer programming problems solved using Gurobi backend and CVXPY
FourierBasis-Python
SARSA Lambda Fourier Basis
IntraOption-Learning
Intra Option Learning, SMDP Framework
QLearn-vs-SARSA-Cliff-Walk
Comparison of Q-Learning and SARSA On Cliff Walk
-Double-DQN-and-DQN
Implementation of DQN and Double DQN for OpenAI Gym Environments
Diverse-Density-Estimation-for-Subgoal-Detection
Autonomous Subgoal Discovery for Rl agent
Off-Policy-Eligibility-Traces
Tree based backup proposed by Diana Precup on N-Step Random Walk
offworld-gym
OffWorld Gym client library
Q-Learning
Q-Learning Discrete State Discrete Action
RNN-TensorFlow
Implementation of RNN in TensorFlow
sritee.github.io
Github page
Traffic_simulator_pygame
Pygame traffic simulator