sritee

Sridhar Thiagarajan's repositories

Convex-Optimization-Solver

Generic Solver (Primal Interior Point Method)

Language: MATLAB | 8 2 1

CPPGym

C++17 OpenAI Gym

Language: C++ | License: GPL-2.0 | 7 10

One-Shot-Imitation-Learning

Imitation Learning using context embedding

Language: Python | 7 10

Markov-Chain-Monte-Carlo--Gibbs-Sampling

MCMC Method: Gibbs Sampling from a 2D Gaussian

Language: MATLAB | License: MIT | 6 10

Monte-Carlo-Tree-Search

Monte Carlo Tree Search for receding horizon control

Language: Python | License: MIT | 4 20

Deterministic-Policy-Gradient-Methods

C++ Implementation of Deterministic Policy Gradient Algorithms (ICML 2014, Silver et al.) using Tile Coding

Language: C++ | License: MIT | 3 1 1

Dimensionality-Reduced-Reinforcement-Learning-for-Assistive-Robots

Reproducing AAAI 2016 Paper: Dimensionality Reduced Reinforcement Learning for Assistive Robots

Language: Python | 3 10

DynaQ

DynaQ RL-Agent

Language: MATLAB | License: MIT | 3 10

Stochastic-Policy-Gradient-Methods

Monte-Carlo Policy Gradient, Stochastic Policy Gradient and Numerical Gradient Policy Gradient

Language: Python | License: MIT | 3 10

Eligibility-Traces-RL

Performance Comparison of various Eligibility Traces on Maze Task

Language: MATLAB | License: MIT | 2 10

Integer_Programming_CVXPY

Integer programming problems solved using Gurobi backend and CVXPY

Language: Python | 2 10

FourierBasis-Python

SARSA Lambda Fourier Basis

Language: Python | License: MIT | 1 10

IntraOption-Learning

Intra Option Learning, SMDP Framework

Language: MATLAB | License: MIT | 1 20

Kickstarter-ML-Feature-Engineering-

Language: Jupyter Notebook | License: MIT | 1 20

QLearn-vs-SARSA-Cliff-Walk

Comparison of Q-Learning and SARSA on Cliff Walk

Language: MATLAB | License: MIT | 1 10

-Double-DQN-and-DQN

Implementation of DQN and Double DQN for OpenAI Gym Environments

Language: Python | License: MIT | 020

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 010

Diverse-Density-Estimation-for-Subgoal-Detection

Autonomous Subgoal Discovery for RL agent

Language: MATLAB | License: MIT | 010

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | 010

MC-Exploring-Starts-Blackjack

Language: MATLAB | License: MIT | 010

NumpyNets

NumPy neural networks with a Keras-like interface

Language: Python | License: MIT | 01 1

Off-Policy-Eligibility-Traces

Tree based backup proposed by Diana Precup on N-Step Random Walk

Language: MATLAB | License: MIT | 010

offworld-gym

OffWorld Gym client library

Language: Python | License: GPL-3.0 | 010

Orienteering-Problem-for-Ebola-Camps

Language: Python | 020

Q-Learning

Q-Learning Discrete State Discrete Action

Language: Python | License: MIT | 010

rl-agents

Implementations of Reinforcement Learning and Planning algorithms

Language: Python | License: MIT | 010

RNN-TensorFlow

Implementation of RNN in TensorFlow

Language: Python | License: MIT | 010

sritee.github.io

GitHub page

Language: HTML | 010

Traffic_simulator_pygame

Pygame traffic simulator

Language: Python | 010