Sridhar Thiagarajan (sritee)



Company: Google DeepMind

Location: Mountain View, CA

Home Page: https://sritee.github.io


Sridhar Thiagarajan's repositories

Convex-Optimization-Solver

Generic Solver (Primal Interior Point Method)

CPPGym

C++17 OpenAI Gym

Language: C++ | License: GPL-2.0 | Stargazers: 7 | Issues: 1 | Issues: 0

One-Shot-Imitation-Learning

Imitation Learning using context embedding

Language: Python | Stargazers: 7 | Issues: 1 | Issues: 0

Markov-Chain-Monte-Carlo--Gibbs-Sampling

MCMC Method: Gibbs Sampling from a 2D Gaussian

Language: MATLAB | License: MIT | Stargazers: 6 | Issues: 1 | Issues: 0

Monte-Carlo-Tree-Search

Monte Carlo Tree Search for receding horizon control

Language: Python | License: MIT | Stargazers: 4 | Issues: 2 | Issues: 0

Deterministic-Policy-Gradient-Methods

C++ implementation of Deterministic Policy Gradient algorithms (Silver et al., ICML 2014) using tile coding

Language: C++ | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 1

Dimensionality-Reduced-Reinforcement-Learning-for-Assistive-Robots

Reproducing the AAAI 2016 paper: Dimensionality Reduced Reinforcement Learning for Assistive Robots

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

DynaQ

DynaQ RL-Agent

Language: MATLAB | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0

Stochastic-Policy-Gradient-Methods

Monte-Carlo Policy Gradient, Stochastic Policy Gradient and Numerical Gradient Policy Gradient

Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0

Eligibility-Traces-RL

Performance comparison of various eligibility traces on a maze task

Language: MATLAB | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

Integer_Programming_CVXPY

Integer programming problems solved using CVXPY with the Gurobi backend

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

FourierBasis-Python

SARSA Lambda Fourier Basis

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

IntraOption-Learning

Intra Option Learning, SMDP Framework

Language: MATLAB | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

QLearn-vs-SARSA-Cliff-Walk

Comparison of Q-Learning and SARSA on Cliff Walk

Language: MATLAB | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

-Double-DQN-and-DQN

Implementation of DQN and Double DQN for OpenAI Gym Environments

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Diverse-Density-Estimation-for-Subgoal-Detection

Autonomous subgoal discovery for an RL agent

Language: MATLAB | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0
Language: MATLAB | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

NumpyNets

NumPy neural networks with a Keras-like interface

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 1

Off-Policy-Eligibility-Traces

Tree-based backup proposed by Doina Precup, on N-Step Random Walk

Language: MATLAB | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

offworld-gym

OffWorld Gym client library

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

Q-Learning

Q-Learning with discrete states and discrete actions

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

rl-agents

Implementations of Reinforcement Learning and Planning algorithms

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

RNN-TensorFlow

Implementation of an RNN in TensorFlow

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

sritee.github.io

GitHub page

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

Traffic_simulator_pygame

Pygame traffic simulator

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0