whisht120

whisht120's repositories

ARS

An implementation of the Augmented Random Search algorithm

Language: Python; License: NOASSERTION

Discount_as_Regularizer

Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" (ICML 2020)

Language: Python; License: MIT

IRL-Toolkit

IRL Toolkit developed by Sergey Levine (taken from https://graphics.stanford.edu/projects/gpirl/)

Language: MATLAB

learn500lines

500 Lines or Less

Language: JavaScript; License: NOASSERTION

nash_q_learning

Language: Python

path_tracking_with_MPC-DDP-_and_parameter_least_square_matlab

Language: MATLAB

Python-100-Days

Python - 100 Days from Novice to Master

Language: Jupyter Notebook

pytorch-handbook

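pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.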

Language: Jupyter Notebook

Q-Learning-SARSA-Policy-and-Value-Iteration

Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, policy iteration, and value iteration) on benchmark RL MDPs (GridWorld, SmallWorld, and CliffWorld)

Language: MATLAB

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language: Jupyter Notebook; License: MIT

TD3

Author's PyTorch implementation of TD3 for OpenAI Gym tasks

Language: Python; License: MIT

VFIToolkit-matlab

A MATLAB Toolkit for Macroeconomic Models using Value Function Iteration

Language: MATLAB; License: NOASSERTION