whisht120

whisht120

Geek Repo

Github PK Tool:Github PK Tool

whisht120's repositories

ARS

An implementation of the Augmented Random Search algorithm

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Discount_as_Regularizer

Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IRL-Toolkit

IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)

Language:MATLABStargazers:0Issues:0Issues:0

learn500lines

500 Lines or Less

Language:JavaScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:MATLABStargazers:0Issues:0Issues:0

Python-100-Days

Python - 100天从新手到大师

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

pytorch-handbook

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Q-Learning-SARSA-Policy-and-Value-Iteration

Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)

Language:MATLABStargazers:0Issues:0Issues:0

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VFIToolkit-matlab

A Matlab Toolkit for Macroeconomic Models using Value Function Iteration

Language:MATLABLicense:NOASSERTIONStargazers:0Issues:0Issues:0