There are 1 repository under td-learning topic.
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
A simple reinforcement learning simulation engine for OpenAI's gym.
Backgammon OpenAI Gym
Reinforcement Learning - Implementation of Exercises, algorithms from the book Sutton Barto and David silver's RL course in Python, OpenAI Gym.
Basic Reinforcement Learning algorithms
Implementation of Reinforcement Algorithms from scratch
Implementation of Q-Learning using TD error to navigate a maze avoiding obstacles and a moving enemy
An efficient reinforcement learning algorithm for learning a strategy for game 2048
An a bias-variance tradeoff of Sarsa vs. Expected Sarsa with experiments.
Reinforcement Learning Algorithms in a simple Gridworld
Reinforcement Learning algorithms SARSA, Q-Learning, DQN, for Classical and MuJoCo Environments and testing them with OpenAI Gym.
train an RNN to estimate value in a POMDP using TD learning
Reinforcement Learning algorithms
Implementation of reinforcement learning algorithms to solve pacman game. Part of CS188 AI course from UC Berkeley.
Onitama Board Game Simulator with Reinforcement Learning opponents (CS 5033)
All of my reinforcement learning projects (Some of the projects may contain errors :D )
Z. Sun, Q. Wang, J. Pan and Y. Xia, "Data-Driven MPC for Linear Systems using Reinforcement Learning," 2021 China Automation Congress (CAC), Beijing, China, 2021, pp. 394-399, doi: 10.1109/CAC53003.2021.9728233.
Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
This project is an implementation of the game EASY21
AI of modified version of Othello/Reversi
Reinforcement Learning Specialization | University of Alberta
Reinforcement Learning (COMP 579) Project
multi-armed bandit, gambler problem, cliff problem and TD learning
A neural network playing Backgammon.
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
python implementation of SARSA learning algorithm to solve a maze
Code and reports from two projects; Boltzmann machine trained on the MNIST data and temporal difference learning model for navigating Morris water-maze task
Some coding stuff from various machine learning books
Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.
A 2048 game platform made with Python & the AI of the game trained by reinforcement learning
Programming Assignments for Reinforcement Learning Specialization