There are 1 repository under policy-gradients topic.
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Implementations of Deep Reinforcement Learning Algorithms and Bench-marking with PyTorch
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,DDPG for discrete action space, A2C, A3C, TD3, SAC, TRPO
A Universal Deep Reinforcement Learning Framework
A collection of several Deep Reinforcement Learning techniques (Deep Q Learning, Policy Gradients, ...), gets updated over time.
Model-based Policy Gradients
Artificial Intelligence series
ReLAx - Reinforcement Learning Applications Library
Code for an intro to RL workshop. You'll be training a simple agent to play pong using policy gradients. Adapted from http://karpathy.github.io/2016/05/31/rl/
Projects for The School of AI
Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.
Project 2 of Udacity Deep Reinforcement Learning Nanodegree
The objective of this project is to develop an autonomous agent to perform well in the first person shooting games using various reinforcement learning techniques.
Udacity Deep Reinforcement Learning Nanodegree. Second Project Implementation (Continuous Control).
Exploring the fundamentals of neural networks
Remember the sad Marvin from "Hitchhiker's guide to the galaxy"? In this project we train him to walk from the scratch using only pure python with numpy!
Implementations of Rl algorithms ranging from Q-learning to Multi-Agent RL using DDPG in unity and gym environments.
Self Play Actor Critic, Reinforcement Learning on TROY; all puns intended
Vanilla Policy Gradient (REINFORCE) implementation with PyTorch
Policy Gradients, DDPG, and TD3 in gym env