Reda ALAMI's repositories
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
models
Models and examples built with TensorFlow
Deep_Reinforcement_Learning
Resources, papers, tutorials
gym
A toolkit for developing and comparing reinforcement learning algorithms.
mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
rlss-2019
[RLSS 2019] Bandits, RL & Deep RL: Practical Sessions.
Adversarial-Multi-Armed-bandit
Adversarial multi-armed bandit algorithms
gym-gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Playground
Curated collection of notebooks and code files I have worked on while learning a wide range of data science subfields, such as Reinforcement Learning, Natural Language Processing, Deep Neural Networks, Genetic Algorithms, etc. Some of these are accompanied by a pdf and/or article.
Bayesian-Online-Change-point-Detector-Matlab-codes-
Implementation of the Bayesian Online Change-point Detector of Ryan Prescott Adams and David McKay (2007).
reinforcement_learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
Kuka_Robotics_Arms
Application of Deep Reinforcement Learning on Robotics Arm control task
Deep-Reinforcement-Learning
Code of the book <Deep Reinforcement Learning: Principles and Practices>
IndependentComponentAnalysis
ICA algorithm
qlearning-simple
For the tutorial blogpost
rbocpdms
Robust bayesian online changepoint detection with model selection
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
Reinforcement-Learning-for-Decision-Making-in-self-driving-cars
bandit-neuralmonkey
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).