Ralami1859

Reda ALAMI's repositories

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

MIT000

models

Models and examples built with TensorFlow

Language:PythonApache-2.0000

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonNOASSERTION000

mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Language:PythonNOASSERTION000

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookApache-2.0000

Generalized-Likelihood-Ratio-GLR-

Language:MATLAB000

rlss-2019

[RLSS 2019] Bandits, RL & Deep RL: Practical Sessions.

Language:Jupyter Notebook000

Adversarial-Multi-Armed-bandit

Adversarial multi-armed bandit algorithms

Language:MATLAB400

gym-gazebo2

gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo

Apache-2.0000

DifferentiallyPrivateMultiArmedBandit

Language:MATLAB100

Curated collection of notebooks and code files I have worked on while learning a wide range of data science subfields, such as Reinforcement Learning, Natural Language Processing, Deep Neural Networks, Genetic Algorithms, etc. Some of these are accompanied by a pdf and/or article.

MIT000

Bayesian-Online-Change-point-Detector-Matlab-codes-

Implementation of the Bayesian Online Change-point Detector of Ryan Prescott Adams and David McKay (2007).

Language:MATLAB100

Decentralized-Exploration-in-Multi-Armed-Bandits

000

reinforcement_learning

Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.

MIT000

UCRL

Language:Python000

Kuka_Robotics_Arms

Application of Deep Reinforcement Learning on Robotics Arm control task

000

Deep-Reinforcement-Learning

Code of the book <Deep Reinforcement Learning: Principles and Practices>

000

IndependentComponentAnalysis

ICA algorithm

000

qlearning-simple

For the tutorial blogpost

Language:PythonMIT000

MultiLabelClassification

000

VanillaLSTM

000

Diffused-Spectral-Clustering

000

rbocpdms

Robust bayesian online changepoint detection with model selection

Language:PythonMIT000

intuitive_policy_gradient

000

Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

000

bandit-neuralmonkey

Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).

BSD-3-Clause000

Ralami1859

Reda ALAMI's repositories

reinforcement-learning

football

models

Deep_Reinforcement_Learning

gym

mujoco-py

dopamine

Generalized-Likelihood-Ratio-GLR-

rlss-2019

Adversarial-Multi-Armed-bandit

MultiArmedBandit

gym-gazebo2

CorruptMultiArmedBandit

DifferentiallyPrivateMultiArmedBandit

Playground

Bayesian-Online-Change-point-Detector-Matlab-codes-

Decentralized-Exploration-in-Multi-Armed-Bandits

reinforcement_learning

UCRL

Kuka_Robotics_Arms

Deep-Reinforcement-Learning

IndependentComponentAnalysis

qlearning-simple

MultiLabelClassification

VanillaLSTM

Diffused-Spectral-Clustering

rbocpdms

intuitive_policy_gradient

Reinforcement-Learning-for-Decision-Making-in-self-driving-cars

bandit-neuralmonkey