Ralami1859

Reda ALAMI's repositories

A3C2-in-TensorFlow-2

Implementation of the Asynchronous Advantage Actor Critic with Communication in TensorFlow 2

Language:Python1 20

ad-deadlines

Countdown for all* relevant conferences in the domain of autonomous driving

MIT000

AirSim

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

NOASSERTION000

arena

DIAMBRA Arena

NOASSERTION000

awesome-rl-envs

000

Deep-reinforcement-learning-lib

Language:Jupyter Notebook000

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

MIT000

GRID-playground

Platform for General Robot Intelligence Development

NOASSERTION000

gym-pybullet-drones

PyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control

MIT000

jumanji

🌴 A Suite of Industry-Driven Hardware-Accelerated RL Environments written in JAX

Apache-2.0000

Kaggle_StoresSalesForecasting

000

learning-to-drive-in-5-minutes

Implementation of reinforcement learning approach to make a car learn to drive smoothly in minutes

MIT000

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Apache-2.0000

LLMs-Finetuning-Safety

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

MIT000

mab2rec

[AAAI 2024] Mab2Rec: Multi-Armed Bandits Recommender

000

Multi-Agent-Reinforcement-Learning

000

Multi-Agent-Reinforcement-Learning-Environment

Hello, I pushed some python environments for Multi Agent Reinforcement Learning.

000

multi_armed_bandit

000

ol-ems

Online learning algorithm for microgrid energy management based on MPC

MIT000

OPTIMIZING-STOCK-TRADING-STRATEGY-WITH-REINFORCEMENT-LEARNING

This project is a part of my Data Science Internship at Technocolabs Softwares.

000

reinforcement_learning_course_materials

Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University

MIT000

RL-Bitcoin-trading-bot

Trying to create Reinforcement Learning powered Bitcoin trading bot

MIT000

RL4RS

A Real-World Benchmark for Reinforcement Learning based Recommender System

CC-BY-SA-4.0000

roerich

Roerich is a python library of change point detection algorithms for time series.

BSD-2-Clause000

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Apache-2.0000

SPIN_official

The official implementation of Self-Play Fine-Tuning (SPIN)

Apache-2.0000

Survey_AI_Drug_Discovery

000

TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:

Apache-2.0000

trl

Train transformer language models with reinforcement learning.

Apache-2.0000

TSCP2

Time Series Change Point Detection based on Contrastive Predictive Coding

000