jiseongHAN / Double-Experience-Replay-DER-

Implementation of Double Experience Replay (DER) with PyTorch


Double Experience Replay (DER)

PyTorch implementation of Double Experience Replay (DER)

This method mixes two strategies for sampling the experiences stored in the replay buffer.
You can choose whichever strategies you want; in this paper we use a temporal-difference (TD) value based sampling strategy and a uniform sampling strategy.
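As a minimal sketch of the idea (the class and method names below are illustrative, not the repository's actual API), a buffer can draw part of each minibatch uniformly at random and the rest in proportion to the absolute TD error:

import random
import numpy as np

class DoubleReplayBuffer:
    # Illustrative mixed-strategy buffer: part of each batch is sampled
    # uniformly, the rest in proportion to the stored |TD error|.
    def __init__(self, capacity=50000):
        self.transitions = []
        self.td_errors = []
        self.capacity = capacity

    def push(self, transition, td_error):
        if len(self.transitions) >= self.capacity:
            self.transitions.pop(0)
            self.td_errors.pop(0)
        self.transitions.append(transition)
        self.td_errors.append(abs(td_error) + 1e-6)  # keep probabilities nonzero

    def sample(self, batch_size, uniform_ratio=0.5):
        n_uniform = int(batch_size * uniform_ratio)
        idx = random.sample(range(len(self.transitions)), n_uniform)
        probs = np.array(self.td_errors) / np.sum(self.td_errors)
        idx += list(np.random.choice(len(self.transitions),
                                     batch_size - n_uniform, p=probs))
        return [self.transitions[i] for i in idx]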

Contents

This implementation contains:

Simulation of Urban MObility (SUMO)

  • Lane Change Environment
  • Ring Network Environment

**The YeongDong Bridge environment is not included in this repository.

Method

We combine a uniform sampling strategy with a TD value based sampling strategy.
Deep Q-learning (DQN) is used as the training algorithm.
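The TD error that drives the TD-based half of the sampling is the standard one-step Q-learning error; a short PyTorch sketch follows (the network and tensor names are assumptions for illustration, not this repository's code):

import torch

def one_step_td_error(q_net, target_net, batch, gamma=0.99):
    # batch: tensors of states [B, obs], actions [B], rewards [B],
    # next_states [B, obs], dones [B] (1.0 if the episode ended)
    states, actions, rewards, next_states, dones = batch
    q = q_net(states).gather(1, actions.long().unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        next_q = target_net(next_states).max(1).values
        target = rewards + gamma * (1.0 - dones) * next_q
    # The same quantity serves as the DQN regression error
    # and as the priority recorded for TD-based sampling.
    return target - q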

Requirements

Usage

To train the agent on the SUMO Ring Network environment:

cd ring
python ring.py

To train the agent on the SUMO Lane Change environment:

cd lanechange
python lane.py

Result

  • YeongDong Bridge Agent (LEFT, white car)
  • Lane Change Agent (RIGHT, white car)

  • YeongDong Bridge (LEFT)
  • Ring Network (RIGHT)
