Sunbelt Computer Software

IPDALight

Reinforcement Learning (RL) has been recognized as one of the most effective methods to optimize traffic signal control. However, due to the inappropriate design of RL elements (i.e., reward and state) for complex traffic dynamics, existing RL-based approaches suffer from slow convergence to optimal traffic signal plans. Meanwhile, to simplify the traffic modeling, most optimization methods assume that the phase duration of traffic signals is constant, which strongly limits the RL capability to search for traffic signal control policies with shorter average vehicle travel time and better GreenWave control. To address these issues, this project proposes a novel intensity- and phase duration-aware RL method named IPDALight for the optimization of traffic signal control. Inspired by the Max Pressure (MP)-based traffic control strategy used in the transportation field, we introduce a new concept named intensity, which ensures that our reward design and state representation can accurately reflect the status of vehicles. By taking the coordination of neighboring intersections into account, our approach enables the fine-tuning of phase duration of traffic signals to adapt to dynamic traffic situations. Comprehensive experimental results on both synthetic and real-world traffic scenarios show that, compared with the state-of-the-art RL methods, IPDALight can not only achieve better average vehicle travel time and greenwave control for various multi-intersection scenarios, but also converge to optimal solutions much faster.

Welcome to cite the paper we published on JSA

@article{zhao2022ipdalight,
  title={Ipdalight: Intensity-and phase duration-aware traffic signal control based on reinforcement learning},
  author={Zhao, Wupan and Ye, Yutong and Ding, Jiepin and Wang, Ting and Wei, Tongquan and Chen, Mingsong},
  journal={Journal of Systems Architecture},
  volume={123},
  pages={102374},
  year={2022},
  publisher={Elsevier}
}

Requirements

Python 3.5+
cityflow
tqdm
tensorflow 1.11.0
keras
pandas
numpy

Simulator installation

Our experiments are implemented on top of the traffic simulator Cityflow. Detailed installation guide files can be found in https://cityflow-project.github.io/

1. Install cpp dependencies

sudo apt update && sudo apt install -y build-essential cmake

2. Clone CityFlow project from github

git clone https://github.com/cityflow-project/CityFlow.git

3. Go to CityFlow project’s root directory and run

pip install .

4. Wait for installation to complete and CityFlow should be successfully installed

import cityflow

eng = cityflow.Engine

Files

train.py

The main file of experiments. Choosing the dataset, setting simulation parameters and starting the train.
dqn_agent.py

Implement RL agent for proposed IPDALight.
cityflow_env.py

Define a simulator environment to interact with the simulator and obtain needed data (e.g., state, action and reward).
utility.py

Some functions for experiments (e.g., reading information from roadnet file and plot the travel time convergence).

Dataset

For the experiments, we used both synthetic and realworld traffic datasets provided by https://traffic-signal-control.github.io/dataset.html.

synthetic data

We considered four synthetic traffic datasets with different scales (i.e., 1x3, 2x2, 3x3, 4x4). Traffic files can be found in data/template_lsr/new/1_3 and data/template_lsr/new/2_2 and data/template_lsr/new/3_3 and data/template_lsr/new/4_4.
real-world data

We used two datasets collected from the real-world traffic of two cities (i.e., Hangzhou and Jinan) in China via roadside surveillance cameras. Traffic files of Hangzhou can be found in data/hangzhou, which contains 16 intersections in the form of a 4x4 grid. Traffic files of Jinan can be found in data/jinan, which contains 12 intersections in the form of a 3x4 grid.

Comparison methods

Experiments README

1. Comparison of average travel time and convergence rate

Detailed README for expriment of comparison of average travel time and convergence rate TravelTimeComparison

2. Comparison of fairness

Detailed README for expriment of comparison of fairness FairnessComparison

3. Comparison of intersection pressure

Detailed README for expriment of comparison of intersection pressure PressureComparison

4. Comparison of greenwave control effects

Detailed README for expriment of comparison of greenwave control effects GreenwaveComparison

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
Fair-IPDALight		Fair-IPDALight
Greenwave-IPDALight		Greenwave-IPDALight
Pressure-IPDALight		Pressure-IPDALight
data		data
Comparison.md		Comparison.md
FairnessComparison.md		FairnessComparison.md
GreenwaveComparison.md		GreenwaveComparison.md
PressureComparison.md		PressureComparison.md
README.md		README.md
cityflow_env.py		cityflow_env.py
dqn_agent.py		dqn_agent.py
train.py		train.py
utility.py		utility.py

Method	paper link	source code link
Fixed Time	Traffic engineering A policy that selects control phases in a cyclical way with a predefined duration and phase sequence. https://trid.trb.org/view/310674	-
SOTL	Self organizing traffic lights: A realistic simulation A method that adaptively controls traffic lights based on a threshold indicating the number of waiting vehicles https://arxiv.org/abs/nlin/0610040v1	https://github.com/tianrang-intelligence/TSCC2019/blob/master/sotl_agent.py
GRL	Coordinated deep reinforcement learners for traffic light control An RL-based method based on Q-learning for coordinated traffic signal control, which can learn the joint Q-function of two adjacent intersections by using a coordination graph. https://www.elisevanderpol.nl/papers/vanderpolNIPSMALIC2016.pdf	https://traffic-signal-control.github.io/code.html
CoLight	CoLight: Learning Network-level Cooperation for Traffic Signal Control A deep RL-based method that considers the neighboring intersection information, which uses graph attentional networks to facilitate the communication among intersections. https://dl.acm.org/doi/abs/10.1145/3357384.3357902	https://github.com/wingsweihua/colight
PressLight	PressLight: Learning Max Pressure Control to Coordinate Traffic Signals in Arterial Network A deep RL-based method that can effectively select control phases for intersection pressure minimization based on the MP theory. https://faculty.ist.psu.edu/jessieli/Publications/2019-KDD-presslight.pdf	https://github.com/wingsweihua/presslight

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IPDALight

Requirements

Simulator installation

1. Install cpp dependencies

2. Clone CityFlow project from github

3. Go to CityFlow project’s root directory and run

4. Wait for installation to complete and CityFlow should be successfully installed

Files

Dataset

Comparison methods

Experiments README

1. Comparison of average travel time and convergence rate

2. Comparison of fairness

3. Comparison of intersection pressure

4. Comparison of greenwave control effects

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Folders and files

Latest commit

History

Repository files navigation

IPDALight

Requirements

Simulator installation

1. Install cpp dependencies

2. Clone CityFlow project from github

3. Go to CityFlow project’s root directory and run

4. Wait for installation to complete and CityFlow should be successfully installed

Files

Dataset

Comparison methods

Experiments README

1. Comparison of average travel time and convergence rate

2. Comparison of fairness

3. Comparison of intersection pressure

4. Comparison of greenwave control effects

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages