XeroAI, G.T. Road, Lahore, 54890, Punjab, Pakistan; University of Engineering and Technology (UET), Lahore, GT, Road, Lahore, 54890, Punjab, Pakistan.
King Abdul Aziz City for Science and Technology, Riyadh, 11442, Kingdom of Saudi Arabia.
Neural Netw. 2023 Sep;166:396-409. doi: 10.1016/j.neunet.2023.07.027. Epub 2023 Jul 26.
Tackling traffic signal control through multi-agent reinforcement learning is a widely-employed approach. However, current state-of-the-art models have drawbacks: intersections optimize their own local rewards and cause traffic to waste time and fuel with a start-stop mode at each intersection. They also lack information sharing among intersections and their specialized policy hinders the ability to adapt to new traffic scenarios. To overcome these limitations, This work presents a centralized collaborative graph network (CCGN) with the core objective of a signal-free corridor once the traffic flows have waited at the entry intersection of the traffic intersection network on either side, the subsequent intersection gives the open signal as the traffic flows arrive. CCGN combines local policy networks (LPN) and global policy networks, where LPN employed at each intersection predicts actions based on Transformer and Graph Convolutional Network (GCN). In contrast, GPN is based on GCN and Q-network that receives the LPN states, traffic flow and road information to manage intersections to provide a signal-free corridor. We developed the Deep Graph Convolution Q-Network (DGCQ) by combining Deep Q-Network (DQN) and GCN to achieve a signal-free corridor. DGCQ leverages GCN's intersection collaboration and DQN's information aggregation for traffic control decisions Proposed CCGN model is trained on the robust synthetic traffic network and evaluated on the real-world traffic networks that outperform the other state-of-the-art models.
通过多智能体强化学习来解决交通信号控制是一种广泛应用的方法。然而,当前最先进的模型存在一些缺陷:交叉口优化自身的局部奖励,导致交通在每个交叉口以启停模式浪费时间和燃料;它们还缺乏交叉口之间的信息共享,其专门的策略阻碍了适应新交通场景的能力。为了克服这些限制,本工作提出了一种集中式协同图网络(CCGN),其核心目标是一旦交通流在交通交叉口网络的任一侧的入口交叉口等待,后续交叉口就会向到达的交通流开放信号。CCGN 结合了局部策略网络(LPN)和全局策略网络,其中每个交叉口使用的 LPN 基于 Transformer 和图卷积网络(GCN)预测动作。相比之下,GPN 基于 GCN 和 Q 网络,接收 LPN 状态、交通流量和道路信息来管理交叉口以提供无信号走廊。我们通过结合深度 Q 网络(DQN)和 GCN 来开发深度图卷积 Q 网络(DGCQ),以实现无信号走廊。DGCQ 利用 GCN 的交叉口协作和 DQN 的信息聚合来做出交通控制决策。所提出的 CCGN 模型在稳健的合成交通网络上进行训练,并在真实世界的交通网络上进行评估,优于其他最先进的模型。