Cabri Giacomo, Lugli Matteo, Montangero Manuela, Muzzini Filippo
Department of Physics, Informatics and Mathematics, University of Modena e Reggio Emilia, 41125 Modena, Italy.
Sensors (Basel). 2024 Feb 17;24(4):1288. doi: 10.3390/s24041288.
With the advent of IoT, cities will soon be populated by autonomous vehicles and managed by intelligent systems capable of actively interacting with city infrastructures and vehicles. In this work, we propose a model based on reinforcement learning that teaches to autonomous connected vehicles how to save resources while navigating in such an environment. In particular, we focus on budget savings in the context of auction-based intersection management systems. We trained several models with Deep Q-learning by varying traffic conditions to find the most performance-effective variant in terms of the trade-off between saved currency and trip times. Afterward, we compared the performance of our model with previously proposed and random strategies, even under adverse traffic conditions. Our model appears to be robust and manages to save a considerable amount of currency without significantly increasing the waiting time in traffic. For example, the learner bidder saves at least 20% of its budget with heavy traffic conditions and up to 74% in lighter traffic with respect to a standard bidder, and around three times the saving of a random bidder. The results and discussion suggest practical adoption of the proposal in a foreseen future real-life scenario.
随着物联网的出现,城市中将很快充斥着自动驾驶车辆,并由能够与城市基础设施和车辆进行积极交互的智能系统管理。在这项工作中,我们提出了一种基于强化学习的模型,该模型教导自动驾驶联网车辆如何在这样的环境中导航时节省资源。特别是,我们专注于基于拍卖的交叉路口管理系统背景下的预算节省。我们通过改变交通状况,用深度Q学习训练了几个模型,以找到在节省资金和行程时间之间的权衡方面最具性能效益的变体。之后,我们将我们模型的性能与先前提出的策略和随机策略进行了比较,即使是在不利的交通条件下。我们的模型似乎很稳健,能够在不显著增加交通等待时间的情况下节省大量资金。例如,与标准出价者相比,学习者出价者在交通拥堵情况下至少节省20%的预算,在交通较顺畅时最多可节省74%,节省量约为随机出价者的三倍。结果和讨论表明,该提议在可预见的未来现实场景中具有实际应用价值。