基于上下文增强Transformer 网络的实时自主系统行人轨迹预测。

School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW 2308, Australia.

Sensors (Basel). 2022 Oct 2;22(19):7495. doi: 10.3390/s22197495.

Forecasting the trajectory of pedestrians in shared urban traffic environments from non-invasive sensor modalities is still considered one of the challenging problems facing the development of autonomous vehicles (AVs). In the literature, this problem is often tackled using recurrent neural networks (RNNs). Despite the powerful capabilities of RNNs in capturing the temporal dependency in the pedestrians' motion trajectories, they were argued to be challenged when dealing with longer sequential data. Additionally, whilst the accommodation for contextual information (such as scene semantics and agents interactions) was shown to be effective for robust trajectory prediction, they can also impact the overall real-time performance of prediction system. Thus, in this work, we are introducing a framework based on the transformer networks that were demonstrated recently to be more efficient and outperformed RNNs in many sequential-based tasks. We relied on a fusion of sensor modalities, namely the past positional information, agent interactions information and scene physical semantics information as an input to our framework in order to not only provide a robust trajectory prediction of pedestrians, but also achieve real-time performance for multi-pedestrians' trajectory prediction. We have evaluated our framework on three real-life datasets of pedestrians in shared urban traffic environments and it has outperformed the compared baseline approaches in both short-term and long-term prediction horizons. For the short-term prediction horizon, our approach has achieved lower scores according to the average displacement error and the root-mean squared error (ADE/RMSE) of predictions over the state-of-the art (SOTA) approach by more than 11 cm and 23 cm, respectively. While for the long-term prediction horizon, our approach has achieved lower ADE and FDE over the SOTA approach by more than 62 cm and 165 cm, respectively. Additionally, our approach has achieved superior real time performance by scoring only 0.025 s (i.e., it can provide 40 individual trajectory predictions per second).

从非侵入式传感器模态预测共享城市交通环境中的行人轨迹仍然被认为是自动驾驶车辆 (AV) 发展面临的挑战之一。在文献中，这个问题通常使用递归神经网络 (RNN) 来解决。尽管 RNN 在捕捉行人运动轨迹的时间依赖性方面具有强大的能力，但它们在处理更长的序列数据时被认为具有挑战性。此外，虽然上下文信息（如场景语义和代理交互）的适应被证明对鲁棒轨迹预测有效，但它们也会影响预测系统的整体实时性能。因此，在这项工作中，我们引入了一个基于转换器网络的框架，该框架最近被证明在许多基于序列的任务中比 RNN 更有效且表现更好。我们依赖于传感器模态的融合，即过去的位置信息、代理交互信息和场景物理语义信息作为输入到我们的框架中，以便不仅提供行人的稳健轨迹预测，而且实现多行人轨迹预测的实时性能。我们在三个共享城市交通环境中行人的真实数据集上评估了我们的框架，它在短期和长期预测范围内都优于比较基线方法。对于短期预测范围，我们的方法在预测的平均位移误差和均方根误差 (ADE/RMSE) 方面比最先进的方法 (SOTA) 分别低 11 厘米和 23 厘米。而对于长期预测范围，我们的方法在 ADE 和 FDE 方面比 SOTA 分别低 62 厘米和 165 厘米。此外，我们的方法通过仅获得 0.025 秒的得分实现了卓越的实时性能（即，它每秒可以提供 40 个单独的轨迹预测）。

相似文献

Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks.

Sensors (Basel). 2022 Oct 2;22(19):7495. doi: 10.3390/s22197495.

Holistic LSTM for Pedestrian Trajectory Prediction.

IEEE Trans Image Process. 2021;30:3229-3239. doi: 10.1109/TIP.2021.3058599. Epub 2021 Mar 2.

A Review of Deep Learning-Based Methods for Pedestrian Trajectory Prediction.

Sensors (Basel). 2021 Nov 13;21(22):7543. doi: 10.3390/s21227543.

Holistic Spatio-Temporal Graph Attention for Trajectory Prediction in Vehicle-Pedestrian Interactions.

Sensors (Basel). 2023 Aug 23;23(17):7361. doi: 10.3390/s23177361.

MDST-DGCN: A Multilevel Dynamic Spatiotemporal Directed Graph Convolutional Network for Pedestrian Trajectory Prediction.

Comput Intell Neurosci. 2022 Apr 12;2022:4192367. doi: 10.1155/2022/4192367. eCollection 2022.

Prediction of pedestrian-vehicle conflicts at signalized intersections based on long short-term memory neural network.

Accid Anal Prev. 2020 Dec;148:105799. doi: 10.1016/j.aap.2020.105799. Epub 2020 Oct 17.

SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction.

IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):11989-12003. doi: 10.1109/TNNLS.2023.3250485. Epub 2024 Sep 3.

Analyzing vehicle-pedestrian interactions: Combining data cube structure and predictive collision risk estimation model.

Accid Anal Prev. 2022 Feb;165:106539. doi: 10.1016/j.aap.2021.106539. Epub 2021 Dec 17.

Interactions between autonomous vehicles and pedestrians at unsignalized mid-block crosswalks considering occlusions by opposing vehicles.

Accid Anal Prev. 2021 Dec;163:106468. doi: 10.1016/j.aap.2021.106468. Epub 2021 Nov 10.

The paradox of pedestrian's risk aversion.

Accid Anal Prev. 2020 Jul;142:105518. doi: 10.1016/j.aap.2020.105518. Epub 2020 May 20.

引用本文的文献

Analysis of Building Accessibility Using Inertial and Optical Sensors.

Sensors (Basel). 2023 Jun 10;23(12):5491. doi: 10.3390/s23125491.

本文引用的文献

Social force model for pedestrian dynamics.

Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1995 May;51(5):4282-4286. doi: 10.1103/physreve.51.4282.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks.

Sensors (Basel). 2022 Oct 2;22(19):7495. doi: 10.3390/s22197495.

Holistic LSTM for Pedestrian Trajectory Prediction.

IEEE Trans Image Process. 2021;30:3229-3239. doi: 10.1109/TIP.2021.3058599. Epub 2021 Mar 2.

A Review of Deep Learning-Based Methods for Pedestrian Trajectory Prediction.

Sensors (Basel). 2021 Nov 13;21(22):7543. doi: 10.3390/s21227543.

Holistic Spatio-Temporal Graph Attention for Trajectory Prediction in Vehicle-Pedestrian Interactions.

Sensors (Basel). 2023 Aug 23;23(17):7361. doi: 10.3390/s23177361.

MDST-DGCN: A Multilevel Dynamic Spatiotemporal Directed Graph Convolutional Network for Pedestrian Trajectory Prediction.

Comput Intell Neurosci. 2022 Apr 12;2022:4192367. doi: 10.1155/2022/4192367. eCollection 2022.

Prediction of pedestrian-vehicle conflicts at signalized intersections based on long short-term memory neural network.

Accid Anal Prev. 2020 Dec;148:105799. doi: 10.1016/j.aap.2020.105799. Epub 2020 Oct 17.

SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction.

IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):11989-12003. doi: 10.1109/TNNLS.2023.3250485. Epub 2024 Sep 3.

Analyzing vehicle-pedestrian interactions: Combining data cube structure and predictive collision risk estimation model.

Accid Anal Prev. 2022 Feb;165:106539. doi: 10.1016/j.aap.2021.106539. Epub 2021 Dec 17.

Interactions between autonomous vehicles and pedestrians at unsignalized mid-block crosswalks considering occlusions by opposing vehicles.

Accid Anal Prev. 2021 Dec;163:106468. doi: 10.1016/j.aap.2021.106468. Epub 2021 Nov 10.

The paradox of pedestrian's risk aversion.

Accid Anal Prev. 2020 Jul;142:105518. doi: 10.1016/j.aap.2020.105518. Epub 2020 May 20.

引用本文的文献

Analysis of Building Accessibility Using Inertial and Optical Sensors.

Sensors (Basel). 2023 Jun 10;23(12):5491. doi: 10.3390/s23125491.

本文引用的文献

Social force model for pedestrian dynamics.

Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1995 May;51(5):4282-4286. doi: 10.1103/physreve.51.4282.

Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献