Pretraining-improved Spatiotemporal graph network for the generalization performance enhancement of traffic forecasting.

Authors

Zhang Xiangyue, Li Chao, Ji Ling, Kang Yuyun, Pan Mingming, Liu Zhuo, Qi Qiang

Affiliations

School of Information Science and Engineering, Linyi University, Linyi, 276000, China.

Daopuyun (Shandong) Intelligent Technology Co., Ltd, Jinan, 265200, China.

Publication information

Sci Rep. 2025 Jul 29;15(1):27668. doi: 10.1038/s41598-025-11375-2.

DOI:10.1038/s41598-025-11375-2
PMID:40730627
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12307734/
Abstract

Traffic forecasting is considered a cornerstone of smart city development. A key challenge is capturing the long-term spatiotemporal dependencies of traffic data while improving the model's generalization ability. To address these issues, various sophisticated modules are embedded into different models. However, this approach increases the computational cost of the model. Additionally, adding or replacing datasets in a trained model requires retraining, which decreases prediction accuracy and increases time cost. To address the challenges faced by existing models in handling long-term spatiotemporal dependencies and high computational costs, this study proposes an enhanced pre-training method called the Improved Spatiotemporal Diffusion Graph (ImPreSTDG). While existing traffic prediction models, particularly those based on Graph Convolutional Networks (GCNs) and deep learning, are effective at capturing short-term spatiotemporal dependencies, they often experience accuracy degradation and increased computational demands when dealing with long-term dependencies. To overcome these limitations, we introduce a Denoising Diffusion Probabilistic Model (DDPM) as part of the pre-training process, which enhances the model's ability to learn from long-term spatiotemporal data while significantly reducing computational costs. During the pre-training phase, ImPreSTDG employs a data masking and recovery strategy, with DDPM facilitating the reconstruction of masked data segments, thereby enabling the model to capture long-term dependencies in the traffic data. Additionally, we propose the Mamba module, which leverages the Selective State Space Model (SSM) to effectively capture long-term multivariate spatiotemporal correlations. This module enables more efficient processing of long sequences, extracting essential patterns while minimizing computational resource consumption.
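The masking-and-recovery idea described above can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the patch length, mask ratio, or noise schedule, so `patch_len`, `mask_ratio`, the linear beta schedule, and all function names below are assumptions. The sketch splits a toy series into patches, selects some to mask, and corrupts only those with the standard closed-form DDPM forward step, keeping the injected noise as the target a denoiser would learn to predict.

```python
import math
import random

def mask_patches(series, patch_len, mask_ratio, rng):
    """Split a series into fixed-length patches and pick a random subset to mask."""
    n_patches = len(series) // patch_len
    patches = [series[i * patch_len:(i + 1) * patch_len] for i in range(n_patches)]
    n_masked = max(1, round(mask_ratio * n_patches))
    masked_idx = sorted(rng.sample(range(n_patches), n_masked))
    return patches, masked_idx

def alpha_bar(t, num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_s), s = 1..t, for an assumed linear schedule."""
    prod = 1.0
    for s in range(1, t + 1):
        beta = beta_start + (beta_end - beta_start) * (s - 1) / (num_steps - 1)
        prod *= 1.0 - beta
    return prod

def forward_noise(patch, t, num_steps, rng):
    """Closed-form forward step: x_t = sqrt(a_bar) * x_0 + sqrt(1 - a_bar) * eps."""
    a_bar = alpha_bar(t, num_steps)
    eps = [rng.gauss(0.0, 1.0) for _ in patch]
    noisy = [math.sqrt(a_bar) * x + math.sqrt(1.0 - a_bar) * e
             for x, e in zip(patch, eps)]
    return noisy, eps

rng = random.Random(0)
series = [math.sin(2 * math.pi * i / 24) for i in range(96)]  # toy daily-cycle signal
patches, masked_idx = mask_patches(series, patch_len=12, mask_ratio=0.25, rng=rng)

corrupted = [list(p) for p in patches]  # unmasked patches stay untouched
noise_targets = {}                      # noise a denoiser would be trained to recover
for idx in masked_idx:
    corrupted[idx], noise_targets[idx] = forward_noise(
        patches[idx], t=500, num_steps=1000, rng=rng)
```

In a full pipeline, a denoising network conditioned on the visible patches would be trained to predict `noise_targets`; that network is omitted here because the abstract does not describe it.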
By improving computational efficiency, the Mamba module addresses the challenge of modeling long-term dependencies without compromising accuracy in capturing extended spatiotemporal trends. In the fine-tuning phase, the decoder is replaced with a forecasting head, and the pre-trained parameters are frozen. The forecasting head includes a meta-learning fusion module and a spatiotemporal convolutional layer, which facilitates the integration of both long-term and short-term traffic data for accurate forecasting. The model is then trained and adapted to the specific forecasting task. Experiments conducted on three real-world traffic datasets demonstrate that the proposed pre-training method significantly enhances the model's ability to handle long-term dependencies, missing data, and high computational costs, providing a more efficient solution for traffic prediction.
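The fine-tuning step, in which pre-trained parameters are frozen and only a new forecasting component is trained, can be sketched as follows. This is a toy illustration under stated assumptions, not the paper's architecture: `ENCODER` is a hypothetical stand-in for the pre-trained network, and a plain linear layer replaces the meta-learning fusion module and spatiotemporal convolution, which the abstract does not specify. Only the head's weights receive gradient updates.

```python
import copy
import math

# Hypothetical "pre-trained" encoder: each row maps a 4-step window to one feature.
ENCODER = {"W": [[0.25, 0.25, 0.25, 0.25],   # window mean
                 [0.0, 0.0, 0.0, 1.0],       # last value
                 [-1.0, 0.0, 0.0, 1.0]]}     # change across the window

def encode(encoder, window):
    """Apply the frozen encoder; its weights are only read, never written."""
    return [sum(w * x for w, x in zip(row, window)) for row in encoder["W"]]

def predict(head, feats):
    return sum(w * f for w, f in zip(head["w"], feats)) + head["b"]

def mse(head, feats_all, targets):
    return sum((predict(head, f) - y) ** 2
               for f, y in zip(feats_all, targets)) / len(targets)

def fine_tune(encoder, windows, targets, epochs=300, lr=0.05):
    """Full-batch gradient descent on the head only; the encoder stays fixed."""
    feats_all = [encode(encoder, w) for w in windows]  # computed once
    head = {"w": [0.0] * len(encoder["W"]), "b": 0.0}
    n = len(targets)
    for _ in range(epochs):
        grad_w = [0.0] * len(head["w"])
        grad_b = 0.0
        for feats, y in zip(feats_all, targets):
            err = predict(head, feats) - y
            grad_w = [g + err * f / n for g, f in zip(grad_w, feats)]
            grad_b += err / n
        head["w"] = [w - lr * g for w, g in zip(head["w"], grad_w)]
        head["b"] -= lr * grad_b
    return head, feats_all

series = [math.sin(2 * math.pi * i / 24) for i in range(120)]  # toy signal
windows = [series[i:i + 4] for i in range(len(series) - 4)]
targets = [series[i + 4] for i in range(len(series) - 4)]       # one-step-ahead value

encoder_before = copy.deepcopy(ENCODER)
head, feats_all = fine_tune(ENCODER, windows, targets)
mse_before = mse({"w": [0.0, 0.0, 0.0], "b": 0.0}, feats_all, targets)
mse_after = mse(head, feats_all, targets)
```

Because the encoder output does not change during fine-tuning, the features can be computed once and reused in every epoch, which is one reason freezing pre-trained parameters keeps adaptation cheap.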


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/6eec7a7bbdc3/41598_2025_11375_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/dda471a28a03/41598_2025_11375_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/096d82b1746f/41598_2025_11375_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/b6b6ba160025/41598_2025_11375_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/67bcb13baea2/41598_2025_11375_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/d7829e2d72d1/41598_2025_11375_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/612a39847f45/41598_2025_11375_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/246fe47f9076/41598_2025_11375_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/dcdbc9975e23/41598_2025_11375_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/51252c96c44a/41598_2025_11375_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/46a3bbf26935/41598_2025_11375_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/ea292f7c3bf7/41598_2025_11375_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/d410e0908940/41598_2025_11375_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/943c6de0924d/41598_2025_11375_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce70/12307734/d0b4bf39ca0c/41598_2025_11375_Fig15_HTML.jpg
