• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于概率主成分分析的线性生成模型的缺失交通数据插补。

Missing Traffic Data Imputation with a Linear Generative Model Based on Probabilistic Principal Component Analysis.

机构信息

School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 639798, Singapore.

出版信息

Sensors (Basel). 2022 Dec 25;23(1):204. doi: 10.3390/s23010204.

DOI:10.3390/s23010204
PMID:36616802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9824200/
Abstract

Even with the ubiquitous sensing data in intelligent transportation systems, such as the mobile sensing of vehicle trajectories, traffic estimation is still faced with the data missing problem due to the detector faults or limited number of probe vehicles as mobile sensors. Such data missing issue poses an obstacle for many further explorations, e.g., the link-based traffic status modeling. Although many studies have focused on tackling this kind of problem, existing studies mainly focus on the situation in which data are missing at random and ignore the distinction between links of missing data. In the practical scenario, traffic speed data are always missing not at random (MNAR). The distinction for recovering missing data on different links has not been studied yet. In this paper, we propose a general linear model based on probabilistic principal component analysis (PPCA) for solving MNAR traffic speed data imputation. Furthermore, we propose a metric, i.e., Pearson score (p-score), for distinguishing links and investigate how the model performs on links with different p-score values. Experimental results show that the new model outperforms the typically used PPCA model, and missing data on links with higher p-score values can be better recovered.

摘要

即使在智能交通系统中存在无处不在的传感数据,例如车辆轨迹的移动传感,由于检测器故障或作为移动传感器的探针车辆数量有限,交通估计仍然面临数据缺失问题。这种数据缺失问题为许多进一步的探索设置了障碍,例如基于链路的交通状态建模。尽管许多研究都集中在解决这类问题上,但现有研究主要集中在数据随机缺失的情况下,忽略了缺失数据链路之间的区别。在实际场景中,交通速度数据总是非随机缺失(MNAR)。对于不同链路的缺失数据的恢复区别尚未进行研究。在本文中,我们提出了一种基于概率主成分分析(PPCA)的广义线性模型,用于解决 MNAR 交通速度数据插补问题。此外,我们提出了一种度量标准,即 Pearson 得分(p 得分),用于区分链路,并研究模型在具有不同 p 得分值的链路中的表现。实验结果表明,新模型优于常用的 PPCA 模型,并且可以更好地恢复具有更高 p 得分值的链路的缺失数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/69a8ddd84068/sensors-23-00204-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/cd0d9e3817d4/sensors-23-00204-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/f4ac9e913f53/sensors-23-00204-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/5f8c7ecd219c/sensors-23-00204-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/ed191dd37a0e/sensors-23-00204-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/69a8ddd84068/sensors-23-00204-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/cd0d9e3817d4/sensors-23-00204-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/f4ac9e913f53/sensors-23-00204-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/5f8c7ecd219c/sensors-23-00204-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/ed191dd37a0e/sensors-23-00204-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8df/9824200/69a8ddd84068/sensors-23-00204-g005.jpg

相似文献

1
Missing Traffic Data Imputation with a Linear Generative Model Based on Probabilistic Principal Component Analysis.基于概率主成分分析的线性生成模型的缺失交通数据插补。
Sensors (Basel). 2022 Dec 25;23(1):204. doi: 10.3390/s23010204.
2
BGCP-based traffic data imputation and accident detection applications for the national trunk highway.基于 BGCP 的交通数据插补和国家干线公路事故检测应用
Accid Anal Prev. 2023 Jun;186:107051. doi: 10.1016/j.aap.2023.107051. Epub 2023 Apr 3.
3
NS-kNN: a modified k-nearest neighbors approach for imputing metabolomics data.NS-kNN:一种改进的 k-最近邻方法,用于代谢组学数据插补。
Metabolomics. 2018 Nov 23;14(12):153. doi: 10.1007/s11306-018-1451-8.
4
GSimp: A Gibbs sampler based left-censored missing value imputation approach for metabolomics studies.GSimp:一种基于 Gibbs 抽样的代谢组学研究中左截断缺失值插补方法。
PLoS Comput Biol. 2018 Jan 31;14(1):e1005973. doi: 10.1371/journal.pcbi.1005973. eCollection 2018 Jan.
5
Probabilistic principal component analysis with expectation maximization (PPCA-EM) facilitates volume classification and estimates the missing data.概率主成分分析与期望最大化(PPCA-EM)有助于体积分类,并估计缺失数据。
J Struct Biol. 2010 Jul;171(1):18-30. doi: 10.1016/j.jsb.2010.04.002. Epub 2010 Apr 10.
6
A hybrid imputation approach for microarray missing value estimation.一种用于微阵列缺失值估计的混合插补方法。
BMC Genomics. 2015;16 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2164-16-S9-S1. Epub 2015 Aug 17.
7
Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors.Heckman 插补模型用于二分类或连续 MNAR 结局和 MAR 预测因子。
BMC Med Res Methodol. 2018 Aug 31;18(1):90. doi: 10.1186/s12874-018-0547-1.
8
A multiple imputation approach for MNAR mechanisms compatible with Heckman's model.一种与赫克曼模型兼容的非随机缺失机制的多重填补方法。
Stat Med. 2016 Jul 30;35(17):2907-20. doi: 10.1002/sim.6902. Epub 2016 Feb 18.
9
Traffic Estimation for Large Urban Road Network with High Missing Data Ratio.针对高缺失数据率的大型城市道路网络的交通流量估计
Sensors (Basel). 2019 Jun 24;19(12):2813. doi: 10.3390/s19122813.
10
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.并行缺失值插补:一种用于微阵列数据的新型稳健缺失值估计算法。
Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.

本文引用的文献

1
Tensor Decomposition for Spatial-Temporal Traffic Flow Prediction with Sparse Data.张量分解在稀疏数据时空交通流预测中的应用。
Sensors (Basel). 2020 Oct 24;20(21):6046. doi: 10.3390/s20216046.