• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于出租车 GPS 数据的缺失数据插补的集成模糊 C 均值方法。

An Integrated Fuzzy C-Means Method for Missing Data Imputation Using Taxi GPS Data.

机构信息

School of Traffic and Transportation, Beijing Jiaotong University, Beijing 100044, China.

Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing Jiaotong University, Beijing 100044, China.

出版信息

Sensors (Basel). 2020 Apr 2;20(7):1992. doi: 10.3390/s20071992.

DOI:10.3390/s20071992
PMID:32252432
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7181140/
Abstract

Various traffic-sensing technologies have been employed to facilitate traffic control. Due to certain factors, e.g., malfunctioning devices and artificial mistakes, missing values typically occur in the Intelligent Transportation System (ITS) sensing datasets, resulting in a decrease in the data quality. In this study, an integrated imputation algorithm based on fuzzy C-means (FCM) and the genetic algorithm (GA) is proposed to improve the accuracy of the estimated values. The GA is applied to optimize the parameter of the membership degree and the number of cluster centroids in the FCM model. An experimental test of the taxi global positioning system (GPS) data in Manhattan, New York City, is employed to demonstrate the effectiveness of the integrated imputation approach. Three evaluation criteria, the root mean squared error (RMSE), correlation coefficient (R), and relative accuracy (RA), are used to verify the experimental results. Under the ±5% and ±10% thresholds, the average RAs obtained by the integrated imputation method are 0.576 and 0.785, which remain the highest among different methods, indicating that the integrated imputation method outperforms the history imputation method and the conventional FCM method. On the other hand, the clustering imputation performance with the Euclidean distance is better than that with the Manhattan distance. Thus, our proposed integrated imputation method can be employed to estimate the missing values in the daily traffic management.

摘要

已经采用了各种交通感应技术来实现交通管制。由于某些因素,例如设备故障和人为错误,智能交通系统(ITS)感应数据集中通常会出现缺失值,从而降低数据质量。在本研究中,提出了一种基于模糊 C 均值(FCM)和遗传算法(GA)的集成插补算法,以提高估计值的准确性。GA 用于优化 FCM 模型中隶属度和聚类中心数的参数。通过对纽约市曼哈顿出租车全球定位系统(GPS)数据的实验测试,验证了集成插补方法的有效性。采用均方根误差(RMSE)、相关系数(R)和相对精度(RA)三个评估标准来验证实验结果。在±5%和±10%的阈值下,集成插补方法获得的平均 RA 分别为 0.576 和 0.785,在不同方法中保持最高,表明集成插补方法优于历史插补方法和传统 FCM 方法。另一方面,基于欧几里得距离的聚类插补性能优于基于曼哈顿距离的聚类插补性能。因此,我们提出的集成插补方法可用于估计日常交通管理中的缺失值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/a19136279851/sensors-20-01992-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/32acbcfbb44f/sensors-20-01992-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/db346bc75c06/sensors-20-01992-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/405caef94209/sensors-20-01992-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/ef948b3f6f7a/sensors-20-01992-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/f36d2870ab46/sensors-20-01992-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/1014982190ac/sensors-20-01992-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/f92a6254cf8e/sensors-20-01992-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/7b6fd4c44657/sensors-20-01992-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/ee4bd48a9113/sensors-20-01992-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/a19136279851/sensors-20-01992-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/32acbcfbb44f/sensors-20-01992-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/db346bc75c06/sensors-20-01992-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/405caef94209/sensors-20-01992-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/ef948b3f6f7a/sensors-20-01992-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/f36d2870ab46/sensors-20-01992-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/1014982190ac/sensors-20-01992-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/f92a6254cf8e/sensors-20-01992-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/7b6fd4c44657/sensors-20-01992-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/ee4bd48a9113/sensors-20-01992-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6cd/7181140/a19136279851/sensors-20-01992-g010.jpg

相似文献

1
An Integrated Fuzzy C-Means Method for Missing Data Imputation Using Taxi GPS Data.基于出租车 GPS 数据的缺失数据插补的集成模糊 C 均值方法。
Sensors (Basel). 2020 Apr 2;20(7):1992. doi: 10.3390/s20071992.
2
Advanced methods for missing values imputation based on similarity learning.基于相似性学习的缺失值插补先进方法。
PeerJ Comput Sci. 2021 Jul 21;7:e619. doi: 10.7717/peerj-cs.619. eCollection 2021.
3
Towards clustering of incomplete microarray data without the use of imputation.迈向无需插补的不完整微阵列数据聚类
Bioinformatics. 2007 Jan 1;23(1):107-13. doi: 10.1093/bioinformatics/btl555. Epub 2006 Oct 31.
4
A Repair Method for Missing Traffic Data Based on FCM, Optimized by the Twice Grid Optimization and Sparrow Search Algorithms.基于 FCM 并经两次网格优化和麻雀搜索算法优化的缺失交通数据修复方法。
Sensors (Basel). 2022 Jun 6;22(11):4304. doi: 10.3390/s22114304.
5
A Kriging based spatiotemporal approach for traffic volume data imputation.基于克里金的时空方法进行交通量数据插补。
PLoS One. 2018 Apr 17;13(4):e0195957. doi: 10.1371/journal.pone.0195957. eCollection 2018.
6
A classifier ensemble approach for the missing feature problem.分类器集成方法解决缺失特征问题。
Artif Intell Med. 2012 May;55(1):37-50. doi: 10.1016/j.artmed.2011.11.006. Epub 2011 Dec 20.
7
A hybrid imputation approach for microarray missing value estimation.一种用于微阵列缺失值估计的混合插补方法。
BMC Genomics. 2015;16 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2164-16-S9-S1. Epub 2015 Aug 17.
8
Travel Time Estimation Using Freeway Point Detector Data Based on Evolving Fuzzy Neural Inference System.基于进化模糊神经推理系统的高速公路点检测器数据的行程时间估计
PLoS One. 2016 Feb 1;11(2):e0147263. doi: 10.1371/journal.pone.0147263. eCollection 2016.
9
NS-kNN: a modified k-nearest neighbors approach for imputing metabolomics data.NS-kNN:一种改进的 k-最近邻方法,用于代谢组学数据插补。
Metabolomics. 2018 Nov 23;14(12):153. doi: 10.1007/s11306-018-1451-8.
10
An Adaptive Ellipse Distance Density Peak Fuzzy Clustering Algorithm Based on the Multi-target Traffic Radar.一种基于多目标交通雷达的自适应椭圆距离密度峰值模糊聚类算法。
Sensors (Basel). 2020 Aug 31;20(17):4920. doi: 10.3390/s20174920.

引用本文的文献

1
An Expressway ETC Missing Data Restoration Model Considering Multi-Attribute Features.一种考虑多属性特征的高速公路ETC缺失数据恢复模型
Sensors (Basel). 2023 Oct 26;23(21):8745. doi: 10.3390/s23218745.
2
A Belief Two-Level Weighted Clustering Method for Incomplete Pattern Based on Multiview Fusion.基于多视图融合的不完全模式信念双层加权聚类方法。
Comput Intell Neurosci. 2022 Nov 30;2022:2895338. doi: 10.1155/2022/2895338. eCollection 2022.
3
A Repair Method for Missing Traffic Data Based on FCM, Optimized by the Twice Grid Optimization and Sparrow Search Algorithms.

本文引用的文献

1
Missing data imputation via the expectation-maximization algorithm can improve principal component analysis aimed at deriving biomarker profiles and dietary patterns.通过期望最大化算法进行缺失数据插补可以改进主成分分析,以得出生物标志物图谱和饮食模式。
Nutr Res. 2020 Mar;75:67-76. doi: 10.1016/j.nutres.2020.01.001. Epub 2020 Jan 9.
2
R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data.R-Ensembler:一种基于粗糙集的贪婪集成属性选择算法,具有 kNN 插补功能,用于医学数据的分类。
Comput Methods Programs Biomed. 2020 Feb;184:105122. doi: 10.1016/j.cmpb.2019.105122. Epub 2019 Oct 8.
3
基于 FCM 并经两次网格优化和麻雀搜索算法优化的缺失交通数据修复方法。
Sensors (Basel). 2022 Jun 6;22(11):4304. doi: 10.3390/s22114304.
Traffic Estimation for Large Urban Road Network with High Missing Data Ratio.
针对高缺失数据率的大型城市道路网络的交通流量估计
Sensors (Basel). 2019 Jun 24;19(12):2813. doi: 10.3390/s19122813.
4
Improved conditional imputation for linear regression with a randomly censored predictor.带有随机删失预测变量的线性回归的改进条件推断。
Stat Methods Med Res. 2019 Feb;28(2):432-444. doi: 10.1177/0962280217727033. Epub 2017 Aug 22.
5
A study on the use of imputation methods for experimentation with Radial Basis Function Network classifiers handling missing attribute values: the good synergy between RBFNs and EventCovering method.基于径向基函数网络分类器处理缺失属性值的实验中插补方法的研究:RBFNs 与事件覆盖法的良好协同作用。
Neural Netw. 2010 Apr;23(3):406-18. doi: 10.1016/j.neunet.2009.11.014. Epub 2009 Nov 26.
6
Missing value estimation methods for DNA microarrays.DNA微阵列的缺失值估计方法。
Bioinformatics. 2001 Jun;17(6):520-5. doi: 10.1093/bioinformatics/17.6.520.