National United Engineering Laboratory of Integrated and Intelligent Transportation, School of Transportation and Logistics, Southwest Jiaotong University, Hi-Tech Industrial Development Zone, Chengdu, Sichuan, China.
Model Risk Management, Bank of America, Charlotte, NC, United States of America.
PLoS One. 2018 Apr 17;13(4):e0195957. doi: 10.1371/journal.pone.0195957. eCollection 2018.
Along with the rapid development of Intelligent Transportation Systems, traffic data collection technologies have progressed fast. The emergence of innovative data collection technologies such as remote traffic microwave sensor, Bluetooth sensor, GPS-based floating car method, and automated license plate recognition, has significantly increased the variety and volume of traffic data. Despite the development of these technologies, the missing data issue is still a problem that poses great challenge for data based applications such as traffic forecasting, real-time incident detection, dynamic route guidance, and massive evacuation optimization. A thorough literature review suggests most current imputation models either focus on the temporal nature of the traffic data and fail to consider the spatial information of neighboring locations or assume the data follow a certain distribution. These two issues reduce the imputation accuracy and limit the use of the corresponding imputation methods respectively. As a result, this paper presents a Kriging based data imputation approach that is able to fully utilize the spatiotemporal correlation in the traffic data and that does not assume the data follow any distribution. A set of scenarios with different missing rates are used to evaluate the performance of the proposed method. The performance of the proposed method was compared with that of two other widely used methods, historical average and K-nearest neighborhood. Comparison results indicate that the proposed method has the highest imputation accuracy and is more flexible compared to other methods.
随着智能交通系统的快速发展,交通数据采集技术也得到了快速发展。创新的数据采集技术,如远程交通微波传感器、蓝牙传感器、基于 GPS 的浮动车法和自动车牌识别技术的出现,大大增加了交通数据的种类和数量。尽管这些技术得到了发展,但缺失数据问题仍然是交通预测、实时事件检测、动态路线引导和大规模疏散优化等基于数据的应用面临的巨大挑战。全面的文献回顾表明,大多数当前的插补模型要么侧重于交通数据的时间特性,而未能考虑到相邻位置的空间信息,要么假设数据遵循某种分布。这两个问题分别降低了插补精度并限制了相应插补方法的使用。因此,本文提出了一种基于克里金的插补方法,该方法能够充分利用交通数据中的时空相关性,并且不假设数据遵循任何分布。使用一组具有不同缺失率的场景来评估所提出方法的性能。将所提出的方法的性能与其他两种广泛使用的方法,即历史平均值和 K-最近邻进行了比较。比较结果表明,与其他方法相比,所提出的方法具有最高的插补精度和更高的灵活性。