Zeng Wei, Lin Chengqiao, Lin Juncong, Jiang Jincheng, Xia Jiazhi, Turkay Cagatay, Chen Wei
IEEE Trans Vis Comput Graph. 2021 Feb;27(2):839-848. doi: 10.1109/TVCG.2020.3030410. Epub 2021 Jan 28.
Deep learning methods are being increasingly used for urban traffic prediction where spatiotemporal traffic data is aggregated into sequentially organized matrices that are then fed into convolution-based residual neural networks. However, the widely known modifiable areal unit problem within such aggregation processes can lead to perturbations in the network inputs. This issue can significantly destabilize the feature embeddings and the predictions - rendering deep networks much less useful for the experts. This paper approaches this challenge by leveraging unit visualization techniques that enable the investigation of many-to-many relationships between dynamically varied multi-scalar aggregations of urban traffic data and neural network predictions. Through regular exchanges with a domain expert, we design and develop a visual analytics solution that integrates 1) a Bivariate Map equipped with an advanced bivariate colormap to simultaneously depict input traffic and prediction errors across space, 2) a Moran's I Scatterplot that provides local indicators of spatial association analysis, and 3) a Multi-scale Attribution View that arranges non-linear dot plots in a tree layout to promote model analysis and comparison across scales. We evaluate our approach through a series of case studies involving a real-world dataset of Shenzhen taxi trips, and through interviews with domain experts. We observe that geographical scale variations have important impact on prediction performances, and interactive visual exploration of dynamically varying inputs and outputs benefit experts in the development of deep traffic prediction models.
深度学习方法越来越多地用于城市交通预测,在这种预测中,时空交通数据被聚合为按顺序组织的矩阵,然后输入基于卷积的残差神经网络。然而,这种聚合过程中广为人知的可变面积单元问题可能会导致网络输入的扰动。这个问题会显著破坏特征嵌入和预测的稳定性,使深度网络对专家的用处大大降低。本文通过利用单元可视化技术来应对这一挑战,这些技术能够研究城市交通数据动态变化的多标量聚合与神经网络预测之间的多对多关系。通过与领域专家的定期交流,我们设计并开发了一种可视化分析解决方案,该方案集成了:1)一个配备高级双变量色图的双变量地图,用于同时描绘空间上的输入交通和预测误差;2)一个莫兰指数散点图,提供空间关联分析的局部指标;3)一个多尺度归因视图,以树形布局排列非线性点图,以促进跨尺度的模型分析和比较。我们通过一系列涉及深圳出租车行程真实世界数据集的案例研究以及与领域专家的访谈来评估我们的方法。我们观察到地理尺度变化对预测性能有重要影响,对动态变化的输入和输出进行交互式可视化探索有助于专家开发深度交通预测模型。