UKCRC Centre for Diet and Activity Research (CEDAR), MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom.
London School of Hygiene & Tropical Medicine, London, United Kingdom.
PLoS One. 2018 May 2;13(5):e0196521. doi: 10.1371/journal.pone.0196521. eCollection 2018.
Street imagery is a promising and growing big data source providing current and historical images in more than 100 countries. Studies have reported using this data to audit road infrastructure and other built environment features. Here we explore a novel application, using Google Street View (GSV) to predict travel patterns at the city level.
We sampled 34 cities in Great Britain. In each city, we accessed 2000 GSV images from 1000 random locations. We selected archived images from time periods overlapping with the 2011 Census and the 2011-2013 Active People Survey (APS). We manually annotated the images into seven categories of road users. We developed regression models with the counts of images of road users as predictors. The outcomes included Census-reported commute shares of four modes (combined walking plus public transport, cycling, motorcycle, and car), as well as APS-reported past-month participation in walking and cycling.
We found high correlations between GSV counts of cyclists ('GSV-cyclists') and cycle commute mode share (r = 0.92)/past-month cycling (r = 0.90). Likewise, GSV-pedestrians was moderately correlated with past-month walking for transport (r = 0.46), GSV-motorcycles was moderately correlated with commute share of motorcycles (r = 0.44), and GSV-buses was highly correlated with commute share of walking plus public transport (r = 0.81). GSV-car was not correlated with car commute mode share (r = -0.12). However, in multivariable regression models, all outcomes were predicted well, except past-month walking. The prediction performance was measured using cross-validation analyses. GSV-buses and GSV-cyclists are the strongest predictors for most outcomes.
GSV images are a promising new big data source to predict urban mobility patterns. Predictive power was the greatest for those modes that varied the most (cycle and bus). With its ability to identify mode of travel and capture street activity often excluded in routinely carried out surveys, GSV has the potential to be complementary to new and traditional data. With half the world's population covered by street imagery, and with up to 10 years historical data available in GSV, further testing across multiple settings is warranted both for cross-sectional and longitudinal assessments.
街景图像是一种很有前景且不断发展的大数据资源,它提供了 100 多个国家的当前和历史图像。已有研究报告称,可利用这些数据来审计道路基础设施和其他建成环境特征。在此,我们探索了一种新的应用,即利用谷歌街景(GSV)来预测城市层面的出行模式。
我们在英国抽取了 34 个城市。在每个城市中,我们从 1000 个随机位置访问了 2000 张 GSV 图像。我们从与 2011 年人口普查和 2011-2013 年活跃人群调查(APS)时间重叠的时期中选择存档图像。我们将图像手动标注为 7 个类别的道路使用者。我们以道路使用者的图像数量作为预测因子,建立了回归模型。结果包括人口普查报告的四种模式(步行加公共交通、骑自行车、骑摩托车和开车)的通勤比例,以及 APS 报告的过去一个月的步行和骑自行车参与率。
我们发现 GSV 计数的自行车(“GSV 自行车”)与自行车通勤模式比例(r = 0.92)/过去一个月的自行车骑行(r = 0.90)之间存在高度相关性。同样,GSV 行人与交通方式的步行(r = 0.46)也存在中度相关性,GSV 摩托车与摩托车通勤比例(r = 0.44)也存在中度相关性,GSV 公共汽车与步行加公共交通的通勤比例(r = 0.81)也存在高度相关性。GSV 汽车与汽车通勤模式比例(r = -0.12)之间没有相关性。然而,在多变量回归模型中,除了过去一个月的步行外,所有结果都得到了很好的预测。通过交叉验证分析来衡量预测性能。GSV 公共汽车和 GSV 自行车是大多数结果的最强预测因子。
GSV 图像是一种很有前景的新大数据资源,可用于预测城市流动模式。对于变化最大的模式(自行车和公共汽车),预测能力最强。GSV 可以识别出行方式,并捕捉通常在常规调查中排除的街道活动,因此它有可能成为新数据和传统数据的补充。由于世界上一半的人口都覆盖在街景图像中,并且 GSV 中可提供长达 10 年的历史数据,因此在多个环境中进行进一步测试是必要的,包括横断面和纵向评估。