D'Silva Krittika, Noulas Anastasios, Musolesi Mirco, Mascolo Cecilia, Sklar Max
1Department of Computer Science, University of Cambridge, Cambridge, UK.
2Center for Data Science, New York University, New York, USA.
EPJ Data Sci. 2018;7(1):13. doi: 10.1140/epjds/s13688-018-0142-z. Epub 2018 May 18.
Estimating revenue and business demand of a newly opened venue is paramount as these early stages often involve critical decisions such as first rounds of staffing and resource allocation. Traditionally, this estimation has been performed through coarse-grained measures such as observing numbers in local venues or venues at similar places (e.g., coffee shops around another station in the same city). The advent of crowdsourced data from devices and services carried by individuals on a daily basis has opened up the possibility of performing better predictions of temporal visitation patterns for locations and venues. In this paper, using mobility data from Foursquare, a location-centric platform, we treat venue categories as proxies for urban activities and analyze how they become popular over time. The main contribution of this work is a prediction framework able to use characteristic temporal signatures of places together with -nearest neighbor metrics capturing similarities among urban regions, to forecast weekly popularity dynamics of a new venue establishment in a city neighborhood. We further show how we are able to forecast the popularity of the new venue after one month following its opening by using locality and temporal similarity as features. For the evaluation of our approach we focus on London. We show that temporally similar areas of the city can be successfully used as inputs of predictions of the visit patterns of new venues, with an improvement of 41% compared to a random selection of wards as a training set for the prediction task. We apply these concepts of temporally similar areas and locality to the real-time predictions related to new venues and show that these features can effectively be used to predict the future trends of a venue. Our findings have the potential to impact the design of location-based technologies and decisions made by new business owners.
估算新开业场所的收入和业务需求至关重要,因为这些早期阶段通常涉及关键决策,如首轮人员配备和资源分配。传统上,这种估算通过粗粒度的方法进行,例如观察当地场所或类似地点(如同一城市另一个车站周围的咖啡店)的客流量。个人日常携带的设备和服务产生的众包数据的出现,为更好地预测地点和场所的时间访问模式提供了可能性。在本文中,我们使用以位置为中心的平台Foursquare的移动性数据,将场所类别作为城市活动的代理,并分析它们如何随时间变得受欢迎。这项工作的主要贡献是一个预测框架,该框架能够使用地点的特征时间特征以及捕捉城市区域之间相似性的 - 最近邻度量,来预测城市社区中新场所的每周人气动态。我们进一步展示了如何通过使用局部性和时间相似性作为特征来预测新场所开业一个月后的人气。为了评估我们的方法,我们以伦敦为重点。我们表明,城市中时间上相似的区域可以成功用作新场所访问模式预测的输入,与随机选择病房作为预测任务的训练集相比,准确率提高了41%。我们将这些时间上相似区域和局部性的概念应用于与新场所相关的实时预测,并表明这些特征可以有效地用于预测场所的未来趋势。我们的研究结果有可能影响基于位置的技术设计和新企业主做出的决策。