Suppr超能文献

用于地理参考大数据高效近似分析的多边形简化

Polygon Simplification for the Efficient Approximate Analytics of Georeferenced Big Data.

作者信息

Al Jawarneh Isam Mashhour, Foschini Luca, Bellavista Paolo

机构信息

Department of Computer Science, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates.

Dipartimento di Informatica-Scienza e Ingegneria, University of Bologna, Viale Risorgimento 2, 40136 Bologna, Italy.

出版信息

Sensors (Basel). 2023 Sep 29;23(19):8178. doi: 10.3390/s23198178.

Abstract

The unprecedented availability of sensor networks and GPS-enabled devices has caused the accumulation of voluminous georeferenced data streams. These data streams offer an opportunity to derive valuable insights and facilitate decision making for urban planning. However, processing and managing such data is challenging, given the size and multidimensionality of these data. Therefore, there is a growing interest in spatial approximate query processing depending on stratified-like sampling methods. However, in these solutions, as the number of strata increases, response time grows, thus counteracting the benefits of sampling. In this paper, we originally show the design and realization of a novel online geospatial approximate processing solution called GeoRAP. GeoRAP employs a front-stage filter based on the Ramer-Douglas-Peucker line simplification algorithm to reduce the size of study area coverage; thereafter, it employs a spatial stratified-like sampling method that minimizes the number of strata, thus increasing throughput and minimizing response time, while keeping the accuracy loss in check. Our method is applicable for various online and batch geospatial processing workloads, including complex geo-statistics, aggregation queries, and the generation of region-based aggregate geo-maps such as choropleth maps and heatmaps. We have extensively tested the performance of our prototyped solution with real-world big spatial data, and this paper shows that GeoRAP can outperform state-of-the-art baselines by an order of magnitude in terms of throughput while statistically obtaining results with good accuracy.

摘要

传感器网络和具备全球定位系统(GPS)功能的设备前所未有的普及,导致了大量地理参考数据流的积累。这些数据流为获取有价值的见解以及促进城市规划决策提供了契机。然而,鉴于这些数据的规模和多维度性,处理和管理此类数据颇具挑战性。因此,基于分层抽样方法的空间近似查询处理正日益受到关注。然而,在这些解决方案中,随着分层数量的增加,响应时间会变长,从而抵消了抽样的优势。在本文中,我们首次展示了一种名为GeoRAP的新型在线地理空间近似处理解决方案的设计与实现。GeoRAP采用基于拉默 - 道格拉斯 - 普克(Ramer-Douglas-Peucker)线简化算法的前端过滤器来减小研究区域覆盖范围的大小;此后,它采用一种空间分层抽样方法,该方法可将分层数量降至最低,从而提高吞吐量并将响应时间减至最短,同时控制精度损失。我们的方法适用于各种在线和批处理地理空间处理工作负载,包括复杂的地理统计、聚合查询以及生成基于区域的聚合地理地图,如分级统计图和热力图。我们使用真实世界的大型空间数据对我们的原型解决方案的性能进行了广泛测试,本文表明GeoRAP在吞吐量方面可比最先进的基线方法高出一个数量级,同时在统计上能获得具有良好准确性的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d83/10575302/118a9bf24a04/sensors-23-08178-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验