Suppr超能文献

汇编器:在海量地理传感数据中高效发现空间协同进化模式。

Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data.

作者信息

Zhang Chao, Zheng Yu, Ma Xiuli, Han Jiawei

机构信息

Dept. of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Microsoft Research, Beijing, China.

出版信息

KDD. 2015 Aug;2015:1415-1424. doi: 10.1145/2783258.2783394.

Abstract

Recent years have witnessed the wide proliferation of geo-sensory applications wherein a bundle of sensors are deployed at different locations to cooperatively monitor the target condition. Given massive geo-sensory data, we study the problem of mining spatial co-evolving patterns (SCPs), ., groups of sensors that are spatially correlated and co-evolve frequently in their readings. SCP mining is of great importance to various real-world applications, yet it is challenging because (1) the truly interesting evolutions are often flooded by numerous trivial fluctuations in the geo-sensory time series; and (2) the pattern search space is extremely large due to the spatiotemporal combinatorial nature of SCP. In this paper, we propose a two-stage method called Assembler. In the first stage, Assembler filters trivial fluctuations using wavelet transform and detects frequent evolutions for individual sensors via a segment-and-group approach. In the second stage, Assembler generates SCPs by assembling the frequent evolutions of individual sensors. Leveraging the spatial constraint, it conceptually organizes all the SCPs into a novel structure called the SCP search tree, which facilitates the effective pruning of the search space to generate SCPs efficiently. Our experiments on both real and synthetic data sets show that Assembler is effective, efficient, and scalable.

摘要

近年来,地理传感应用广泛普及,其中一系列传感器部署在不同位置,以协同监测目标状况。针对海量的地理传感数据,我们研究挖掘空间共同演化模式(SCP)的问题,即空间相关且读数频繁共同演化的传感器组。SCP挖掘对各种实际应用至关重要,但具有挑战性,原因如下:(1)真正有趣的演化往往被地理传感时间序列中的大量琐碎波动所淹没;(2)由于SCP的时空组合性质,模式搜索空间极大。在本文中,我们提出一种名为汇编器(Assembler)的两阶段方法。在第一阶段,汇编器使用小波变换过滤琐碎波动,并通过分段分组方法检测单个传感器的频繁演化。在第二阶段,汇编器通过组合单个传感器的频繁演化来生成SCP。利用空间约束,它从概念上将所有SCP组织成一种名为SCP搜索树的新颖结构,这有助于有效修剪搜索空间,从而高效地生成SCP。我们在真实数据集和合成数据集上的实验表明,汇编器是有效、高效且可扩展的。

相似文献

3
Mining Non-Redundant Co-Location Patterns.
IEEE Trans Neural Netw Learn Syst. 2022 Nov;33(11):6613-6626. doi: 10.1109/TNNLS.2021.3082628. Epub 2022 Oct 27.
5
HUOPM: High-Utility Occupancy Pattern Mining.
IEEE Trans Cybern. 2020 Mar;50(3):1195-1208. doi: 10.1109/TCYB.2019.2896267. Epub 2019 Feb 20.
6
Efficient discovery of risk patterns in medical data.
Artif Intell Med. 2009 Jan;45(1):77-89. doi: 10.1016/j.artmed.2008.07.008. Epub 2008 Sep 9.
7
Fast Utility Mining on Sequence Data.
IEEE Trans Cybern. 2021 Feb;51(2):487-500. doi: 10.1109/TCYB.2020.2970176. Epub 2021 Jan 15.
8
SWAP-Assembler: scalable and efficient genome assembly towards thousands of cores.
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S2. doi: 10.1186/1471-2105-15-S9-S2. Epub 2014 Sep 10.
9
Visualizing Large-Scale Spatial Time Series with GeoChron.
IEEE Trans Vis Comput Graph. 2024 Jan;30(1):1194-1204. doi: 10.1109/TVCG.2023.3327162. Epub 2023 Dec 25.

引用本文的文献

1
A framework for spatial-temporal cluster evolution representation and analysis based on graphs.
Sci Rep. 2024 Sep 27;14(1):22137. doi: 10.1038/s41598-024-72504-x.
2
Prediction model of laparoendoscopic single-site surgery in gynecology using machine learning algorithm.
Wideochir Inne Tech Maloinwazyjne. 2021 Sep;16(3):587-596. doi: 10.5114/wiitm.2021.106081. Epub 2021 May 14.

本文引用的文献

1
Exact Discovery of Time Series Motifs.
Proc SIAM Int Conf Data Min. 2009;2009:473-484. doi: 10.1137/1.9781611972795.41.
2
Change-point detection in time-series data by relative density-ratio estimation.
Neural Netw. 2013 Jul;43:72-83. doi: 10.1016/j.neunet.2013.01.012. Epub 2013 Feb 4.
3
Online segmentation of time series based on polynomial least-squares approximations.
IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2232-45. doi: 10.1109/TPAMI.2010.44.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验