本地化拓扑数据结构：在森林中查询树。

Toward Localized Topological Data Structures: Querying the Forest for the Tree.

出版信息

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):173-183. doi: 10.1109/TVCG.2019.2934257. Epub 2019 Aug 12.

DOI:10.1109/TVCG.2019.2934257

Abstract

Topological approaches to data analysis can answer complex questions about the number, connectivity, and scale of intrinsic features in scalar data. However, the global nature of many topological structures makes their computation challenging at scale, and thus often limits the size of data that can be processed. One key quality to achieving scalability and performance on modern architectures is data locality, i.e., a process operates on data that resides in a nearby memory system, avoiding frequent jumps in data access patterns. From this perspective, topological computations are particularly challenging because the implied data structures represent features that can span the entire data set, often requiring a global traversal phase that limits their scalability. Traditionally, expensive preprocessing is considered an acceptable trade-off as it accelerates all subsequent queries. Most published use cases, however, explore only a fraction of all possible queries, most often those returning small, local features. In these cases, much of the global information is not utilized, yet computing it dominates the overall response time. We address this challenge for merge trees, one of the most commonly used topological structures. In particular, we propose an alternative representation, the merge forest, a collection of local trees corresponding to regions in a domain decomposition. Local trees are connected by a bridge set that allows us to recover any necessary global information at query time. The resulting system couples (i) a preprocessing that scales linearly in practice with (ii) fast runtime queries that provide the same functionality as traditional queries of a global merge tree. We test the scalability of our approach on a shared-memory parallel computer and demonstrate how data structure locality enables the analysis of large data with an order of magnitude performance improvement over the status quo. Furthermore, a merge forest reduces the memory overhead compared to a global merge tree and enables the processing of data sets that are an order of magnitude larger than possible with previous algorithms.

摘要

拓扑数据分析方法可以回答关于标量数据中内在特征的数量、连通性和规模的复杂问题。然而，许多拓扑结构的全局性质使得它们在大规模计算时具有挑战性，因此通常限制了可以处理的数据量。在现代架构上实现可扩展性和性能的一个关键质量是数据局部性，即一个过程在位于附近内存系统中的数据上运行，避免数据访问模式的频繁跳转。从这个角度来看，拓扑计算特别具有挑战性，因为隐含的数据结构表示可以跨越整个数据集的特征，通常需要全局遍历阶段，从而限制了它们的可扩展性。传统上，昂贵的预处理被认为是可以接受的折衷方案，因为它可以加速所有后续查询。然而，大多数已发布的用例仅探索所有可能查询的一小部分，通常是那些返回小的、局部特征的查询。在这些情况下，大部分全局信息未被利用，但计算它占据了总体响应时间的主导地位。我们针对合并树（拓扑结构中最常用的结构之一）解决了这一挑战。特别是，我们提出了一种替代表示形式，即合并森林，它是对应于域分解中区域的局部树的集合。局部树通过桥集连接，允许我们在查询时恢复任何必要的全局信息。由此产生的系统结合了（i）实践中线性扩展的预处理和（ii）快速运行时查询，这些查询提供了与全局合并树的传统查询相同的功能。我们在共享内存并行计算机上测试了我们方法的可扩展性，并展示了数据结构局部性如何使我们能够分析比现有算法能够处理的大一个数量级的数据，从而实现性能提高一个数量级。此外，与全局合并树相比，合并森林减少了内存开销，并能够处理比以前的算法可能处理的大一个数量级的数据。

相似文献

Toward Localized Topological Data Structures: Querying the Forest for the Tree.本地化拓扑数据结构：在森林中查询树。

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):173-183. doi: 10.1109/TVCG.2019.2934257. Epub 2019 Aug 12.

Visually defining and querying consistent multi-granular clinical temporal abstractions.直观定义和查询一致的多粒度临床时间抽象。

Artif Intell Med. 2012 Feb;54(2):75-101. doi: 10.1016/j.artmed.2011.10.004. Epub 2011 Dec 15.

Mapping high-fidelity volume rendering for medical imaging to CPU, GPU and many-core architectures.医学成像的高保真体绘制映射到 CPU、GPU 和多核架构。

IEEE Trans Vis Comput Graph. 2009 Nov-Dec;15(6):1563-70. doi: 10.1109/TVCG.2009.164.

Kd-Jump: a path-preserving stackless traversal for faster isosurface raytracing on GPUs.Kd-Jump：一种保持路径的无栈遍历方法，可在 GPU 上实现更快的等表面光线追踪。

IEEE Trans Vis Comput Graph. 2009 Nov-Dec;15(6):1555-62. doi: 10.1109/TVCG.2009.161.

Unordered Task-Parallel Augmented Merge Tree Construction.无序任务并行增强合并树构建。

IEEE Trans Vis Comput Graph. 2021 Aug;27(8):3585-3596. doi: 10.1109/TVCG.2021.3076875. Epub 2021 Jun 30.

Geometry-Aware Merge Tree Comparisons for Time-Varying Data With Interleaving Distances.用于具有交错距离的时变数据的几何感知合并树比较

IEEE Trans Vis Comput Graph. 2023 Aug;29(8):3489-3506. doi: 10.1109/TVCG.2022.3163349. Epub 2023 Jun 29.

cuRnet: an R package for graph traversing on GPU.cuRnet：一个在 GPU 上进行图遍历的 R 包。

BMC Bioinformatics. 2018 Oct 15;19(Suppl 10):356. doi: 10.1186/s12859-018-2310-3.

Fast Comparative Analysis of Merge Trees Using Locality Sensitive Hashing.使用局部敏感哈希的合并树快速比较分析

IEEE Trans Vis Comput Graph. 2025 Jan;31(1):141-151. doi: 10.1109/TVCG.2024.3456383. Epub 2024 Nov 25.

A Structural Average of Labeled Merge Trees for Uncertainty Visualization.用于不确定性可视化的带标签合并树的结构平均值

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):832-842. doi: 10.1109/TVCG.2019.2934242. Epub 2019 Aug 12.

Rapid and Precise Topological Comparison with Merge Tree Neural Networks.使用合并树神经网络进行快速精确的拓扑比较

IEEE Trans Vis Comput Graph. 2025 Jan;31(1):1322-1332. doi: 10.1109/TVCG.2024.3456395. Epub 2024 Nov 25.

本地化拓扑数据结构：在森林中查询树。

Toward Localized Topological Data Structures: Querying the Forest for the Tree.

出版信息

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):173-183. doi: 10.1109/TVCG.2019.2934257. Epub 2019 Aug 12.

DOI:10.1109/TVCG.2019.2934257

PMID:31403428

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

本地化拓扑数据结构：在森林中查询树。

Toward Localized Topological Data Structures: Querying the Forest for the Tree.

出版信息

相似文献

本地化拓扑数据结构：在森林中查询树。

Toward Localized Topological Data Structures: Querying the Forest for the Tree.

出版信息

相似文献