Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.

Authors

Aji Ablimit, Wang Fusheng, Saltz Joel H

Affiliations

Department of Mathematics & Computer Science, Emory University.

Department of Biomedical Informatics, Emory University.

Publication information

Proc ACM SIGSPATIAL Int Conf Adv Inf. 2012 Nov 6;2012:309-318. doi: 10.1145/2424321.2424361.

DOI:10.1145/2424321.2424361
PMID:24501719
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3909999/
Abstract

Support of high performance queries on large volumes of scientific spatial data is becoming increasingly important in many applications. This growth is driven by not only geospatial problems in numerous fields, but also emerging scientific applications that are increasingly data- and compute-intensive. For example, digital pathology imaging has become an emerging field during the past decade, where examination of high resolution images of human tissue specimens enables more effective diagnosis, prediction and treatment of diseases. Systematic analysis of large-scale pathology images generates tremendous amounts of spatially derived quantifications of micro-anatomic objects, such as nuclei, blood vessels, and tissue regions. Analytical pathology imaging provides high potential to support image based computer aided diagnosis. One major requirement for this is effective querying of such enormous amounts of data with fast response, which is faced with two major challenges: the "big data" challenge and the high computation complexity. In this paper, we present our work towards building a high performance spatial query system for querying massive spatial data on MapReduce. Our framework takes an on demand index building approach for processing spatial queries and a partition-merge approach for building parallel spatial query pipelines, which fits nicely with the computing model of MapReduce. We demonstrate our framework on supporting multi-way spatial joins for algorithm evaluation and nearest neighbor queries for micro-anatomic objects. To reduce query response time, we propose cost based query optimization to mitigate the effect of data skew. Our experiments show that the framework can efficiently support complex analytical spatial queries on MapReduce.
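
The partition-merge idea with on-demand indexing described in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' implementation: the fixed-grid tiling, the bounding-box representation `(xmin, ymin, xmax, ymax)`, the sort-based throwaway index, and all function names (`tiles_for`, `partition`, `join_tile`, `spatial_join`) are assumptions made for illustration. In a real MapReduce job, `partition` would correspond to the map phase and `join_tile` to a reduce task.

```python
from collections import defaultdict
from itertools import product


def tiles_for(box, tile):
    """Return the ids of every fixed-grid tile that the box overlaps."""
    xmin, ymin, xmax, ymax = box
    xs = range(int(xmin // tile), int(xmax // tile) + 1)
    ys = range(int(ymin // tile), int(ymax // tile) + 1)
    return product(xs, ys)


def partition(boxes, tile):
    """Map step (hypothetical): replicate each object to its overlapping tiles."""
    parts = defaultdict(list)
    for oid, box in boxes.items():
        for t in tiles_for(box, tile):
            parts[t].append((oid, box))
    return parts


def intersects(a, b):
    """True if two boxes overlap; boxes that only touch count as overlapping."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def join_tile(left, right):
    """Reduce step (hypothetical): index one side on demand, probe with the other.

    The "index" is just the right side sorted by xmin, a simple stand-in
    for a real spatial index; it is built per tile and then discarded.
    """
    index = sorted(right, key=lambda item: item[1][0])
    pairs = set()
    for lid, lbox in left:
        for rid, rbox in index:
            if rbox[0] > lbox[2]:
                break  # all remaining right boxes start to the right of lbox
            if intersects(lbox, rbox):
                pairs.add((lid, rid))
    return pairs


def spatial_join(a, b, tile=100.0):
    """Partition both inputs, join tile by tile, then merge the results."""
    pa, pb = partition(a, tile), partition(b, tile)
    result = set()
    for t in pa.keys() & pb.keys():
        # The set union removes duplicates produced by objects that
        # straddle tile boundaries and were joined in several tiles.
        result |= join_tile(pa[t], pb[t])
    return sorted(result)
```

Calling `spatial_join(a, b)` on two dictionaries that map object ids to bounding boxes returns the sorted list of intersecting id pairs. The merge step matters because an object crossing a tile boundary is replicated into every tile it overlaps, so the same pair can be found more than once.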

