High-performance Data Management for Whole Slide Image Analysis in Digital Pathology

Authors

Leng Haoju, Deng Ruining, Bao Shunxing, Fang Dazheng, Millis Bryan A, Tang Yucheng, Yang Haichun, Wang Xiao, Peng Yifan, Wan Lipeng, Huo Yuankai

Affiliations

Department of Computer Science, Vanderbilt University, Nashville, TN, USA.

Department of Electrical and Computer Engineering, Vanderbilt University Medical Center, Nashville, TN, USA.

Publication information

Proc SPIE Int Soc Opt Eng. 2024 Feb;12933. doi: 10.1117/12.3006273. Epub 2024 Apr 3.

DOI:10.1117/12.3006273

PMID:40343079

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12061081/

Abstract

When dealing with gigapixel digital pathology in whole-slide imaging, a notable proportion of data records is relevant to each analysis operation. For instance, when an image analysis algorithm is deployed on whole-slide images (WSIs), the computational bottleneck often lies in the input/output (I/O) system, because patch-level processing places a considerable I/O load on the computer system. However, this data management process can be further parallelized, since patch-level image processing is typically independent across patches. This paper details our efforts to tackle this data access challenge using the Adaptable IO System version 2 (ADIOS2). We built and released a digital pathology-centric pipeline based on ADIOS2 that streamlines data management across WSIs, and we developed strategies to reduce data retrieval times. The performance evaluation covers two scenarios: (1) a pure CPU-based image analysis scenario ("CPU scenario"), and (2) a GPU-based deep learning framework scenario ("GPU scenario"). In the CPU scenario, ADIOS2 achieves a two-fold speed-up over the brute-force approach. In the GPU scenario, its performance is on par with the cutting-edge GPU I/O acceleration framework NVIDIA Magnum IO GPUDirect Storage (GDS). To our knowledge, this is among the first applications of ADIOS2 in digital pathology. The source code is publicly available at https://github.com/hrlblab/adios.
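The patch-level independence mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline and does not use ADIOS2: the slide is stood in for by a small in-memory, single-channel byte buffer, and the helper names (`patch_grid`, `read_patch`, `mean_intensity`, `analyze`) are hypothetical. It only shows that each patch can be read and processed without reference to any other, so the reads can be issued concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def patch_grid(width, height, patch):
    """Yield (x, y, w, h) boxes of non-overlapping patches covering the image."""
    for y in range(0, height, patch):
        for x in range(0, width, patch):
            # Edge patches are clipped to the image boundary.
            yield x, y, min(patch, width - x), min(patch, height - y)


def read_patch(image, width, box):
    """Copy one patch out of a row-major, single-channel byte buffer.

    This stands in for the I/O step (assumption: a real pipeline would read
    from a slide file or a storage backend instead).
    """
    x, y, w, h = box
    rows = (image[(y + r) * width + x:(y + r) * width + x + w] for r in range(h))
    return b"".join(rows)


def mean_intensity(patch_bytes):
    """Toy per-patch analysis: average pixel value."""
    return sum(patch_bytes) / len(patch_bytes)


def analyze(image, width, height, patch=4, workers=4):
    """Read and analyze every patch concurrently; return {box: mean}."""
    boxes = list(patch_grid(width, height, patch))

    def task(box):
        # Each task touches only its own patch, so tasks never depend on each other.
        return box, mean_intensity(read_patch(image, width, box))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(task, boxes))


if __name__ == "__main__":
    width, height = 10, 6
    # Synthetic "slide": pixel value is x + y.
    image = bytes((x + y) % 256 for y in range(height) for x in range(width))
    for box, mean in sorted(analyze(image, width, height).items()):
        print(box, mean)
```

A thread pool is used here because the simulated work is I/O-shaped; for CPU-heavy per-patch analysis in Python, a process pool would be the more usual choice.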
