IEEE Trans Med Imaging. 2022 Dec;41(12):3835-3848. doi: 10.1109/TMI.2022.3198526. Epub 2022 Dec 2.
Pathology analysis is crucial to precise cancer diagnoses and the succeeding treatment plan as well. To detect abnormality in histopathology images with prevailing patch-based convolutional neural networks (CNNs), contextual information often serves as a powerful cue. However, as whole-slide images (WSIs) are characterized by intense morphological heterogeneity and extensive tissue scale, a straightforward visual span to a larger context may not well capture the information closely associated with the focal patch. In this paper, we propose a novel pixel-offset based patch-location method to identify high-representative tissues, with a CNN backbone. Pathology Deformable Conditional Random Field (PDCRF) is proposed to learn the offsets and weights of neighboring contexts in a spatial-adaptive manner, to search for high-representative patches. A CNN structure with the localized patches as training input is then capable of consistently reaching superior classification outcomes for histology images. Overall, the proposed method has achieved state-of-the-art performance, in terms of the test classification accuracy improvement to the baseline by 1.15-2.60%, 0.78-1.78%, and 1.47-2.18% on TCGA public datasets of TCGA-STAD, TCGA-COAD, and TCGA-READ respectively. It also achieves 88.95% test accuracy and 0.920 test AUC on Camelyon 16. To show the effectiveness of the proposed framework on downstream tasks, we take a further step by incorporating an active learning model, which noticeably reduces the number of manual annotations by PDCRF to reach a parallel patch-based histology classifier.
病理学分析对于精确的癌症诊断和随后的治疗计划至关重要。为了使用现有的基于补丁的卷积神经网络(CNN)在组织病理学图像中检测异常,上下文信息通常是一个强大的线索。然而,由于全幻灯片图像(WSI)的形态异质性强烈,组织范围广泛,因此直接通过视觉跨度获取更大的上下文可能无法很好地捕获与焦点补丁密切相关的信息。在本文中,我们提出了一种基于像素偏移的补丁定位方法,该方法使用 CNN 骨干网络来识别高代表性组织。病理学可变形条件随机场(PDCRF)用于以空间自适应的方式学习邻域上下文的偏移量和权重,以搜索高代表性的补丁。然后,具有局部化补丁作为训练输入的 CNN 结构能够始终如一地为组织学图像实现更好的分类结果。总体而言,该方法在 TCGA-STAD、TCGA-COAD 和 TCGA-READ 的 TCGA 公共数据集上,通过将基线的测试分类准确率提高了 1.15-2.60%、0.78-1.78%和 1.47-2.18%,实现了最先进的性能。它在 Camelyon 16 上的测试准确率为 88.95%,测试 AUC 为 0.920。为了展示所提出的框架在下游任务中的有效性,我们进一步采用主动学习模型,该模型通过 PDCRF 明显减少了手动注释的数量,从而达到了基于补丁的组织病理学分类器的并行效果。