Liang Meiyan, Zhang Shupeng, Wang Xikai, Li Bo, Hamza Javed Muhammad, Jia Xiaojun, Wang Lin
IEEE Trans Image Process. 2025;34:4215-4229. doi: 10.1109/TIP.2025.3583127.
Gigapixel whole-slide image (WSI) prediction and region-of-interest localization present considerable challenges due to the diverse range of features both across different slides and within individual slides. Most current methods rely on weakly supervised learning using homogeneous graphs to establish context-aware relevance within slides, often neglecting the rich diversity of heterogeneous information inherent in pathology images. Inspired by the negative sampling strategy of the Determinantal Point Process (DPP) and the hierarchical structure of pathology slides, we introduce the Negative Sample Boosted Hierarchical Heterogeneous Graph Attention Network (NSB-H2GAN). This model addresses the over-smoothing issue typically encountered in classical Graph Convolutional Networks (GCNs) when applied to pathology slides. By incorporating "negative samples" at multiple scales and utilizing hierarchical, heterogeneous feature discrimination, NSB-H2GAN more effectively captures the unique features of each patch, leading to an improved representation of gigapixel WSIs. We evaluated the performance of NSB-H2GAN on three publicly available datasets: CAMELYON16, TCGA-NSCLC and TCGA-COAD. The results show that NSB-H2GAN significantly outperforms existing state-of-the-art methods in both qualitative and quantitative evaluations. Moreover, NSB-H2GAN generates more detailed and interpretable heatmaps, allowing for precise localization of tiny lesions as small as $200\mu m\times 200\mu m$ that are often missed by the human eye. The robust performance of NSB-H2GAN offers a new paradigm for computer-aided pathology diagnosis and holds great potential for advancing the clinical applications of computational pathology.
由于不同幻灯片之间以及单个幻灯片内部存在各种不同的特征,千兆像素全切片图像(WSI)预测和感兴趣区域定位面临着巨大挑战。当前大多数方法依赖于使用同构图的弱监督学习来在幻灯片内建立上下文感知相关性,常常忽略了病理图像中固有的丰富多样的异质信息。受行列式点过程(DPP)的负采样策略和病理幻灯片的层次结构启发,我们引入了负样本增强层次异质图注意力网络(NSB-H2GAN)。该模型解决了经典图卷积网络(GCN)应用于病理幻灯片时通常遇到的过平滑问题。通过在多个尺度上纳入“负样本”并利用层次化的异质特征判别,NSB-H2GAN更有效地捕捉每个补丁的独特特征,从而改进了千兆像素WSI的表示。我们在三个公开可用的数据集上评估了NSB-H2GAN的性能:CAMELYON16、TCGA-NSCLC和TCGA-COAD。结果表明,在定性和定量评估中,NSB-H2GAN均显著优于现有的最先进方法。此外,NSB-H2GAN生成的热图更详细且可解释,能够精确定位小至200μm×200μm的微小病变,而这些病变往往是人眼容易错过的。NSB-H2GAN的强大性能为计算机辅助病理诊断提供了一种新范式,在推进计算病理学的临床应用方面具有巨大潜力。