Huang Yu-Yuan, Chu Wei-Ta
National Cheng Kung University, Tainan, Taiwan.
J Imaging Inform Med. 2024 Nov 4. doi: 10.1007/s10278-024-01302-8.
Multiple instance learning (MIL) has become a cornerstone in whole slide image (WSI) analysis. In this paradigm, a WSI is conceptualized as a bag of instances. Instance features are extracted by a feature extractor, and then a feature aggregator fuses these instance features into a bag representation. In this paper, we advocate that both feature extraction and aggregation can be enhanced by considering the context or correlation between instances. We learn contextual features between instances, and then fuse contextual features with instance features to enhance instance representations. For feature aggregation, we observe performance instability particularly when disease-positive instances are only a minor fraction of the WSI. We introduce a self-attention mechanism to discover correlation among instances and foster more effective bag representations. Through comprehensive testing, we have demonstrated that the proposed method outperforms existing WSI classification methods by 1 to 4% classification accuracy, based on the Camelyon16 and the TCGA-NSCLC datasets. The proposed method also outperforms the most recent weakly supervised WSI segmentation method by 0.6 in terms of the Dice coefficient, based on the Camelyon16 dataset.
多实例学习(MIL)已成为全切片图像(WSI)分析的基石。在这种范式中,WSI被概念化为一袋实例。实例特征由特征提取器提取,然后特征聚合器将这些实例特征融合成一个袋表示。在本文中,我们主张通过考虑实例之间的上下文或相关性来增强特征提取和聚合。我们学习实例之间的上下文特征,然后将上下文特征与实例特征融合以增强实例表示。对于特征聚合,我们观察到性能不稳定,特别是当疾病阳性实例仅占WSI的一小部分时。我们引入自注意力机制来发现实例之间的相关性并促进更有效的袋表示。通过全面测试,我们证明了基于Camelyon16和TCGA-NSCLC数据集,所提出的方法在分类准确率上比现有WSI分类方法高出1%至4%。基于Camelyon16数据集,所提出的方法在Dice系数方面也比最新的弱监督WSI分割方法高出0.6。