Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
Department of Urology, Fudan University Shanghai Cancer Center, Shanghai, 200032, China; Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
Med Image Anal. 2023 Jul;87:102824. doi: 10.1016/j.media.2023.102824. Epub 2023 Apr 23.
Gene mutation detection is usually carried out by molecular biological methods, which is expensive and has a long-time cycle. In contrast, pathological images are ubiquitous. If clinically significant gene mutations can be predicted only through pathological images, it will greatly promote the widespread use of gene mutation detection in clinical practice. However, current gene mutation prediction methods based on pathological images are ineffective because of the inability to identify mutated regions in gigapixel Whole Slide Image (WSI). To address this challenge, hereby we propose a carefully designed framework for WSI-based gene mutation prediction, which consists of three parts. (i) The first part of cancerous area segmentation, based on supervised learning, quickly filters out a large number of non-mutated regions; (ii) the second part of cancerous patch clustering, based on the representations derived from contrastive learning, ensures the comprehensiveness of patch selection; and (iii) the third part of mutation classification, based on the proposed hierarchical deep multi-instance learning method (HDMIL), ensures that sufficient patches are considered and inaccurate selections are ignored. In addition, benefiting from a two-stage attention mechanism in HDMIL, the patches that are highly correlated with gene mutations can be identified. This interpretability can help a pathologist to analyze the correlation between gene mutation and histopathological morphology. Experimental results demonstrate that the proposed gene mutation prediction framework significantly outperforms the state-of-the-art methods. In the TCGA bladder cancer dataset, five clinically relevant gene mutations are well predicted.
基因突变检测通常采用分子生物学方法进行,这种方法费用昂贵且周期长。相比之下,病理图像则无处不在。如果仅通过病理图像就可以预测具有临床意义的基因突变,将极大地促进基因突变检测在临床实践中的广泛应用。然而,目前基于病理图像的基因突变预测方法由于无法识别千兆像素全切片图像(WSI)中的突变区域而效果不佳。针对这一挑战,我们在此提出了一种精心设计的基于 WSI 的基因突变预测框架,该框架由三个部分组成。(i)基于监督学习的癌症区域分割的第一部分,快速过滤掉大量非突变区域;(ii)基于对比学习得到的表示的癌症斑块聚类的第二部分,确保了斑块选择的全面性;以及(iii)基于所提出的分层深度多实例学习方法(HDMIL)的突变分类的第三部分,确保了充分考虑了足够的斑块并且忽略了不准确的选择。此外,受益于 HDMIL 中的两阶段注意力机制,可以识别与基因突变高度相关的斑块。这种可解释性有助于病理学家分析基因突变与组织病理学形态之间的相关性。实验结果表明,所提出的基因突变预测框架明显优于最先进的方法。在 TCGA 膀胱癌数据集上,成功预测了五个具有临床意义的基因突变。