IEEE Trans Med Imaging. 2021 May;40(5):1474-1483. doi: 10.1109/TMI.2021.3057635. Epub 2021 Apr 30.
The combination of multimodal imaging and genomics provides a more comprehensive way for the study of mental illnesses and brain functions. Deep network-based data fusion models have been developed to capture their complex associations, resulting in improved diagnosis of diseases. However, deep learning models are often difficult to interpret, bringing about challenges for uncovering biological mechanisms using these models. In this work, we develop an interpretable multimodal fusion model to perform automated diagnosis and result interpretation simultaneously. We name it Grad-CAM guided convolutional collaborative learning (gCAM-CCL), which is achieved by combining intermediate feature maps with gradient-based weights. The gCAM-CCL model can generate interpretable activation maps to quantify pixel-level contributions of the input features. Moreover, the estimated activation maps are class-specific, which can therefore facilitate the identification of biomarkers underlying different groups. We validate the gCAM-CCL model on a brain imaging-genetic study, and demonstrate its applications to both the classification of cognitive function groups and the discovery of underlying biological mechanisms. Specifically, our analysis results suggest that during task-fMRI scans, several object recognition related regions of interests (ROIs) are activated followed by several downstream encoding ROIs. In addition, the high cognitive group may have stronger neurotransmission signaling while the low cognitive group may have problems in brain/neuron development due to genetic variations.
多模态成像和基因组学的结合为研究精神疾病和大脑功能提供了更全面的方法。已经开发了基于深度网络的数据融合模型来捕捉它们的复杂关联,从而提高疾病的诊断能力。然而,深度学习模型通常难以解释,这给利用这些模型揭示生物机制带来了挑战。在这项工作中,我们开发了一种可解释的多模态融合模型,以同时进行自动诊断和结果解释。我们将其命名为基于梯度的卷积协同学习的梯度激活映射引导(Grad-CAM guided convolutional collaborative learning,gCAM-CCL),它通过将中间特征映射与基于梯度的权重相结合来实现。gCAM-CCL 模型可以生成可解释的激活图,以量化输入特征的像素级贡献。此外,估计的激活图是特定于类别的,因此可以方便地识别不同组的潜在生物标志物。我们在脑成像-遗传研究中验证了 gCAM-CCL 模型,并展示了其在认知功能组分类和潜在生物学机制发现方面的应用。具体来说,我们的分析结果表明,在任务 fMRI 扫描期间,几个与物体识别相关的感兴趣区域(ROI)被激活,随后是几个下游编码 ROI。此外,高认知组可能具有更强的神经传递信号,而低认知组可能由于遗传变异而在大脑/神经元发育方面存在问题。