School of Computer Science and Technology, Heilongjiang University, Harbin, People's Republic of China.
Department of Computer Science, Shantou University, Shantou, People's Republic of China.
Phys Med Biol. 2024 Mar 14;69(7). doi: 10.1088/1361-6560/ad294c.
The accurate automatic segmentation of tumors from computed tomography (CT) volumes facilitates early diagnosis and treatment of patients. A significant challenge in tumor segmentation is the integration of the spatial correlations among multiple parts of a CT volume and the context relationship across multiple channels.We proposed a mutually enhanced multi-view information model (MEMI) to propagate and fuse the spatial correlations and the context relationship and then apply it to lung tumor CT segmentation. First, a feature map was obtained from segmentation backbone encoder, which contained many image region nodes. An attention mechanism from the region node perspective was presented to determine the impact of all the other nodes on a specific node and enhance the node attribute embedding. A gated convolution-based strategy was also designed to integrate the enhanced attributes and the original node features. Second, transformer across multiple channels was constructed to integrate the channel context relationship. Finally, since the encoded node attributes from the gated convolution view and those from the channel transformer view were complementary, an interaction attention mechanism was proposed to propagate the mutual information among the multiple views.The segmentation performance was evaluated on both public lung tumor dataset and private dataset collected from a hospital. The experimental results demonstrated that MEMI was superior to other compared segmentation methods. Ablation studies showed the contributions of node correlation learning, channel context relationship learning, and mutual information interaction across multiple views to the improved segmentation performance. Utilizing MEMI on multiple segmentation backbones also demonstrated MEMI's generalization ability.Our model improved the lung tumor segmentation performance by learning the correlations among multiple region nodes, integrating the channel context relationship, and mutual information enhancement from multiple views.
从计算机断层扫描 (CT) 容积中准确自动分割肿瘤有助于对患者进行早期诊断和治疗。肿瘤分割的一个重大挑战是整合 CT 容积中多个部分的空间相关性和多个通道之间的上下文关系。我们提出了一种相互增强的多视图信息模型 (MEMI),以传播和融合空间相关性和上下文关系,然后将其应用于肺肿瘤 CT 分割。首先,从分割骨干编码器获得特征图,其中包含许多图像区域节点。从区域节点的角度提出了一种注意力机制,以确定所有其他节点对特定节点的影响,并增强节点属性嵌入。还设计了基于门控卷积的策略来整合增强的属性和原始节点特征。其次,构建了跨多个通道的变压器,以整合通道上下文关系。最后,由于门控卷积视图中编码的节点属性和通道变压器视图中的编码节点属性是互补的,因此提出了交互注意力机制来传播多个视图之间的相互信息。在公共肺肿瘤数据集和医院收集的私有数据集上评估了分割性能。实验结果表明,MEMI 优于其他比较分割方法。消融研究表明,节点相关性学习、通道上下文关系学习和多视图之间的互信息交互对改进的分割性能有贡献。在多个分割骨干上利用 MEMI 还证明了 MEMI 的泛化能力。我们的模型通过学习多个区域节点之间的相关性、整合通道上下文关系以及来自多个视图的互信息增强,提高了肺肿瘤分割性能。