Ahmad Ibtihaj, Anwar Sadia Jabbar, Hussain Bagh, Ur Rehman Atiq, Bermak Amine
Northwestern Polytechnical University, Xi'an, 710072, Shaanxi, People's Republic of China.
School of Public Health, Shandong University, Jinan, Shandong, People's Republic of China.
Sci Rep. 2025 Apr 9;15(1):12153. doi: 10.1038/s41598-025-95757-6.
Segmentation in computed tomography (CT) provides detailed anatomical information, while positron emission tomography (PET) provide the metabolic activity of cancer. Existing segmentation models in CT and PET either rely on early fusion, which struggles to effectively capture independent features from each modality, or late fusion, which is computationally expensive and fails to leverage the complementary nature of the two modalities. This research addresses the gap by proposing an intermediate fusion approach that optimally balances the strengths of both modalities. Our method leverages anatomical features to guide the fusion process while preserving spatial representation quality. We achieve this through the separate encoding of anatomical and metabolic features followed by an attentive fusion decoder. Unlike traditional fixed normalization techniques, we introduce novel "zero layers" with learnable normalization. The proposed intermediate fusion reduces the number of filters, resulting in a lightweight model. Our approach demonstrates superior performance, achieving a dice score of 0.8184 and an [Formula: see text] score of 2.31. The implications of this study include more precise tumor delineation, leading to enhanced cancer diagnosis and more effective treatment planning.
计算机断层扫描(CT)中的分割提供详细的解剖信息,而正电子发射断层扫描(PET)提供癌症的代谢活性。CT和PET中现有的分割模型要么依赖早期融合,这种方法难以有效捕捉每种模态的独立特征,要么依赖晚期融合,这种方法计算成本高且无法利用两种模态的互补特性。本研究通过提出一种中间融合方法来解决这一差距,该方法能最佳地平衡两种模态的优势。我们的方法利用解剖特征来指导融合过程,同时保持空间表征质量。我们通过对解剖和代谢特征进行单独编码,然后使用注意力融合解码器来实现这一点。与传统的固定归一化技术不同,我们引入了具有可学习归一化的新型“零层”。所提出的中间融合减少了滤波器数量,从而得到一个轻量级模型。我们的方法表现出卓越的性能,骰子系数得分为0.8184,[公式:见原文]得分为2.31。本研究的意义包括更精确的肿瘤描绘,从而实现更准确的癌症诊断和更有效的治疗规划。