Department of Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai 200438, PR China.
Department of Nuclear Medicine, Fudan University Shanghai Cancer Center, Shanghai 201321, PR China.
Comput Methods Programs Biomed. 2024 Jun;251:108216. doi: 10.1016/j.cmpb.2024.108216. Epub 2024 May 11.
Accurate segmentation of the esophageal gross tumor volume (GTV) indirectly enhances the efficacy of radiotherapy for patients with esophageal cancer. In this domain, learning-based methods have been employed to fuse cross-modality positron emission tomography (PET) and computed tomography (CT) images, aiming to improve segmentation accuracy. This fusion is essential because it combines the functional metabolic information of PET with the anatomical information of CT, providing complementary information. While existing three-dimensional (3D) segmentation methods have achieved state-of-the-art (SOTA) performance, they typically rely on pure-convolution architectures, which limits their ability to capture long-range spatial dependencies because convolution is confined to a local receptive field. To address this limitation and further enhance esophageal GTV segmentation performance, this work proposes a transformer-guided cross-modality adaptive feature fusion network, referred to as TransAttPSNN, based on cross-modality PET/CT scans.
Specifically, we establish an attention progressive semantically-nested network (AttPSNN) by incorporating a convolutional attention mechanism into the progressive semantically-nested network (PSNN). Subsequently, we devise a plug-and-play transformer-guided cross-modality adaptive feature fusion module, which is inserted between the multi-scale feature counterparts of a two-stream AttPSNN backbone (one stream for the PET modality and the other for the CT modality), resulting in the proposed TransAttPSNN architecture.
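The abstract does not give the internals of the fusion module, but the core idea of transformer-guided cross-modality fusion can be sketched as scaled dot-product cross-attention between the two streams' feature maps: PET tokens act as queries and CT tokens supply keys and values, so each PET location attends to anatomically relevant CT locations. This is a minimal NumPy illustration of that general mechanism, not the paper's actual implementation; the shapes, the residual connection, and the single-head design are assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_modality_fusion(pet_feat, ct_feat):
    """Fuse PET and CT features with scaled dot-product cross-attention.

    pet_feat, ct_feat: arrays of shape (n_tokens, dim), e.g. flattened
    multi-scale feature maps from the two backbone streams (hypothetical
    shapes -- the paper's module may differ).
    """
    d = pet_feat.shape[-1]
    # PET tokens as queries, CT tokens as keys/values.
    attn = softmax(pet_feat @ ct_feat.T / np.sqrt(d), axis=-1)  # (n, n)
    fused = attn @ ct_feat                                      # (n, dim)
    # Residual connection keeps the original PET information.
    return pet_feat + fused

rng = np.random.default_rng(0)
pet = rng.normal(size=(16, 8))   # 16 spatial tokens, 8 channels
ct = rng.normal(size=(16, 8))
out = cross_modality_fusion(pet, ct)
print(out.shape)  # (16, 8)
```

In a two-stream architecture such a module would be applied at each scale, with a symmetric copy letting CT features attend to PET features, before the fused maps continue through the decoder.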
Through extensive four-fold cross-validation experiments on a clinical PET/CT cohort, the proposed approach achieves a Dice similarity coefficient (DSC) of 0.76 ± 0.13, a Hausdorff distance (HD) of 9.38 ± 8.76 mm, and a mean surface distance (MSD) of 1.13 ± 0.94 mm, outperforming the SOTA competing methods. The qualitative results show satisfying consistency with the lesion areas.
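For readers unfamiliar with the reported metrics, the DSC measures volumetric overlap between the predicted and ground-truth masks, while the HD measures the worst-case boundary disagreement. A minimal sketch of both (brute-force HD, adequate for small masks; the voxel `spacing` conversion to millimetres is an assumption, not a detail from the paper):

```python
import numpy as np

def dice(pred, gt):
    """Dice similarity coefficient between two binary masks."""
    inter = np.logical_and(pred, gt).sum()
    return 2.0 * inter / (pred.sum() + gt.sum())

def hausdorff(pred, gt, spacing=1.0):
    """Symmetric Hausdorff distance via exhaustive pairwise distances.

    spacing scales voxel indices to physical units (e.g. millimetres).
    """
    a = np.argwhere(pred) * spacing
    b = np.argwhere(gt) * spacing
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return max(d.min(axis=1).max(), d.min(axis=0).max())

gt = np.zeros((8, 8), bool); gt[2:6, 2:6] = True      # 4x4 square
pred = np.zeros((8, 8), bool); pred[3:7, 2:6] = True  # shifted one row
print(round(dice(pred, gt), 3))  # 0.75 (12 overlapping voxels of 16+16)
print(hausdorff(pred, gt))       # 1.0
```

The MSD is computed analogously to the HD but averages, rather than maximizes, the surface-to-surface distances.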
The devised transformer-guided cross-modality adaptive feature fusion module integrates the strengths of PET and CT, effectively enhancing the segmentation performance for the esophageal GTV. The proposed TransAttPSNN further advances research on esophageal GTV segmentation.