• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于医学图像分割的类别感知对抗变压器

Class-Aware Adversarial Transformers for Medical Image Segmentation.

作者信息

You Chenyu, Zhao Ruihan, Liu Fenglin, Dong Siyuan, Chinchali Sandeep, Topcu Ufuk, Staib Lawrence, Duncan James S

机构信息

Yale University.

UT Austin.

出版信息

Adv Neural Inf Process Syst. 2022 Dec;35:29582-29596.

PMID:37533756
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10395073/
Abstract

Transformers have made remarkable progress towards modeling long-range dependencies within the medical image analysis domain. However, current transformer-based models suffer from several disadvantages: (1) existing methods fail to capture the important features of the images due to the naive tokenization scheme; (2) the models suffer from information loss because they only consider single-scale feature representations; and (3) the segmentation label maps generated by the models are not accurate enough without considering rich semantic contexts and anatomical textures. In this work, we present CASTformer, a novel type of adversarial transformers, for 2D medical image segmentation. First, we take advantage of the pyramid structure to construct multi-scale representations and handle multi-scale variations. We then design a novel class-aware transformer module to better learn the discriminative regions of objects with semantic structures. Lastly, we utilize an adversarial training strategy that boosts segmentation accuracy and correspondingly allows a transformer-based discriminator to capture high-level semantically correlated contents and low-level anatomical features. Our experiments demonstrate that CASTformer dramatically outperforms previous state-of-the-art transformer-based approaches on three benchmarks, obtaining 2.54%-5.88% absolute improvements in Dice over previous models. Further qualitative experiments provide a more detailed picture of the model's inner workings, shed light on the challenges in improved transparency, and demonstrate that transfer learning can greatly improve performance and reduce the size of medical image datasets in training, making CASTformer a strong starting point for downstream medical image analysis tasks.

摘要

Transformer在医学图像分析领域对长距离依赖关系的建模方面取得了显著进展。然而,当前基于Transformer的模型存在几个缺点:(1)由于简单的tokenization方案,现有方法无法捕捉图像的重要特征;(2)模型存在信息损失,因为它们只考虑单尺度特征表示;(3)模型生成的分割标签图在没有考虑丰富语义上下文和解剖纹理的情况下不够准确。在这项工作中,我们提出了CASTformer,一种新型的对抗性Transformer,用于二维医学图像分割。首先,我们利用金字塔结构构建多尺度表示并处理多尺度变化。然后,我们设计了一个新颖的类感知Transformer模块,以更好地学习具有语义结构的对象的判别区域。最后,我们采用对抗训练策略,提高分割精度,并相应地允许基于Transformer的判别器捕捉高级语义相关内容和低级解剖特征。我们的实验表明,CASTformer在三个基准测试中显著优于以前基于Transformer的最先进方法,在Dice上比以前的模型获得了2.54%-5.88%的绝对提升。进一步的定性实验更详细地展示了模型的内部工作原理,揭示了提高透明度方面的挑战,并表明迁移学习可以大大提高性能并减少训练中医学图像数据集的大小,使CASTformer成为下游医学图像分析任务的一个强大起点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/70059602e545/nihms-1912996-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/3da279eb19e6/nihms-1912996-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/022a8bd64d52/nihms-1912996-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/85cc99e3d4a2/nihms-1912996-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/70059602e545/nihms-1912996-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/3da279eb19e6/nihms-1912996-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/022a8bd64d52/nihms-1912996-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/85cc99e3d4a2/nihms-1912996-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccd3/10395073/70059602e545/nihms-1912996-f0004.jpg

相似文献

1
Class-Aware Adversarial Transformers for Medical Image Segmentation.用于医学图像分割的类别感知对抗变压器
Adv Neural Inf Process Syst. 2022 Dec;35:29582-29596.
2
Transformer guided self-adaptive network for multi-scale skin lesion image segmentation.Transformer 引导的自适网络用于多尺度皮肤病变图像分割。
Comput Biol Med. 2024 Feb;169:107846. doi: 10.1016/j.compbiomed.2023.107846. Epub 2023 Dec 23.
3
TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.TGDAUNet:基于 Transformer 和 GCNN 的双分支注意力 U-Net 用于医学图像分割。
Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.
4
LM-Net: A light-weight and multi-scale network for medical image segmentation.LM-Net:用于医学图像分割的轻量级多尺度网络。
Comput Biol Med. 2024 Jan;168:107717. doi: 10.1016/j.compbiomed.2023.107717. Epub 2023 Nov 23.
5
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
6
Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation.基于转换器和卷积神经网络的协作网络是精确的 3D 医学图像分割的强大且多功能的学习者。
Comput Biol Med. 2023 Sep;164:107228. doi: 10.1016/j.compbiomed.2023.107228. Epub 2023 Jul 5.
7
TransConver: transformer and convolution parallel network for developing automatic brain tumor segmentation in MRI images.TransConver:用于在MRI图像中开发自动脑肿瘤分割的变压器与卷积并行网络。
Quant Imaging Med Surg. 2022 Apr;12(4):2397-2415. doi: 10.21037/qims-21-919.
8
BiU-net: A dual-branch structure based on two-stage fusion strategy for biomedical image segmentation.BiU-net:一种基于两阶段融合策略的双分支结构,用于生物医学图像分割。
Comput Methods Programs Biomed. 2024 Jul;252:108235. doi: 10.1016/j.cmpb.2024.108235. Epub 2024 May 18.
9
MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation.MS-TCNet:一种基于多尺度特征学习的有效的 Transformer-CNN 组合网络,用于 3D 医学图像分割。
Comput Biol Med. 2024 Mar;170:108057. doi: 10.1016/j.compbiomed.2024.108057. Epub 2024 Jan 28.
10
PFD-Net: Pyramid Fourier Deformable Network for medical image segmentation.PFD-Net:用于医学图像分割的金字塔傅里叶可变形网络。
Comput Biol Med. 2024 Apr;172:108302. doi: 10.1016/j.compbiomed.2024.108302. Epub 2024 Mar 16.

引用本文的文献

1
Comparative analysis of supervised and self-supervised learning with small and imbalanced medical imaging datasets.使用小型和不平衡医学影像数据集对监督学习和自监督学习进行比较分析。
Sci Rep. 2025 Sep 2;15(1):32345. doi: 10.1038/s41598-025-99000-0.
2
A highly generalized federated learning algorithm for brain tumor segmentation.一种用于脑肿瘤分割的高度通用的联邦学习算法。
Sci Rep. 2025 Jul 1;15(1):21053. doi: 10.1038/s41598-025-05297-2.
3
Diffusion-driven distillation and contrastive learning for class-incremental semantic segmentation of laparoscopic images.

本文引用的文献

1
Mine Your Own Anatomy: Revisiting Medical Image Segmentation With Extremely Limited Labels.挖掘自身解剖结构:利用极其有限的标签重新审视医学图像分割
IEEE Trans Pattern Anal Mach Intell. 2024 Sep 13;PP. doi: 10.1109/TPAMI.2024.3461321.
2
Medical image registration via neural fields.基于神经场的医学图像配准。
Med Image Anal. 2024 Oct;97:103249. doi: 10.1016/j.media.2024.103249. Epub 2024 Jun 27.
3
A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises.
用于腹腔镜图像类增量语义分割的扩散驱动蒸馏与对比学习
Int J Comput Assist Radiol Surg. 2025 Jul;20(7):1551-1560. doi: 10.1007/s11548-025-03405-1. Epub 2025 Jun 14.
4
Improved SwinUNet with fusion transformer and large kernel convolutional attention for liver and tumor segmentation in CT images.基于融合Transformer和大核卷积注意力机制的改进SwinUNet用于CT图像中的肝脏和肿瘤分割
Sci Rep. 2025 Apr 24;15(1):14286. doi: 10.1038/s41598-025-98938-5.
5
Exploration of CT-based discrimination and diagnosis of various pathological types of ground glass nodules in the lungs.基于CT的肺磨玻璃结节不同病理类型的鉴别与诊断探索
BMC Med Imaging. 2025 Apr 14;25(1):119. doi: 10.1186/s12880-025-01653-w.
6
Structured hashing with deep learning for modality, organ, and disease content sensitive medical image retrieval.用于模态、器官和疾病内容敏感医学图像检索的深度学习结构化哈希
Sci Rep. 2025 Mar 14;15(1):8912. doi: 10.1038/s41598-025-93418-2.
7
Closed loop automated drug infusion regulation based on optimal 2-DOF TID control approach for the mean arterial blood pressure.基于用于平均动脉血压的最优二自由度TID控制方法的闭环自动药物输注调节。
Med Biol Eng Comput. 2025 Jul;63(7):2069-2089. doi: 10.1007/s11517-025-03313-1. Epub 2025 Feb 10.
8
A deep ensemble learning framework for glioma segmentation and grading prediction.一种用于脑胶质瘤分割和分级预测的深度集成学习框架。
Sci Rep. 2025 Feb 6;15(1):4448. doi: 10.1038/s41598-025-87127-z.
9
Robust thoracic CT image registration with environmental adaptability using dynamic Welsch's function and hierarchical structure-awareness strategy.使用动态韦尔施函数和层次结构感知策略实现具有环境适应性的稳健胸部CT图像配准
Quant Imaging Med Surg. 2024 Dec 5;14(12):8999-9020. doi: 10.21037/qims-24-596. Epub 2024 Nov 29.
10
Enhanced Cross-stage-attention U-Net for esophageal target volume segmentation.用于食管靶区体积分割的增强跨阶段注意力U型网络
BMC Med Imaging. 2024 Dec 18;24(1):339. doi: 10.1186/s12880-024-01515-x.
医学成像中的深度学习综述:成像特征、技术趋势、具有进展亮点的案例研究及未来展望。
Proc IEEE Inst Electr Electron Eng. 2021 May;109(5):820-838. doi: 10.1109/JPROC.2021.3054390. Epub 2021 Feb 26.
4
Momentum Contrastive Voxel-wise Representation Learning for Semi-supervised Volumetric Medical Image Segmentation.用于半监督体医学图像分割的动量对比体素级表示学习
Med Image Comput Comput Assist Interv. 2022 Sep;13434:639-652. doi: 10.1007/978-3-031-16440-8_61. Epub 2022 Sep 16.
5
Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI Segmentation.增量学习与迁移学习相结合:在多站点前列腺MRI分割中的应用
Distrib Collab Fed Learn Afford AI Healthc Resour Div Glob Health (2022). 2022 Sep;13573:3-16. doi: 10.1007/978-3-031-18523-6_1. Epub 2022 Oct 7.
6
Bootstrapping Semi-supervised Medical Image Segmentation with Anatomical-Aware Contrastive Distillation.基于解剖感知对比蒸馏的自训练半监督医学图像分割
Inf Process Med Imaging. 2023 Jun;13939:641-653. doi: 10.1007/978-3-031-34048-2_49. Epub 2023 Jun 8.
7
Joint liver and hepatic lesion segmentation in MRI using a hybrid CNN with transformer layers.基于融合 Transformer 层的卷积神经网络的 MRI 中肝脏和肝脏病变的联合分割。
Comput Methods Programs Biomed. 2023 Oct;240:107647. doi: 10.1016/j.cmpb.2023.107647. Epub 2023 Jun 7.
8
ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis.ResViT:用于多模态医学图像合成的残差视觉转换器。
IEEE Trans Med Imaging. 2022 Oct;41(10):2598-2614. doi: 10.1109/TMI.2022.3167808. Epub 2022 Sep 30.
9
SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation.SimCVD:用于半监督医学图像分割的简单对比体素级表示提取。
IEEE Trans Med Imaging. 2022 Sep;41(9):2228-2237. doi: 10.1109/TMI.2022.3161829. Epub 2022 Aug 31.
10
Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial Transformers.基于零样本学习对抗 Transformer 的无监督 MRI 重建。
IEEE Trans Med Imaging. 2022 Jul;41(7):1747-1763. doi: 10.1109/TMI.2022.3147426. Epub 2022 Jun 30.