Hybrid Masked Image Modeling for 3D Medical Image Segmentation.

Publication information

IEEE J Biomed Health Inform. 2024 Apr;28(4):2115-2125. doi: 10.1109/JBHI.2024.3360239. Epub 2024 Apr 4.

DOI: 10.1109/JBHI.2024.3360239
PMID: 38289846
Abstract

Masked image modeling (MIM) with transformer backbones has recently been used as a powerful self-supervised pre-training technique. Existing MIM methods mask random patches of the image and reconstruct the missing pixels, which captures only low-level semantic information and leads to long pre-training times. This paper presents HybridMIM, a novel hybrid self-supervised learning method based on masked image modeling for 3D medical image segmentation. Specifically, we design a two-level masking hierarchy that specifies which patches in each sub-volume are masked and how, effectively imposing constraints from higher-level semantic information. We then learn the semantic information of medical images at three levels: 1) partial region prediction, which reconstructs key contents of the 3D image and largely reduces the pre-training time burden (pixel level); 2) patch-masking perception, which learns the spatial relationships between the patches in each sub-volume (region level); and 3) dropout-based contrastive learning between samples within a mini-batch, which further improves the generalization ability of the framework (sample level). The proposed framework supports both CNNs and transformers as encoder backbones and also enables pre-training of decoders for image segmentation. We conduct comprehensive experiments on five widely used public medical image segmentation datasets: BraTS2020, BTCV, MSD Liver, MSD Spleen, and BraTS2023. The experimental results show the clear superiority of HybridMIM over competing supervised methods, masked pre-training approaches, and other self-supervised methods in terms of quantitative metrics, speed, and qualitative observations.
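
The abstract does not give implementation details, so the following is only a minimal NumPy sketch of how two of the ideas above might look, not the authors' method. All names (`two_level_mask`, `dropout_contrastive_loss`, `sub_size`, `patch_size`, `drop_prob`, `temperature`) are hypothetical. It assumes the two masking levels mean "how many patches per sub-volume" and then "which patches", uses an InfoNCE-style loss as a stand-in for the unspecified contrastive objective, and applies dropout directly to feature vectors rather than inside an encoder.

```python
import numpy as np


def two_level_mask(volume_shape, sub_size, patch_size, rng):
    """Return a boolean mask (True = masked) and masked-patch counts per sub-volume."""
    grid = tuple(v // s for v, s in zip(volume_shape, sub_size))    # sub-volumes per axis
    per_sub = tuple(s // p for s, p in zip(sub_size, patch_size))   # patches per sub-volume axis
    n_patches = int(np.prod(per_sub))
    mask = np.zeros(volume_shape, dtype=bool)
    counts = np.zeros(grid, dtype=int)
    for idx in np.ndindex(*grid):
        # Level 1 (assumed): choose how many patches to mask in this sub-volume.
        k = int(rng.integers(0, n_patches + 1))
        counts[idx] = k
        # Level 2 (assumed): choose which patches are masked.
        for flat in rng.choice(n_patches, size=k, replace=False):
            p = np.unravel_index(flat, per_sub)
            start = [i * s + j * q for i, s, j, q in zip(idx, sub_size, p, patch_size)]
            region = tuple(slice(a, a + q) for a, q in zip(start, patch_size))
            mask[region] = True
    return mask, counts


def dropout_contrastive_loss(features, drop_prob, temperature, rng):
    """InfoNCE-style loss between two dropout-perturbed views of each sample."""
    def view(x):
        # Inverted dropout followed by L2 normalisation of each row.
        keep = rng.random(x.shape) >= drop_prob
        z = x * keep / (1.0 - drop_prob)
        return z / (np.linalg.norm(z, axis=1, keepdims=True) + 1e-8)

    a, b = view(features), view(features)
    logits = a @ b.T / temperature                  # (batch, batch) similarities
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Matching rows (same sample, different dropout mask) are the positives.
    return float(-np.mean(np.diag(log_prob)))


rng = np.random.default_rng(0)
mask, counts = two_level_mask((8, 8, 8), (4, 4, 4), (2, 2, 2), rng)
loss = dropout_contrastive_loss(rng.normal(size=(4, 16)), 0.1, 0.5, rng)
print(mask.shape, counts.shape, loss >= 0.0)
```

In this sketch, `counts` could serve as a region-level prediction target and `mask` as the pixel-level reconstruction target. Whether HybridMIM defines its targets this way is not stated in the abstract.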
