• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新思考医学图像表示的掩模图像建模。

Rethinking masked image modelling for medical image representation.

机构信息

University of Adelaide, Australia.

RIKEN AIP, Japan; RCAST, The University of Tokyo, Japan.

出版信息

Med Image Anal. 2024 Dec;98:103304. doi: 10.1016/j.media.2024.103304. Epub 2024 Aug 17.

DOI:10.1016/j.media.2024.103304
PMID:39173412
Abstract

Masked Image Modelling (MIM), a form of self-supervised learning, has garnered significant success in computer vision by improving image representations using unannotated data. Traditional MIMs typically employ a strategy of random sampling across the image. However, this random masking technique may not be ideally suited for medical imaging, which possesses distinct characteristics divergent from natural images. In medical imaging, particularly in pathology, disease-related features are often exceedingly sparse and localized, while the remaining regions appear normal and undifferentiated. Additionally, medical images frequently accompany reports, directly pinpointing pathological changes' location. Inspired by this, we propose Masked medical Image Modelling (MedIM), a novel approach, to our knowledge, the first research that employs radiological reports to guide the masking and restore the informative areas of images, encouraging the network to explore the stronger semantic representations from medical images. We introduce two mutual comprehensive masking strategies, knowledge-driven masking (KDM), and sentence-driven masking (SDM). KDM uses Medical Subject Headings (MeSH) words unique to radiology reports to identify symptom clues mapped to MeSH words (e.g., cardiac, edema, vascular, pulmonary) and guide the mask generation. Recognizing that radiological reports often comprise several sentences detailing varied findings, SDM integrates sentence-level information to identify key regions for masking. MedIM reconstructs images informed by this masking from the KDM and SDM modules, promoting a comprehensive and enriched medical image representation. Our extensive experiments on seven downstream tasks covering multi-label/class image classification, pneumothorax segmentation, and medical image-report analysis, demonstrate that MedIM with report-guided masking achieves competitive performance. Our method substantially outperforms ImageNet pre-training, MIM-based pre-training, and medical image-report pre-training counterparts. Codes are available at https://github.com/YtongXie/MedIM.

摘要

掩模图像建模 (MIM) 是一种自监督学习形式,通过使用未注释的数据改进图像表示,在计算机视觉中取得了重大成功。传统的 MIM 通常采用在图像上随机采样的策略。然而,这种随机掩蔽技术对于医学成像来说可能并不理想,因为医学成像具有与自然图像不同的独特特征。在医学成像中,特别是在病理学中,与疾病相关的特征通常非常稀疏和局部化,而其余区域则显得正常且无差别。此外,医学图像通常附有报告,直接指出病理变化的位置。受此启发,我们提出了掩模医学图像建模 (MedIM),这是一种新颖的方法,据我们所知,这是首次利用放射学报告来指导掩蔽和恢复图像的信息区域的研究,鼓励网络从医学图像中探索更强的语义表示。我们引入了两种相互综合的掩蔽策略,知识驱动掩蔽 (KDM) 和句子驱动掩蔽 (SDM)。KDM 使用放射学报告中特有的医学主题词 (MeSH) 词来识别映射到 MeSH 词的症状线索 (例如,心脏、水肿、血管、肺部),并指导掩蔽生成。认识到放射学报告通常包含几个句子,详细描述不同的发现,SDM 整合句子级别的信息来识别掩蔽的关键区域。MedIM 根据 KDM 和 SDM 模块的掩蔽信息重建图像,促进全面而丰富的医学图像表示。我们在七个下游任务上进行了广泛的实验,涵盖了多标签/类图像分类、气胸分割和医学图像报告分析,结果表明,具有报告指导掩蔽的 MedIM 取得了有竞争力的性能。我们的方法在图像分类、气胸分割和医学图像报告分析等七个下游任务上取得了优异的结果,在大多数任务上都显著优于 ImageNet 预训练、MIM 预训练和医学图像报告预训练方法。代码可在 https://github.com/YtongXie/MedIM 上获得。

相似文献

1
Rethinking masked image modelling for medical image representation.重新思考医学图像表示的掩模图像建模。
Med Image Anal. 2024 Dec;98:103304. doi: 10.1016/j.media.2024.103304. Epub 2024 Aug 17.
2
GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling.GMIM:基于自适应和分层掩模图像建模的 3D 医学图像分割自监督预训练。
Comput Biol Med. 2024 Jun;176:108547. doi: 10.1016/j.compbiomed.2024.108547. Epub 2024 May 6.
3
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
4
Self-Supervised Medical Image Segmentation Using Deep Reinforced Adaptive Masking.使用深度强化自适应掩码的自监督医学图像分割
IEEE Trans Med Imaging. 2025 Jan;44(1):180-193. doi: 10.1109/TMI.2024.3436608. Epub 2025 Jan 2.
5
Cellular data extraction from multiplexed brain imaging data using self-supervised Dual-loss Adaptive Masked Autoencoder.使用自监督双损失自适应掩蔽自动编码器从多路复用脑成像数据中提取细胞数据。
Artif Intell Med. 2024 May;151:102828. doi: 10.1016/j.artmed.2024.102828. Epub 2024 Mar 15.
6
Masked hypergraph learning for weakly supervised histopathology whole slide image classification.基于掩蔽超图学习的弱监督病理切片图像分类。
Comput Methods Programs Biomed. 2024 Aug;253:108237. doi: 10.1016/j.cmpb.2024.108237. Epub 2024 May 23.
7
Efficient Masked Autoencoders With Self-Consistency.具有自一致性的高效掩码自动编码器
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8743-8757. doi: 10.1109/TPAMI.2024.3409826. Epub 2024 Nov 6.
8
Image-level supervision and self-training for transformer-based cross-modality tumor segmentation.基于 Transformer 的跨模态肿瘤分割的图像级监督和自训练。
Med Image Anal. 2024 Oct;97:103287. doi: 10.1016/j.media.2024.103287. Epub 2024 Jul 31.
9
Foundation models in gastrointestinal endoscopic AI: Impact of architecture, pre-training approach and data efficiency.胃肠道内镜 AI 中的基础模型:架构、预训练方法和数据效率的影响。
Med Image Anal. 2024 Dec;98:103298. doi: 10.1016/j.media.2024.103298. Epub 2024 Aug 12.
10
URCA: Uncertainty-based region clipping algorithm for semi-supervised medical image segmentation.基于不确定性的区域裁剪算法在半监督医学图像分割中的应用。
Comput Methods Programs Biomed. 2024 Sep;254:108278. doi: 10.1016/j.cmpb.2024.108278. Epub 2024 Jun 11.