• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

冻结的大规模预训练视觉语言模型是多模态乳腺癌预测的有效基础骨干。

Frozen Large-Scale Pretrained Vision-Language Models are the Effective Foundational Backbone for Multimodal Breast Cancer Prediction.

作者信息

Vo Hung Q, Wang Lin, Wong Kelvin K, Ezeana Chika F, Yu Xiaohui, Yang Wei, Chang Jenny, Nguyen Hien V, Wong Stephen T C

出版信息

IEEE J Biomed Health Inform. 2025 May;29(5):3234-3246. doi: 10.1109/JBHI.2024.3507638. Epub 2025 May 6.

DOI:10.1109/JBHI.2024.3507638
PMID:40030302
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12145120/
Abstract

Breast cancer is a pervasive global health concern among women. Leveraging multimodal data from enterprise patient databases-including Picture Archiving and Communication Systems (PACS) and Electronic Health Records (EHRs)-holds promise for improving prediction. This study introduces a multimodal deep-learning model leveraging mammogram datasets to evaluate breast cancer prediction. Our approach integrates frozen large-scale pretrained vision-language models, showcasing superior performance and stability compared to traditional image-tabular models across two public breast cancer datasets. The model consistently outperforms conventional full fine-tuning methods by using frozen pretrained vision-language models alongside a lightweight trainable classifier. The observed improvements are significant. In the CBIS-DDSM dataset, the Area Under the Curve (AUC) increases from 0.867 to 0.902 during validation and from 0.803 to 0.830 for the official test set. Within the EMBED dataset, AUC improves from 0.780 to 0.805 during validation. In scenarios with limited data, using Breast Imaging-Reporting and Data System category three (BI-RADS 3) cases, AUC improves from 0.91 to 0.96 on the official CBIS-DDSM test set and from 0.79 to 0.83 on a challenging validation set. This study underscores the benefits of vision-language models in jointly training diverse image-clinical datasets from multiple healthcare institutions, effectively addressing challenges related to non-aligned tabular features. Combining training data enhances breast cancer prediction on the EMBED dataset, outperforming all other experiments. In summary, our research emphasizes the efficacy of frozen large-scale pretrained vision-language models in multimodal breast cancer prediction, offering superior performance and stability over conventional methods, reinforcing their potential for breast cancer prediction.

摘要

乳腺癌是全球女性普遍关注的健康问题。利用企业患者数据库中的多模态数据——包括图像存档与通信系统(PACS)和电子健康记录(EHR)——有望改善预测效果。本研究引入了一种利用乳房X光数据集的多模态深度学习模型来评估乳腺癌预测。我们的方法整合了冻结的大规模预训练视觉语言模型,与传统的图像表格模型相比,在两个公共乳腺癌数据集上展现出卓越的性能和稳定性。该模型通过将冻结的预训练视觉语言模型与轻量级可训练分类器一起使用,始终优于传统的完全微调方法。观察到的改进非常显著。在CBIS-DDSM数据集中,验证期间曲线下面积(AUC)从0.867增加到0.902,官方测试集从0.803增加到0.830。在EMBED数据集中,验证期间AUC从0.780提高到0.805。在数据有限的情况下,使用乳腺影像报告和数据系统3类(BI-RADS 3)病例,官方CBIS-DDSM测试集的AUC从0.91提高到0.96,在具有挑战性的验证集上从0.79提高到0.83。本研究强调了视觉语言模型在联合训练来自多个医疗机构的不同图像临床数据集方面的优势,有效解决了与未对齐表格特征相关的挑战。合并训练数据增强了EMBED数据集上的乳腺癌预测,优于所有其他实验。总之,我们的研究强调了冻结的大规模预训练视觉语言模型在多模态乳腺癌预测中的有效性,与传统方法相比具有卓越的性能和稳定性,增强了它们在乳腺癌预测中的潜力。

相似文献

1
Frozen Large-Scale Pretrained Vision-Language Models are the Effective Foundational Backbone for Multimodal Breast Cancer Prediction.冻结的大规模预训练视觉语言模型是多模态乳腺癌预测的有效基础骨干。
IEEE J Biomed Health Inform. 2025 May;29(5):3234-3246. doi: 10.1109/JBHI.2024.3507638. Epub 2025 May 6.
2
Integrative hybrid deep learning for enhanced breast cancer diagnosis: leveraging the Wisconsin Breast Cancer Database and the CBIS-DDSM dataset.基于威斯康星州乳腺癌数据库和 CBIS-DDSM 数据集的集成混合深度学习增强乳腺癌诊断
Sci Rep. 2024 Nov 1;14(1):26287. doi: 10.1038/s41598-024-74305-8.
3
A deep learning method for classifying mammographic breast density categories.一种用于对乳腺钼靶图像的乳房密度类别进行分类的深度学习方法。
Med Phys. 2018 Jan;45(1):314-321. doi: 10.1002/mp.12683. Epub 2017 Dec 22.
4
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
5
External validation of a publicly available computer assisted diagnostic tool for mammographic mass lesions with two high prevalence research datasets.使用两个高患病率研究数据集对一种公开可用的乳腺钼靶肿块病变计算机辅助诊断工具进行外部验证。
Med Phys. 2015 Aug;42(8):4987-96. doi: 10.1118/1.4927260.
6
Deep Learning to Distinguish Recalled but Benign Mammography Images in Breast Cancer Screening.深度学习在乳腺癌筛查中区分召回的良性乳腺 X 光图像。
Clin Cancer Res. 2018 Dec 1;24(23):5902-5909. doi: 10.1158/1078-0432.CCR-18-1115. Epub 2018 Oct 11.
7
Deep Learning to Improve Breast Cancer Detection on Screening Mammography.深度学习在提高筛查性乳房 X 光摄影乳腺癌检测中的应用。
Sci Rep. 2019 Aug 29;9(1):12495. doi: 10.1038/s41598-019-48995-4.
8
Deep learning empowered breast cancer diagnosis: Advancements in detection and classification.深度学习助力乳腺癌诊断:检测与分类技术的新进展。
PLoS One. 2024 Jul 11;19(7):e0304757. doi: 10.1371/journal.pone.0304757. eCollection 2024.
9
Large Scale Semi-Automated Labeling of Routine Free-Text Clinical Records for Deep Learning.大规模半自动化标注常规自由文本临床记录用于深度学习。
J Digit Imaging. 2019 Feb;32(1):30-37. doi: 10.1007/s10278-018-0105-8.
10
A Deep Learning-Based Decision Support Tool for Precision Risk Assessment of Breast Cancer.一种基于深度学习的乳腺癌精准风险评估决策支持工具。
JCO Clin Cancer Inform. 2019 May;3:1-12. doi: 10.1200/CCI.18.00121.

引用本文的文献

1
Vision-language foundation models for medical imaging: a review of current practices and innovations.用于医学成像的视觉语言基础模型:当前实践与创新综述
Biomed Eng Lett. 2025 Jun 6;15(5):809-830. doi: 10.1007/s13534-025-00484-6. eCollection 2025 Sep.
2
Clinical applications of large language models in medicine and surgery: A scoping review.大型语言模型在医学与外科中的临床应用:一项范围综述
J Int Med Res. 2025 Jul;53(7):3000605251347556. doi: 10.1177/03000605251347556. Epub 2025 Jul 4.

本文引用的文献

1
Biomedical Visual Instruction Tuning with Clinician Preference Alignment.基于临床医生偏好对齐的生物医学视觉指令微调
Adv Neural Inf Process Syst. 2024 Dec;37:96449-96467.
2
Domain generalization for mammographic image analysis with contrastive learning.基于对比学习的乳腺X线图像分析领域泛化
Comput Biol Med. 2025 Feb;185:109455. doi: 10.1016/j.compbiomed.2024.109455. Epub 2024 Dec 9.
3
Combining clinical and imaging data for predicting functional outcomes after acute ischemic stroke: an automated machine learning approach.
结合临床和影像学数据预测急性缺血性脑卒中后功能结局:一种自动化机器学习方法。
Sci Rep. 2023 Oct 7;13(1):16926. doi: 10.1038/s41598-023-44201-8.
4
Assessing Breast Cancer Risk by Combining AI for Lesion Detection and Mammographic Texture.利用人工智能进行病灶检测和乳腺纹理分析来评估乳腺癌风险。
Radiology. 2023 Aug;308(2):e230227. doi: 10.1148/radiol.230227.
5
Functional Outcome Prediction in Acute Ischemic Stroke Using a Fused Imaging and Clinical Deep Learning Model.使用融合影像与临床深度学习模型对急性缺血性脑卒中的功能预后进行预测。
Stroke. 2023 Sep;54(9):2316-2327. doi: 10.1161/STROKEAHA.123.044072. Epub 2023 Jul 24.
6
Attention-Based Deep Learning System for Classification of Breast Lesions-Multimodal, Weakly Supervised Approach.基于注意力的深度学习系统用于乳腺病变分类——多模态、弱监督方法
Cancers (Basel). 2023 May 10;15(10):2704. doi: 10.3390/cancers15102704.
7
Radiomics-informed Deep Curriculum Learning for Breast Cancer Diagnosis.基于影像组学的深度课程学习用于乳腺癌诊断
Med Image Comput Comput Assist Interv. 2021 Sep-Oct;12905:634-643. doi: 10.1007/978-3-030-87240-3_61. Epub 2021 Sep 21.
8
Automatic Calcification Morphology and Distribution Classification for Breast Mammograms With Multi-Task Graph Convolutional Neural Network.基于多任务图卷积神经网络的乳腺 X 线摄影自动钙化形态和分布分类。
IEEE J Biomed Health Inform. 2023 Aug;27(8):3782-3793. doi: 10.1109/JBHI.2023.3249404. Epub 2023 Aug 7.
9
Convolutional Feature Descriptor Selection for Mammogram Classification.用于乳腺X光图像分类的卷积特征描述符选择
IEEE J Biomed Health Inform. 2023 Mar;27(3):1467-1476. doi: 10.1109/JBHI.2022.3233535. Epub 2023 Mar 7.
10
Deep Neural Networks and Tabular Data: A Survey.深度神经网络与表格数据:一项综述。
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7499-7519. doi: 10.1109/TNNLS.2022.3229161. Epub 2024 Jun 3.