• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用多模态、多教师见解增强用于医学图像分割的高效深度学习模型。

Enhancing efficient deep learning models with multimodal, multi-teacher insights for medical image segmentation.

作者信息

Hossain Khondker Fariha, Kamran Sharif Amit, Ong Joshua, Tavakkoli Alireza

机构信息

Department of Computer Science and Engineering, University of Nevada, Reno, Reno, NV, 89557, USA.

Department of Ophthalmology and Visual Sciences, University of Michigan Kellogg Eye Center, Ann Arbor, MI, USA.

出版信息

Sci Rep. 2025 May 7;15(1):15948. doi: 10.1038/s41598-025-91430-0.

DOI:10.1038/s41598-025-91430-0
PMID:40335579
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12059033/
Abstract

The rapid evolution of deep learning has dramatically enhanced the field of medical image segmentation, leading to the development of models with unprecedented accuracy in analyzing complex medical images. Deep learning-based segmentation holds significant promise for advancing clinical care and enhancing the precision of medical interventions. However, these models' high computational demand and complexity present significant barriers to their application in resource-constrained clinical settings. To address this challenge, we introduce Teach-Former, a novel knowledge distillation (KD) framework that leverages a Transformer backbone to effectively condense the knowledge of multiple teacher models into a single, streamlined student model. Moreover, it excels in the contextual and spatial interpretation of relationships across multimodal images for more accurate and precise segmentation. Teach-Former stands out by harnessing multimodal inputs (CT, PET, MRI) and distilling the final predictions and the intermediate attention maps, ensuring a richer spatial and contextual knowledge transfer. Through this technique, the student model inherits the capacity for fine segmentation while operating with a significantly reduced parameter set and computational footprint. Additionally, introducing a novel training strategy optimizes knowledge transfer, ensuring the student model captures the intricate mapping of features essential for high-fidelity segmentation. The efficacy of Teach-Former has been effectively tested on two extensive multimodal datasets, HECKTOR21 and PI-CAI22, encompassing various image types. The results demonstrate that our KD strategy reduces the model complexity and surpasses existing state-of-the-art methods to achieve superior performance. The findings of this study indicate that the proposed methodology could facilitate efficient segmentation of complex multimodal medical images, supporting clinicians in achieving more precise diagnoses and comprehensive monitoring of pathological conditions ( https://github.com/FarihaHossain/TeachFormer ).

摘要

深度学习的快速发展极大地推动了医学图像分割领域的进步,促使在分析复杂医学图像方面出现了具有前所未有的准确性的模型。基于深度学习的分割在推进临床护理和提高医疗干预的精度方面具有巨大潜力。然而,这些模型对计算的高要求和复杂性给它们在资源受限的临床环境中的应用带来了重大障碍。为应对这一挑战,我们引入了Teach-Former,这是一种新颖的知识蒸馏(KD)框架,它利用Transformer主干有效地将多个教师模型的知识浓缩到一个单一的、简化的学生模型中。此外,它在跨多模态图像的关系的上下文和空间解释方面表现出色,以实现更准确和精确的分割。Teach-Former通过利用多模态输入(CT、PET、MRI)并蒸馏最终预测和中间注意力图而脱颖而出,确保更丰富的空间和上下文知识转移。通过这种技术,学生模型在以显著减少的参数集和计算量运行的同时,继承了精细分割的能力。此外,引入一种新颖的训练策略优化了知识转移,确保学生模型捕捉到高保真分割所需的复杂特征映射。Teach-Former的有效性已在两个广泛的多模态数据集HECKTOR21和PI-CAI22上进行了有效测试,这些数据集涵盖了各种图像类型。结果表明,我们的KD策略降低了模型复杂性,并超越了现有的最先进方法,以实现卓越的性能。这项研究的结果表明,所提出的方法可以促进复杂多模态医学图像的高效分割,支持临床医生实现更精确的诊断和对病理状况的全面监测(https://github.com/FarihaHossain/TeachFormer)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/1418c224e647/41598_2025_91430_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/9de648d35a12/41598_2025_91430_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/bda0e40e342e/41598_2025_91430_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/afb92d9332fa/41598_2025_91430_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/535c3eb392a7/41598_2025_91430_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/1418c224e647/41598_2025_91430_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/9de648d35a12/41598_2025_91430_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/bda0e40e342e/41598_2025_91430_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/afb92d9332fa/41598_2025_91430_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/535c3eb392a7/41598_2025_91430_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52a9/12059033/1418c224e647/41598_2025_91430_Fig5_HTML.jpg

相似文献

1
Enhancing efficient deep learning models with multimodal, multi-teacher insights for medical image segmentation.利用多模态、多教师见解增强用于医学图像分割的高效深度学习模型。
Sci Rep. 2025 May 7;15(1):15948. doi: 10.1038/s41598-025-91430-0.
2
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
3
Semi-supervised abdominal multi-organ segmentation by object-redrawing.通过对象重绘实现半监督腹部多器官分割
Med Phys. 2024 Nov;51(11):8334-8347. doi: 10.1002/mp.17364. Epub 2024 Aug 21.
4
Efficient Combination of CNN and Transformer for Dual-Teacher Uncertainty-guided Semi-supervised Medical Image Segmentation.基于 CNN 和 Transformer 的高效组合用于双教师不确定性引导的半监督医学图像分割。
Comput Methods Programs Biomed. 2022 Nov;226:107099. doi: 10.1016/j.cmpb.2022.107099. Epub 2022 Sep 2.
5
ScribSD+: Scribble-supervised medical image segmentation based on simultaneous multi-scale knowledge distillation and class-wise contrastive regularization.ScribSD+:基于同时多尺度知识蒸馏和类内对比正则化的涂鸦监督医学图像分割。
Comput Med Imaging Graph. 2024 Sep;116:102416. doi: 10.1016/j.compmedimag.2024.102416. Epub 2024 Jul 9.
6
Efficient Multi-Organ Segmentation From 3D Abdominal CT Images With Lightweight Network and Knowledge Distillation.基于轻量化网络和知识蒸馏的 3D 腹部 CT 图像高效多器官分割。
IEEE Trans Med Imaging. 2023 Sep;42(9):2513-2523. doi: 10.1109/TMI.2023.3262680. Epub 2023 Aug 31.
7
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
8
Parallel-way: Multi-modality-based brain tumor segmentation using parallel capsule network.并行方式:使用并行胶囊网络的基于多模态的脑肿瘤分割
Electromagn Biol Med. 2024 Oct;43(4):267-291. doi: 10.1080/15368378.2024.2390058. Epub 2024 Oct 29.
9
A single stage knowledge distillation network for brain tumor segmentation on limited MR image modalities.基于有限模态磁共振成像的脑肿瘤分割单阶段知识蒸馏网络。
Comput Methods Programs Biomed. 2023 Oct;240:107644. doi: 10.1016/j.cmpb.2023.107644. Epub 2023 Jun 4.
10
Dual-attention transformer-based hybrid network for multi-modal medical image segmentation.基于双注意力转换器的多模态医学图像分割混合网络。
Sci Rep. 2024 Oct 28;14(1):25704. doi: 10.1038/s41598-024-76234-y.

本文引用的文献

1
From CNN to Transformer: A Review of Medical Image Segmentation Models.从 CNN 到 Transformer:医学图像分割模型综述。
J Imaging Inform Med. 2024 Aug;37(4):1529-1547. doi: 10.1007/s10278-024-00981-7. Epub 2024 Mar 4.
2
Segment anything model for medical image segmentation: Current applications and future directions.用于医学图像分割的分割模型:当前应用与未来方向。
Comput Biol Med. 2024 Mar;171:108238. doi: 10.1016/j.compbiomed.2024.108238. Epub 2024 Feb 27.
3
Multi-modal medical Transformers: A meta-analysis for medical image segmentation in oncology.
多模态医学 Transformers:肿瘤医学图像分割的元分析。
Comput Med Imaging Graph. 2023 Dec;110:102308. doi: 10.1016/j.compmedimag.2023.102308. Epub 2023 Oct 26.
4
MSKD: Structured knowledge distillation for efficient medical image segmentation.MSKD:用于高效医学图像分割的结构化知识蒸馏。
Comput Biol Med. 2023 Sep;164:107284. doi: 10.1016/j.compbiomed.2023.107284. Epub 2023 Aug 2.
5
Momentum contrast transformer for COVID-19 diagnosis with knowledge distillation.基于知识蒸馏的用于COVID-19诊断的动量对比变压器
Pattern Recognit. 2023 Nov;143:109732. doi: 10.1016/j.patcog.2023.109732. Epub 2023 Jun 1.
6
Overview of the HECKTOR Challenge at MICCAI 2022: Automatic Head and Neck Tumor Segmentation and Outcome Prediction in PET/CT.2022年医学影像计算大会(MICCAI)HECKTOR挑战赛概述:PET/CT中头颈部肿瘤的自动分割与结果预测
Head Neck Tumor Chall (2022). 2023;13626:1-30. doi: 10.1007/978-3-031-27420-6_1. Epub 2023 Mar 18.
7
MISSU: 3D Medical Image Segmentation via Self-Distilling TransUNet.MISSU:基于自蒸馏 TransUNet 的 3D 医学图像分割。
IEEE Trans Med Imaging. 2023 Sep;42(9):2740-2750. doi: 10.1109/TMI.2023.3264433. Epub 2023 Aug 31.
8
MISSFormer: An Effective Transformer for 2D Medical Image Segmentation.MISSFormer:用于二维医学图像分割的有效 Transformer。
IEEE Trans Med Imaging. 2023 May;42(5):1484-1494. doi: 10.1109/TMI.2022.3230943. Epub 2023 May 2.
9
Self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation.基于知识蒸馏的胸部 X 射线诊断自进化视觉Transformer。
Nat Commun. 2022 Jul 4;13(1):3848. doi: 10.1038/s41467-022-31514-x.
10
Deep Learning-based Image Segmentation on Multimodal Medical Imaging.基于深度学习的多模态医学影像图像分割
IEEE Trans Radiat Plasma Med Sci. 2019 Mar;3(2):162-169. doi: 10.1109/trpms.2018.2890359. Epub 2019 Jan 1.