Bian Yueyan, Li Jin, Ye Chuyang, Jia Xiuqin, Yang Qi
Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing 100020, China.
Key Lab of Medical Engineering for Cardiovascular Disease, Ministry of Education, Beijing 100020, China.
Chin Med J (Engl). 2025 Mar 20;138(6):651-663. doi: 10.1097/CM9.0000000000003489. Epub 2025 Feb 26.
Artificial intelligence (AI), particularly deep learning, has demonstrated remarkable performance in medical imaging across a variety of modalities, including X-ray, computed tomography (CT), magnetic resonance imaging (MRI), ultrasound, positron emission tomography (PET), and pathological imaging. However, most existing state-of-the-art AI techniques are task-specific and focus on a limited range of imaging modalities. Compared to these task-specific models, emerging foundation models represent a significant milestone in AI development. These models can learn generalized representations of medical images and apply them to downstream tasks through zero-shot or few-shot fine-tuning. Foundation models have the potential to address the comprehensive and multifactorial challenges encountered in clinical practice. This article reviews the clinical applications of both task-specific and foundation models, highlighting their differences, complementarities, and clinical relevance. We also examine their future research directions and potential challenges. Unlike the replacement relationship seen between deep learning and traditional machine learning, task-specific and foundation models are complementary, despite inherent differences. While foundation models primarily focus on segmentation and classification, task-specific models are integrated into nearly all medical image analyses. However, with further advancements, foundation models could be applied to other clinical scenarios. In conclusion, all indications suggest that task-specific and foundation models, especially the latter, have the potential to drive breakthroughs in medical imaging, from image processing to clinical workflows.
人工智能(AI),尤其是深度学习,在包括X射线、计算机断层扫描(CT)、磁共振成像(MRI)、超声、正电子发射断层扫描(PET)和病理成像等多种模态的医学成像中表现出了卓越的性能。然而,大多数现有的最先进的AI技术都是特定任务的,并且专注于有限范围的成像模态。与这些特定任务模型相比,新兴的基础模型代表了AI发展中的一个重要里程碑。这些模型可以学习医学图像的通用表示,并通过零样本或少样本微调将其应用于下游任务。基础模型有潜力应对临床实践中遇到的全面和多因素挑战。本文回顾了特定任务模型和基础模型的临床应用,强调了它们的差异、互补性和临床相关性。我们还研究了它们未来的研究方向和潜在挑战。与深度学习和传统机器学习之间的替代关系不同,特定任务模型和基础模型尽管存在固有差异,但它们是互补的。虽然基础模型主要专注于分割和分类,但特定任务模型几乎被集成到所有医学图像分析中。然而,随着进一步的发展,基础模型可以应用于其他临床场景。总之,所有迹象表明,特定任务模型和基础模型,尤其是后者,有潜力推动医学成像从图像处理到临床工作流程的突破。