Zhang Shaoting, Metaxas Dimitris
University of Electronic Science and Technology of China, Chengdu, Sichuan, China; Shanghai Artificial Intelligence Laboratory, Shanghai, China.
Rutgers University, New Brunswick, NJ, USA.
Med Image Anal. 2024 Jan;91:102996. doi: 10.1016/j.media.2023.102996. Epub 2023 Oct 12.
This article discusses the opportunities, applications and future directions of large-scale pretrained models, i.e., foundation models, which promise to significantly improve the analysis of medical images. Medical foundation models have immense potential in solving a wide range of downstream tasks, as they can help to accelerate the development of accurate and robust models, reduce the dependence on large amounts of labeled data, preserve the privacy and confidentiality of patient data. Specifically, we illustrate the "spectrum" of medical foundation models, ranging from general imaging models, modality-specific models, to organ/task-specific models, and highlight their challenges, opportunities and applications. We also discuss how foundation models can be leveraged in downstream medical tasks to enhance the accuracy and efficiency of medical image analysis, leading to more precise diagnosis and treatment decisions.
本文讨论了大规模预训练模型,即基础模型的机遇、应用和未来发展方向,这些模型有望显著改善医学图像分析。医学基础模型在解决广泛的下游任务方面具有巨大潜力,因为它们有助于加速准确且稳健模型的开发,减少对大量标注数据的依赖,保护患者数据的隐私和保密性。具体而言,我们阐述了医学基础模型的“谱系”,从通用成像模型、特定模态模型到特定器官/任务模型,并强调了它们面临的挑战、机遇和应用。我们还讨论了如何在下游医学任务中利用基础模型来提高医学图像分析的准确性和效率,从而做出更精确的诊断和治疗决策。