Ma DongAo, Pang Jiaxuan, Gotway Michael B, Liang Jianming
School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, USA.
Department of Radiology, Mayo Clinic, Phoenix, AZ, USA.
Nature. 2025 Jun 11. doi: 10.1038/s41586-025-09079-8.
Chest radiography frequently serves as baseline imaging for most lung diseases. Deep learning has great potential for automating the interpretation of chest radiography. However, existing chest radiographic deep learning models are limited in diagnostic scope, generalizability, adaptability, robustness and extensibility. To overcome these limitations, we have developed Ark, a foundation model applied to chest radiography and pretrained by cyclically accruing and reusing the knowledge from heterogeneous expert labels in numerous datasets. Ark excels in diagnosing thoracic diseases. It expands the diagnostic scope and addresses potential misdiagnosis. It can adapt to evolving diagnostic needs and respond to novel diseases. It can learn rare conditions from a few samples and transfer to new diagnostic settings without training. It tolerates data biases and long-tailed distributions, and it supports federated learning to preserve privacy. All codes and pretrained models have been released, so that Ark is open for fine-tuning, local adaptation and improvement. It is extensible to several modalities. Thus, it is a foundation model for medical imaging. The exceptional capabilities of Ark stem from our insight: aggregating various datasets diversifies the patient populations and accrues knowledge from many experts to yield unprecedented performance while reducing annotation costs. The development of Ark reveals that open models trained by accruing and reusing knowledge from heterogeneous expert annotations with a multitude of public (big or small) datasets can surpass the performance of proprietary models trained on large data. We hope that our findings will inspire more researchers to share code and datasets or federate privacy-preserving data to create open foundation models with diverse, global expertise and patient populations, thus accelerating open science and democratizing AI for medicine.
胸部X光检查常常作为大多数肺部疾病的基线成像方法。深度学习在实现胸部X光检查解释自动化方面具有巨大潜力。然而,现有的胸部X光深度学习模型在诊断范围、通用性、适应性、稳健性和可扩展性方面存在局限性。为了克服这些局限性,我们开发了Ark,这是一个应用于胸部X光检查的基础模型,通过循环积累和重用来自众多数据集中异构专家标签的知识进行预训练。Ark在诊断胸部疾病方面表现出色。它扩大了诊断范围并解决了潜在的误诊问题。它能够适应不断变化的诊断需求并应对新出现的疾病。它可以从少量样本中学习罕见病症,并在无需训练的情况下转移到新的诊断环境中。它能够容忍数据偏差和长尾分布,并且支持联邦学习以保护隐私。所有代码和预训练模型均已发布,因此Ark可供微调、本地适应和改进。它可扩展到多种模态。因此,它是医学成像的基础模型。Ark的卓越能力源于我们的洞察:聚合各种数据集可使患者群体多样化,并从众多专家那里积累知识,从而在降低注释成本的同时产生前所未有的性能。Ark的开发表明,通过从大量公共(无论大小)数据集中积累和重用来自异构专家注释的知识来训练的开放模型,可以超越在大数据上训练的专有模型的性能。我们希望我们的发现能激励更多研究人员分享代码和数据集,或者联合保护隐私的数据,以创建具有多样化、全球专业知识和患者群体的开放基础模型,从而加速开放科学并使医学人工智能民主化。