Pai Suraj, Bontempi Dennis, Prudente Vasco, Hadzic Ibrahim, Sokač Mateo, Chaunzwa Tafadzwa L, Bernatz Simon, Hosny Ahmed, Mak Raymond H, Birkbak Nicolai J, Aerts Hugo Jwl
Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Harvard Institutes of Medicine, 77 Avenue Louis Pasteur, Boston, MA 02115, United States of America.
Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Universiteitssingel 40, 6229 ER Maastricht, The Netherlands.
medRxiv. 2023 Sep 5:2023.09.04.23294952. doi: 10.1101/2023.09.04.23294952.
Foundation models represent a recent paradigm shift in deep learning, where a single large-scale model trained on vast amounts of data can serve as the foundation for various downstream tasks. Foundation models are generally trained using self-supervised learning and excel in reducing the demand for training samples in downstream applications. This is especially important in medicine, where large labeled datasets are often scarce. Here, we developed a foundation model for imaging biomarker discovery by training a convolutional encoder through self-supervised learning using a comprehensive dataset of 11,467 radiographic lesions. The foundation model was evaluated in distinct and clinically relevant applications of imaging-based biomarkers. We found that they facilitated better and more efficient learning of imaging biomarkers and yielded task-specific models that significantly outperformed their conventional supervised counterparts on downstream tasks. The performance gain was most prominent when training dataset sizes were very limited. Furthermore, foundation models were more stable to input and inter-reader variations and showed stronger associations with underlying biology. Our results demonstrate the tremendous potential of foundation models in discovering novel imaging biomarkers that may extend to other clinical use cases and can accelerate the widespread translation of imaging biomarkers into clinical settings.
基础模型代表了深度学习中最近的一种范式转变,即一个在大量数据上训练的单一大规模模型可以作为各种下游任务的基础。基础模型通常使用自监督学习进行训练,并且在减少下游应用中对训练样本的需求方面表现出色。这在医学领域尤为重要,因为大型标记数据集往往很稀缺。在此,我们通过使用包含11467个放射学病变的综合数据集进行自监督学习来训练卷积编码器,从而开发了一种用于成像生物标志物发现的基础模型。该基础模型在基于成像的生物标志物的不同且与临床相关的应用中进行了评估。我们发现,它们有助于更好、更高效地学习成像生物标志物,并产生了特定任务的模型,这些模型在下游任务上显著优于传统的监督模型。当训练数据集规模非常有限时,性能提升最为显著。此外,基础模型对输入和阅片者间的差异更具稳定性,并且与潜在生物学表现出更强的关联。我们的结果证明了基础模型在发现新型成像生物标志物方面的巨大潜力,这些生物标志物可能扩展到其他临床用例,并可加速成像生物标志物在临床环境中的广泛转化。