Meijerink Lotta M, Dunias Zoë S, Leeuwenberg Artuur M, de Hond Anne A H, Jenkins David A, Martin Glen P, Sperrin Matthew, Peek Niels, Spijker René, Hooft Lotty, Moons Karel G M, van Smeden Maarten, Schuit Ewoud
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
J Clin Epidemiol. 2025 Feb;178:111636. doi: 10.1016/j.jclinepi.2024.111636. Epub 2024 Dec 9.
To give an overview of methods for updating artificial intelligence (AI)-based clinical prediction models based on new data.
We comprehensively searched Scopus and Embase up to August 2022 for articles that addressed developments, descriptions, or evaluations of prediction model updating methods. We specifically focused on articles in the medical domain involving AI-based prediction models that were updated based on new data, excluding regression-based updating methods as these have been extensively discussed elsewhere. We categorized and described the identified methods used to update the AI-based prediction model as well as the use cases in which they were used.
We included 78 articles. The majority of the included articles discussed updating for neural network methods (93.6%) with medical images as input data (65.4%). In many articles (51.3%) existing, pretrained models for broad tasks were updated to perform specialized clinical tasks. Other common reasons for model updating were to address changes in the data over time and cross-center differences; however, more unique use cases were also identified, such as updating a model from a broad population to a specific individual. We categorized the identified model updating methods into four categories: neural network-specific methods (described in 92.3% of the articles), ensemble-specific methods (2.5%), model-agnostic methods (9.0%), and other (1.3%). Variations of neural network-specific methods are further categorized based on the following: (1) the part of the original neural network that is kept, (2) whether and how the original neural network is extended with new parameters, and (3) to what extent the original neural network parameters are adjusted to the new data. The most frequently occurring method (n = 30) involved selecting the first layer(s) of an existing neural network, appending new, randomly initialized layers, and then optimizing the entire neural network.
We identified many ways to adjust or update AI-based prediction models based on new data, within a large variety of use cases. Updating methods for AI-based prediction models other than neural networks (eg, random forest) appear to be underexplored in clinical prediction research.
AI-based prediction models are increasingly used in health care, helping clinicians with diagnosing diseases, guiding treatment decisions, and informing patients. However, these prediction models do not always work well when applied to hospitals, patient populations, or times different from those used to develop the models. Developing new models for every situation is neither practical nor desired, as it wastes resources, time, and existing knowledge. A more efficient approach is to adjust existing models to new contexts ('updating'), but there is limited guidance on how to do this for AI-based clinical prediction models. To address this, we reviewed 78 studies in detail to understand how researchers are currently updating AI-based clinical prediction models, and the types of situations in which these updating methods are used. Our findings provide a comprehensive overview of the available methods to update existing models. This is intended to serve as guidance and inspiration for researchers. Ultimately, this can lead to better reuse of existing models and improve the quality and efficiency of AI-based prediction models in health care.
概述基于新数据更新人工智能(AI)临床预测模型的方法。
我们全面检索了截至2022年8月的Scopus和Embase数据库,以查找涉及预测模型更新方法的开发、描述或评估的文章。我们特别关注医学领域中基于AI的预测模型根据新数据进行更新的文章,不包括基于回归的更新方法,因为这些方法在其他地方已被广泛讨论。我们对识别出的用于更新基于AI的预测模型的方法以及其应用案例进行了分类和描述。
我们纳入了78篇文章。大多数纳入文章讨论了以医学图像作为输入数据(65.4%)的神经网络方法的更新(93.6%)。在许多文章(51.3%)中,针对广泛任务的现有预训练模型被更新以执行专门的临床任务。模型更新的其他常见原因是解决数据随时间的变化和跨中心差异;然而,也识别出了更多独特的应用案例,例如将模型从广泛人群更新到特定个体。我们将识别出的模型更新方法分为四类:特定于神经网络的方法(在92.3%的文章中描述)、特定于集成的方法(2.5%)、与模型无关的方法(9.0%)和其他方法(1.3%)。特定于神经网络的方法的变体根据以下方面进一步分类:(1)保留的原始神经网络的部分,(2)原始神经网络是否以及如何用新参数扩展,以及(3)原始神经网络参数在多大程度上根据新数据进行调整。最常出现的方法(n = 30)包括选择现有神经网络的第一层,附加新的、随机初始化的层,然后优化整个神经网络。
我们在各种各样的应用案例中识别出了许多基于新数据调整或更新基于AI的预测模型的方法。在临床预测研究中,除神经网络(如随机森林)之外的基于AI的预测模型的更新方法似乎尚未得到充分探索。
基于AI的预测模型在医疗保健中越来越多地被使用,帮助临床医生诊断疾病、指导治疗决策并告知患者。然而,当应用于与用于开发模型的医院、患者群体或时间不同的情况时,这些预测模型并不总是能很好地发挥作用。为每种情况开发新模型既不实际也不可取,因为这会浪费资源、时间和现有知识。一种更有效的方法是将现有模型调整到新的背景(“更新”),但对于基于AI的临床预测模型如何做到这一点的指导有限。为了解决这个问题,我们详细审查了78项研究,以了解研究人员目前如何更新基于AI的临床预测模型,以及使用这些更新方法的情况类型。我们的发现全面概述了更新现有模型的可用方法。这旨在为研究人员提供指导和灵感。最终,这可以导致更好地重用现有模型,并提高医疗保健中基于AI的预测模型的质量和效率。