Ansari Sardar, Baur Brittany, Singh Karandeep, Admon Andrew J
Department of Emergency Medicine, University of Michigan, Ann Arbor, MI.
Department of Medicine, University of California San Diego, San Diego, CA.
NEJM AI. 2025 May;2(5). doi: 10.1056/aip2401116. Epub 2025 Apr 24.
Predictive artificial intelligence (AI) models enhance clinical workflows with applications such as prognostication and decision support, yet suffer from postdeployment performance challenges due to dataset shifts. Regulatory guidelines emphasize the need for continuous monitoring, but actionable strategies are lacking. A significant issue is postdeployment assessment of predictive AI models due to confounding medical interventions where effective interventions modify outcomes, introducing bias into performance assessment. This can falsely suggest model decay, leading to unwarranted updates or decommissioning, harming clinical outcomes. Proposed solutions include withholding model outputs, monitoring outcomes as surrogates, or including clinician interventions in models, each with ethical or practical limitations. The lack of effective solutions for this problem can lead to an abundance of models that cannot be later evaluated, tuned, or withdrawn if they become ineffective, leading to patient harm. Advanced causal modeling to assess counterfactual outcomes may offer a reliable validation method. Until effective methods for postdeployment monitoring of predictive models are developed and validated, decisions on model updates should consider the causal pathways and be evidence based, ensuring the sustained utility of AI models in dynamic clinical environments.
预测性人工智能(AI)模型通过诸如预后预测和决策支持等应用增强了临床工作流程,但由于数据集的变化,在部署后仍面临性能挑战。监管指南强调持续监测的必要性,但缺乏可行的策略。一个重要问题是预测性AI模型的部署后评估,因为有效的医疗干预会改变结果,从而在性能评估中引入偏差,这是一种混淆的医疗干预。这可能会错误地表明模型衰退,导致不必要的更新或停用,损害临床结果。提出的解决方案包括保留模型输出、将结果作为替代指标进行监测,或在模型中纳入临床医生的干预措施,但每种方法都有伦理或实际限制。对于这个问题缺乏有效的解决方案可能会导致大量模型在变得无效后无法进行评估、调整或撤回,从而对患者造成伤害。用于评估反事实结果的高级因果建模可能提供一种可靠的验证方法。在开发和验证预测模型部署后监测的有效方法之前,关于模型更新的决策应考虑因果路径并以证据为基础,确保AI模型在动态临床环境中的持续效用。