作为一种干预手段的人工智能：改善临床结果依赖于对人工智能开发和验证采用因果方法。

AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation.

作者信息

Joshi Shalmali, Urteaga Iñigo, van Amsterdam Wouter A C, Hripcsak George, Elias Pierre, Recht Benjamin, Elhadad Noémie, Fackler James, Sendak Mark P, Wiens Jenna, Deshpande Kaivalya, Wald Yoav, Fiterau Madalina, Lipton Zachary, Malinsky Daniel, Nayan Madhur, Namkoong Hongseok, Park Soojin, Vogt Julia E, Ranganath Rajesh

机构信息

Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States.

BCAM-Basque Center for Applied Mathematics, Bilbao 48009, Spain.

出版信息

J Am Med Inform Assoc. 2025 Mar 1;32(3):589-594. doi: 10.1093/jamia/ocae301.

DOI:10.1093/jamia/ocae301

PMID:39775871

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11833492/

Abstract

The primary practice of healthcare artificial intelligence (AI) starts with model development, often using state-of-the-art AI, retrospectively evaluated using metrics lifted from the AI literature like AUROC and DICE score. However, good performance on these metrics may not translate to improved clinical outcomes. Instead, we argue for a better development pipeline constructed by working backward from the end goal of positively impacting clinically relevant outcomes using AI, leading to considerations of causality in model development and validation, and subsequently a better development pipeline. Healthcare AI should be "actionable," and the change in actions induced by AI should improve outcomes. Quantifying the effect of changes in actions on outcomes is causal inference. The development, evaluation, and validation of healthcare AI should therefore account for the causal effect of intervening with the AI on clinically relevant outcomes. Using a causal lens, we make recommendations for key stakeholders at various stages of the healthcare AI pipeline. Our recommendations aim to increase the positive impact of AI on clinical outcomes.

摘要

医疗保健人工智能（AI）的主要实践始于模型开发，通常使用最先进的人工智能技术，并使用从人工智能文献中借鉴的指标（如AUROC和DICE分数）进行回顾性评估。然而，这些指标上的良好表现不一定能转化为改善临床结果。相反，我们主张通过从利用人工智能积极影响临床相关结果的最终目标反向推导来构建更好的开发流程，这导致在模型开发和验证中考虑因果关系，进而形成更好的开发流程。医疗保健人工智能应该是“可操作的”，由人工智能引发的行动变化应该改善结果。量化行动变化对结果的影响就是因果推断。因此，医疗保健人工智能的开发、评估和验证应该考虑干预人工智能对临床相关结果的因果效应。从因果关系的角度出发，我们为医疗保健人工智能流程各个阶段的关键利益相关者提出建议。我们的建议旨在增加人工智能对临床结果的积极影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8af7/11833492/988a3a7d076d/ocae301f1.jpg

相似文献

AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation.作为一种干预手段的人工智能：改善临床结果依赖于对人工智能开发和验证采用因果方法。

J Am Med Inform Assoc. 2025 Mar 1;32(3):589-594. doi: 10.1093/jamia/ocae301.

Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。

Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

The impact of artificial intelligence on the endoscopic assessment of inflammatory bowel disease-related neoplasia.人工智能对炎症性肠病相关肿瘤内镜评估的影响。

Therap Adv Gastroenterol. 2025 Jun 23;18:17562848251348574. doi: 10.1177/17562848251348574. eCollection 2025.

Conservative, physical and surgical interventions for managing faecal incontinence and constipation in adults with central neurological diseases.保守治疗、物理治疗和手术干预用于治疗伴有中枢神经系统疾病的成年人的粪便失禁和便秘。

Cochrane Database Syst Rev. 2024 Oct 29;10(10):CD002115. doi: 10.1002/14651858.CD002115.pub6.

Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。

Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.脑损伤儿童和青少年记忆与执行功能康复的技术辅助手段。

Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Exercise interventions and patient beliefs for people with hip, knee or hip and knee osteoarthritis: a mixed methods review.髋、膝或髋膝骨关节炎患者的运动干预和患者信念：一项混合方法综述

Cochrane Database Syst Rev. 2018 Apr 17;4(4):CD010842. doi: 10.1002/14651858.CD010842.pub2.

引用本文的文献

Artificial Intelligence in Health Education and Practice: A Systematic Review of Health Students' and Academics' Knowledge, Perceptions and Experiences.健康教育与实践中的人工智能：对健康专业学生和学者的知识、认知与经验的系统综述

Int Nurs Rev. 2025 Jun;72(2):e70045. doi: 10.1111/inr.70045.

NeoPred: dual-phase CT AI forecasts pathologic response to neoadjuvant chemo-immunotherapy in NSCLC.NeoPred：双期CT人工智能预测非小细胞肺癌新辅助化疗免疫治疗的病理反应。

J Immunother Cancer. 2025 May 31;13(5):e011773. doi: 10.1136/jitc-2025-011773.

Research Advance of Causal Inference in Clinical Medicine: A Bibliometrics Analysis via Citespace.临床医学中因果推断的研究进展：基于Citespace的文献计量学分析

J Multidiscip Healthc. 2025 May 10;18:2603-2627. doi: 10.2147/JMDH.S516826. eCollection 2025.

本文引用的文献

TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods.TRIPOD+AI 声明：报告使用回归或机器学习方法的临床预测模型的更新指南。

BMJ. 2024 Apr 16;385:e078378. doi: 10.1136/bmj-2023-078378.

Off-label use of artificial intelligence models in healthcare.人工智能模型在医疗保健中的超说明书使用。

Nat Med. 2024 Jun;30(6):1525-1527. doi: 10.1038/s41591-024-02870-6.

Regulate Artificial Intelligence in Health Care by Prioritizing Patient Outcomes.通过优先考虑患者治疗结果来规范医疗保健领域的人工智能。

JAMA. 2024 Feb 27;331(8):639-640. doi: 10.1001/jama.2024.0549.

External validation of AI models in health should be replaced with recurring local validation.健康领域人工智能模型的外部验证应由定期的本地验证取而代之。

Nat Med. 2023 Nov;29(11):2686-2687. doi: 10.1038/s41591-023-02540-z.

Algorithmic fairness in artificial intelligence for medicine and healthcare.人工智能在医学和医疗保健中的算法公平性。

Nat Biomed Eng. 2023 Jun;7(6):719-742. doi: 10.1038/s41551-023-01056-8. Epub 2023 Jun 28.

DEPLOYR: a technical framework for deploying custom real-time machine learning models into the electronic medical record.DEPLOYR：一个将定制的实时机器学习模型部署到电子病历中的技术框架。

J Am Med Inform Assoc. 2023 Aug 18;30(9):1532-1542. doi: 10.1093/jamia/ocad114.

Implementation frameworks for end-to-end clinical AI: derivation of the SALIENT framework.端到端临床人工智能实施框架：SALIENT 框架的推导。

J Am Med Inform Assoc. 2023 Aug 18;30(9):1503-1515. doi: 10.1093/jamia/ocad088.

The impact of commercial health datasets on medical research and health-care algorithms.商业健康数据集对医学研究和医疗保健算法的影响。

Lancet Digit Health. 2023 May;5(5):e288-e294. doi: 10.1016/S2589-7500(23)00025-0.

Making machine learning matter to clinicians: model actionability in medical decision-making.让机器学习对临床医生产生重要影响：医学决策中的模型可操作性。

NPJ Digit Med. 2023 Jan 24;6(1):7. doi: 10.1038/s41746-023-00753-7.

DECIDE-AI: a new reporting guideline and its relevance to artificial intelligence studies in radiology.DECIDE-AI：一种新的报告指南及其与放射学人工智能研究的相关性。

Clin Radiol. 2023 Feb;78(2):130-136. doi: 10.1016/j.crad.2022.09.131.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

作为一种干预手段的人工智能：改善临床结果依赖于对人工智能开发和验证采用因果方法。

AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献