• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于评估人工智能驱动的临床医生工具长期现实世界影响的AI for IMPACTS框架:系统评价与叙述性综合分析

AI for IMPACTS Framework for Evaluating the Long-Term Real-World Impacts of AI-Powered Clinician Tools: Systematic Review and Narrative Synthesis.

作者信息

Jacob Christine, Brasier Noé, Laurenzi Emanuele, Heuss Sabina, Mougiakakou Stavroula-Georgia, Cöltekin Arzu, Peter Marc K

机构信息

FHNW, University of Applied Sciences and Arts Northwestern Switzerland, Windisch, Switzerland.

Institute of Translational Medicine, Department of Health Science and Technology, ETH Zurich, Zurich, Switzerland.

出版信息

J Med Internet Res. 2025 Feb 5;27:e67485. doi: 10.2196/67485.

DOI:10.2196/67485
PMID:39909417
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11840377/
Abstract

BACKGROUND

Artificial intelligence (AI) has the potential to revolutionize health care by enhancing both clinical outcomes and operational efficiency. However, its clinical adoption has been slower than anticipated, largely due to the absence of comprehensive evaluation frameworks. Existing frameworks remain insufficient and tend to emphasize technical metrics such as accuracy and validation, while overlooking critical real-world factors such as clinical impact, integration, and economic sustainability. This narrow focus prevents AI tools from being effectively implemented, limiting their broader impact and long-term viability in clinical practice.

OBJECTIVE

This study aimed to create a framework for assessing AI in health care, extending beyond technical metrics to incorporate social and organizational dimensions. The framework was developed by systematically reviewing, analyzing, and synthesizing the evaluation criteria necessary for successful implementation, focusing on the long-term real-world impact of AI in clinical practice.

METHODS

A search was performed in July 2024 across the PubMed, Cochrane, Scopus, and IEEE Xplore databases to identify relevant studies published in English between January 2019 and mid-July 2024, yielding 3528 results, among which 44 studies met the inclusion criteria. The systematic review followed PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) guidelines and the Cochrane Handbook for Systematic Reviews. Data were analyzed using NVivo through thematic analysis and narrative synthesis to identify key emergent themes in the studies.

RESULTS

By synthesizing the included studies, we developed a framework that goes beyond the traditional focus on technical metrics or study-level methodologies. It integrates clinical context and real-world implementation factors, offering a more comprehensive approach to evaluating AI tools. With our focus on assessing the long-term real-world impact of AI technologies in health care, we named the framework AI for IMPACTS. The criteria are organized into seven key clusters, each corresponding to a letter in the acronym: (1) I-integration, interoperability, and workflow; (2) M-monitoring, governance, and accountability; (3) P-performance and quality metrics; (4) A-acceptability, trust, and training; (5) C-cost and economic evaluation; (6) T-technological safety and transparency; and (7) S-scalability and impact. These are further broken down into 28 specific subcriteria.

CONCLUSIONS

The AI for IMPACTS framework offers a holistic approach to evaluate the long-term real-world impact of AI tools in the heterogeneous and challenging health care context and lays the groundwork for further validation through expert consensus and testing of the framework in real-world health care settings. It is important to emphasize that multidisciplinary expertise is essential for assessment, yet many assessors lack the necessary training. In addition, traditional evaluation methods struggle to keep pace with AI's rapid development. To ensure successful AI integration, flexible, fast-tracked assessment processes and proper assessor training are needed to maintain rigorous standards while adapting to AI's dynamic evolution.

TRIAL REGISTRATION

reviewregistry1859; https://tinyurl.com/ysn2d7sh.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/821240f81440/jmir_v27i1e67485_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/c5068fe7b110/jmir_v27i1e67485_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/0acc36b35eeb/jmir_v27i1e67485_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/821240f81440/jmir_v27i1e67485_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/c5068fe7b110/jmir_v27i1e67485_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/0acc36b35eeb/jmir_v27i1e67485_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/508b/11840377/821240f81440/jmir_v27i1e67485_fig3.jpg
摘要

背景

人工智能(AI)有潜力通过改善临床结果和运营效率来彻底改变医疗保健行业。然而,其在临床中的应用速度比预期要慢,这主要是由于缺乏全面的评估框架。现有的框架仍然不够完善,往往侧重于技术指标,如准确性和验证,而忽视了关键的现实世界因素,如临床影响、整合和经济可持续性。这种狭隘的关注点阻碍了人工智能工具的有效实施,限制了它们在临床实践中的更广泛影响和长期可行性。

目的

本研究旨在创建一个评估医疗保健领域人工智能的框架,超越技术指标,纳入社会和组织层面。该框架是通过系统地审查、分析和综合成功实施所需的评估标准而制定的,重点关注人工智能在临床实践中的长期现实世界影响。

方法

2024年7月在PubMed、Cochrane、Scopus和IEEE Xplore数据库中进行了检索,以识别2019年1月至2024年7月中旬期间发表的英文相关研究,共获得3528条结果,其中44项研究符合纳入标准。系统评价遵循PRISMA(系统评价和Meta分析的首选报告项目)指南和Cochrane系统评价手册。使用NVivo通过主题分析和叙述性综合对数据进行分析,以确定研究中的关键新兴主题。

结果

通过综合纳入的研究,我们开发了一个框架,该框架超越了传统上对技术指标或研究层面方法的关注。它整合了临床背景和现实世界的实施因素,为评估人工智能工具提供了更全面的方法。由于我们专注于评估人工智能技术在医疗保健中的长期现实世界影响,我们将该框架命名为IMPACTS人工智能。这些标准被组织成七个关键集群,每个集群对应首字母缩写中的一个字母:(1)I-整合、互操作性和工作流程;(2)M-监测、治理和问责制;(3)P-性能和质量指标;(4)A-可接受性、信任和培训;(5)C-成本和经济评估;(6)T-技术安全性和透明度;(7)S-可扩展性和影响。这些进一步细分为28个具体的子标准。

结论

IMPACTS人工智能框架提供了一种整体方法,用于评估人工智能工具在异质且具有挑战性的医疗保健环境中的长期现实世界影响,并为通过专家共识和在现实世界医疗保健环境中对该框架进行测试来进一步验证奠定了基础。需要强调的是,多学科专业知识对于评估至关重要,但许多评估人员缺乏必要的培训。此外,传统评估方法难以跟上人工智能的快速发展。为确保人工智能的成功整合,需要灵活、快速的评估流程和适当的评估人员培训,以在适应人工智能动态发展的同时保持严格的标准。

试验注册

reviewregistry1859;https://tinyurl.com/ysn2d7sh。

相似文献

1
AI for IMPACTS Framework for Evaluating the Long-Term Real-World Impacts of AI-Powered Clinician Tools: Systematic Review and Narrative Synthesis.用于评估人工智能驱动的临床医生工具长期现实世界影响的AI for IMPACTS框架:系统评价与叙述性综合分析
J Med Internet Res. 2025 Feb 5;27:e67485. doi: 10.2196/67485.
2
The Role of AI in Nursing Education and Practice: Umbrella Review.人工智能在护理教育与实践中的作用:综合述评
J Med Internet Res. 2025 Apr 4;27:e69881. doi: 10.2196/69881.
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
5
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
6
A Scoping Review of the Observed and Perceived Functional Impacts Associated With Language and Learning Disorders in School-Aged Children.一项关于学龄儿童语言和学习障碍相关的观察到的和感知到的功能影响的范围综述。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70086. doi: 10.1111/1460-6984.70086.
7
Revolutionizing e-health: the transformative role of AI-powered hybrid chatbots in healthcare solutions.变革电子健康:人工智能驱动的混合聊天机器人在医疗保健解决方案中的变革性作用。
Front Public Health. 2025 Feb 13;13:1530799. doi: 10.3389/fpubh.2025.1530799. eCollection 2025.
8
Health Care Professionals' Experience of Using AI: Systematic Review With Narrative Synthesis.医疗保健专业人员使用人工智能的体验:系统评价与叙事综合。
J Med Internet Res. 2024 Oct 30;26:e55766. doi: 10.2196/55766.
9
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
10
Gaps in Artificial Intelligence Research for Rural Health in the United States: A Scoping Review.美国农村卫生人工智能研究的差距:一项范围综述
medRxiv. 2025 Jun 27:2025.06.26.25330361. doi: 10.1101/2025.06.26.25330361.

引用本文的文献

1
Artificial Intelligence in Risk Stratification and Outcome Prediction for Transcatheter Aortic Valve Replacement: A Systematic Review and Meta-Analysis.人工智能在经导管主动脉瓣置换术风险分层和结果预测中的应用:一项系统评价和荟萃分析。
J Pers Med. 2025 Jul 11;15(7):302. doi: 10.3390/jpm15070302.
2
Hospital Investment Decisions and Prioritization of Clinical Programs.医院投资决策与临床项目的优先级排序
Cureus. 2025 Mar 3;17(3):e79998. doi: 10.7759/cureus.79998. eCollection 2025 Mar.
3
Artificial intelligence in pediatric medicine: a call for rigorous reporting standards.

本文引用的文献

1
A scoping review of reporting gaps in FDA-approved AI medical devices.对美国食品药品监督管理局(FDA)批准的人工智能医疗设备报告漏洞的范围审查。
NPJ Digit Med. 2024 Oct 3;7(1):273. doi: 10.1038/s41746-024-01270-x.
2
Enhanced gastric cancer classification and quantification interpretable framework using digital histopathology images.基于数字病理图像的胃癌分类和定量可解释性增强框架。
Sci Rep. 2024 Sep 28;14(1):22533. doi: 10.1038/s41598-024-73823-9.
3
Skin lesion segmentation using deep learning algorithm with ant colony optimization.基于蚁群算法的深度学习算法在皮肤损伤分割中的应用。
儿科学中的人工智能:呼吁采用严格的报告标准。
J Perinatol. 2025 Apr 2. doi: 10.1038/s41372-025-02284-3.
BMC Med Inform Decis Mak. 2024 Sep 27;24(1):265. doi: 10.1186/s12911-024-02686-x.
4
Generalizability assessment of AI models across hospitals in a low-middle and high income country.在中低收入和高收入国家的医院之间评估人工智能模型的泛化能力。
Nat Commun. 2024 Sep 27;15(1):8270. doi: 10.1038/s41467-024-52618-6.
5
Generative artificial intelligence and ethical considerations in health care: a scoping review and ethics checklist.生成式人工智能与医疗保健中的伦理考量:范围综述与伦理检查表。
Lancet Digit Health. 2024 Nov;6(11):e848-e856. doi: 10.1016/S2589-7500(24)00143-2. Epub 2024 Sep 17.
6
Barriers to and Facilitators of Artificial Intelligence Adoption in Health Care: Scoping Review.医疗保健中人工智能采用的障碍和促进因素:范围综述。
JMIR Hum Factors. 2024 Aug 29;11:e48633. doi: 10.2196/48633.
7
Not all AI health tools with regulatory authorization are clinically validated.并非所有获得监管授权的人工智能健康工具都经过临床验证。
Nat Med. 2024 Oct;30(10):2718-2720. doi: 10.1038/s41591-024-03203-3.
8
Clinical and Surgical Applications of Large Language Models: A Systematic Review.大语言模型的临床与外科应用:一项系统综述
J Clin Med. 2024 May 22;13(11):3041. doi: 10.3390/jcm13113041.
9
Frameworks for procurement, integration, monitoring, and evaluation of artificial intelligence tools in clinical settings: A systematic review.临床环境中人工智能工具的采购、整合、监测和评估框架:一项系统综述。
PLOS Digit Health. 2024 May 29;3(5):e0000514. doi: 10.1371/journal.pdig.0000514. eCollection 2024 May.
10
Consolidated Health Economic Evaluation Reporting Standards for Interventions That Use Artificial Intelligence (CHEERS-AI).人工智能干预措施的综合健康经济评估报告标准 (CHEERS-AI)。
Value Health. 2024 Sep;27(9):1196-1205. doi: 10.1016/j.jval.2024.05.006. Epub 2024 May 23.