RECOVER Injury Research Centre, Faculty of Health and Behavioural Sciences, The University of Queensland, Brisbane, QLD, Australia.
STARS Education and Research Alliance, Surgical Treatment and Rehabilitation Service (STARS), The University of Queensland and Metro North Health, Brisbane, QLD, Australia.
J Am Med Inform Assoc. 2024 Feb 16;31(3):746-761. doi: 10.1093/jamia/ocad222.
Conversational agents (CAs) with emerging artificial intelligence present new opportunities to assist in health interventions but are difficult to evaluate, deterring their real-world application. We aimed to synthesize existing evidence and knowledge and outline an evaluation framework for CA interventions.
We conducted a systematic scoping review to investigate designs and outcome measures used in the studies that evaluated CAs for health interventions. We then nested the results into an overarching digital health framework proposed by the World Health Organization (WHO).
The review included 81 studies evaluating CAs in experimental trials (n = 59), observational trials (n = 15), and other research designs (n = 7). Most studies (n = 72, 89%) were published in the past 5 years. The proposed CA-evaluation framework includes 4 evaluation stages: (1) feasibility/usability, (2) efficacy, (3) effectiveness, and (4) implementation, aligning with WHO's stepwise evaluation strategy. Across these stages, this article presents the essential evidence of different study designs (n = 8), sample sizes, and main evaluation categories (n = 7) with subcategories (n = 40). The main evaluation categories included (1) functionality, (2) safety and information quality, (3) user experience, (4) clinical and health outcomes, (5) costs and cost benefits, (6) usage, adherence, and uptake, and (7) user characteristics for implementation research. Furthermore, the framework highlighted the essential evaluation areas (potential primary outcomes) and gaps across the evaluation stages.
This review presents a new framework with practical design details to support the evaluation of CA interventions in healthcare research.
The review protocol was registered on the Open Science Framework (https://osf.io/9hq2v) on March 22, 2021.