Kocaballi Ahmet Baki, Quiroz Juan C, Rezazadegan Dana, Berkovsky Shlomo, Magrabi Farah, Coiera Enrico, Laranjo Liliana
Australian Institute of Health Innovation, Macquarie University, Sydney, Australia.
NOVA National School of Public Health, Public Health Research Centre, Universidade NOVA de Lisboa, Lisbon, Portugal.
J Med Internet Res. 2020 Feb 9;22(2):e15823. doi: 10.2196/15823.
Conversational agents (CAs) are systems that mimic human conversation using text or spoken language. Widely used examples include voice-activated systems such as Apple Siri, Google Assistant, Amazon Alexa, and Microsoft Cortana. The use of CAs in health care has been rising, but their potential safety risks remain understudied.
This study aimed to analyze how commonly available, general-purpose CAs on smartphones and smart speakers respond to health and lifestyle prompts (questions and open-ended statements), examining both the content and the structure of their responses.
We followed a piloted script to present health- and lifestyle-related prompts to 8 CAs. The CAs' responses were assessed for appropriateness based on prompt type: responses to safety-critical prompts were deemed appropriate if they included a referral to a health professional or service, whereas responses to lifestyle prompts were deemed appropriate if they provided relevant information to address the problem. Response structure was also examined with respect to information source (Web search based or precoded), response content style (informative and/or directive), confirmation of prompt recognition, and empathy.
The 8 CAs provided a total of 240 responses to 30 prompts. Collectively, they responded appropriately to 41% (46/112) of the safety-critical prompts and 39% (37/96) of the lifestyle prompts. The proportion of appropriate responses dropped when safety-critical prompts were rephrased or when the agent used a voice-only interface. Appropriate responses mostly included directive content and empathy statements for the safety-critical prompts, and a mix of informative and directive content for the lifestyle prompts.
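The appropriateness percentages above can be checked directly from the reported counts; a minimal sketch (the counts are taken from the abstract, and the function name is illustrative only):

```python
def appropriateness_pct(appropriate: int, total: int) -> int:
    """Percentage of responses rated appropriate, rounded to a whole percent."""
    return round(100 * appropriate / total)

# Counts reported in the Results: 46 of 112 safety-critical responses
# and 37 of 96 lifestyle responses were rated appropriate.
safety_critical = appropriateness_pct(46, 112)  # 41
lifestyle = appropriateness_pct(37, 96)         # 39
print(safety_critical, lifestyle)
```

This reproduces the 41% and 39% figures stated in the Results.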
Our results suggest that commonly available, general-purpose CAs on smartphones and smart speakers with unconstrained natural language interfaces are limited in their ability to respond appropriately to both safety-critical and lifestyle health prompts. Our study also identified some of the response structures the CAs used in their appropriate responses. Further investigation is needed to establish guidelines for designing suitable response structures for different prompt types.