基于产后抑郁的语音助手临床建议：使用苹果 Siri、亚马逊 Alexa、谷歌助手和微软 Cortana 的横断面调查

Clinical Advice by Voice Assistants on Postpartum Depression: Cross-Sectional Investigation Using Apple Siri, Amazon Alexa, Google Assistant, and Microsoft Cortana.

机构信息

The Ohio State University University Wexner Medical Center, Columbus, OH, United States.

Nationwide Children's Hospital, Columbus, OH, United States.

出版信息

JMIR Mhealth Uhealth. 2021 Jan 11;9(1):e24045. doi: 10.2196/24045.

DOI:10.2196/24045

PMID:33427680

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7834933/

Abstract

BACKGROUND

A voice assistant (VA) is inanimate audio-interfaced software augmented with artificial intelligence, capable of 2-way dialogue, and increasingly used to access health care advice. Postpartum depression (PPD) is a common perinatal mood disorder with an annual estimated cost of $14.2 billion. Only a small percentage of PPD patients seek care due to lack of screening and insufficient knowledge of the disease, and this is, therefore, a prime candidate for a VA-based digital health intervention.

OBJECTIVE

In order to understand the capability of VAs, our aim was to assess VA responses to PPD questions in terms of accuracy, verbal response, and clinically appropriate advice given.

METHODS

This cross-sectional study examined four VAs (Apple Siri, Amazon Alexa, Google Assistant, and Microsoft Cortana) installed on two mobile devices in early 2020. We posed 14 questions to each VA that were retrieved from the American College of Obstetricians and Gynecologists (ACOG) patient-focused Frequently Asked Questions (FAQ) on PPD. We scored the VA responses according to accuracy of speech recognition, presence of a verbal response, and clinically appropriate advice in accordance with ACOG FAQ, which were assessed by two board-certified physicians.

RESULTS

Accurate recognition of the query ranged from 79% to 100%. Verbal response ranged from 36% to 79%. If no verbal response was given, queries were treated like a web search between 33% and 89% of the time. Clinically appropriate advice given by VA ranged from 14% to 29%. We compared the category proportions using the Fisher exact test. No single VA statistically outperformed other VAs in the three performance categories. Additional observations showed that two VAs (Google Assistant and Microsoft Cortana) included advertisements in their responses.

CONCLUSIONS

While the best performing VA gave clinically appropriate advice to 29% of the PPD questions, all four VAs taken together achieved 64% clinically appropriate advice. All four VAs performed well in accurately recognizing a PPD query, but no VA achieved even a 30% threshold for providing clinically appropriate PPD information. Technology companies and clinical organizations should partner to improve guidance, screen patients for mental health disorders, and educate patients on potential treatment.

摘要

背景

语音助手（VA）是一种具有人工智能功能的、可进行双向对话的、越来越多被用于获取医疗保健建议的非生命音频接口软件。产后抑郁症（PPD）是一种常见的围产期情绪障碍，每年估计造成 142 亿美元的损失。由于缺乏筛查和对疾病的了解不足，只有一小部分 PPD 患者寻求治疗，因此，VA 是基于数字健康干预的一个主要候选者。

目的

为了了解 VAs 的能力，我们的目的是评估 VA 对 PPD 问题的回答在准确性、口头回答和提供临床适当建议方面的表现。

方法

这项横断面研究于 2020 年初检查了安装在两台移动设备上的四个 VA（苹果 Siri、亚马逊 Alexa、谷歌助手和微软 Cortana）。我们向每个 VA 提出了 14 个问题，这些问题取自美国妇产科医师学会（ACOG）以患者为中心的产后抑郁症常见问题解答（FAQ）。我们根据语音识别的准确性、口头回答的存在以及与 ACOG FAQ 相符的临床适当建议对 VA 回答进行评分，由两名董事会认证的医生进行评估。

结果

查询的准确识别率从 79%到 100%不等。口头回答率从 36%到 79%不等。如果没有口头回答，查询将在 33%到 89%的时间内被视为网络搜索。VA 提供的临床适当建议率从 14%到 29%不等。我们使用 Fisher 精确检验比较了类别比例。没有一个 VA 在三个性能类别中都明显优于其他 VA。进一步的观察表明，有两个 VA（谷歌助手和微软 Cortana）在其回复中包含了广告。

结论

尽管表现最好的 VA 对 29%的 PPD 问题提供了临床适当的建议，但四个 VA 综合起来提供了 64%的临床适当建议。四个 VA 在准确识别 PPD 查询方面表现良好，但没有一个 VA 达到提供临床适当 PPD 信息的 30%的阈值。科技公司和临床组织应合作改进指导，为精神健康障碍患者筛查，并教育患者潜在的治疗方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/13a9/7834933/fa3479437059/mhealth_v9i1e24045_fig1.jpg

相似文献

Clinical Advice by Voice Assistants on Postpartum Depression: Cross-Sectional Investigation Using Apple Siri, Amazon Alexa, Google Assistant, and Microsoft Cortana.基于产后抑郁的语音助手临床建议：使用苹果 Siri、亚马逊 Alexa、谷歌助手和微软 Cortana 的横断面调查

JMIR Mhealth Uhealth. 2021 Jan 11;9(1):e24045. doi: 10.2196/24045.

Smartphone-Based Conversational Agents and Responses to Questions About Mental Health, Interpersonal Violence, and Physical Health.基于智能手机的对话代理以及对心理健康、人际暴力和身体健康相关问题的回应。

JAMA Intern Med. 2016 May 1;176(5):619-25. doi: 10.1001/jamainternmed.2016.0400.

Evaluating the quality of voice assistants' responses to consumer health questions about vaccines: an exploratory comparison of Alexa, Google Assistant and Siri.评估语音助手对消费者有关疫苗的健康问题的回答质量：对Alexa、谷歌助手和Siri的探索性比较。

BMJ Health Care Inform. 2019 Nov;26(1). doi: 10.1136/bmjhci-2019-100075.

Voice Assistants and Cancer Screening: A Comparison of Alexa, Siri, Google Assistant, and Cortana.语音助手与癌症筛查：亚马逊Alexa、苹果Siri、谷歌Assistant和微软Cortana的比较

Ann Fam Med. 2021 Sep-Oct;19(5):447-449. doi: 10.1370/afm.2713.

Evaluating Smart Assistant Responses for Accuracy and Misinformation Regarding Human Papillomavirus Vaccination: Content Analysis Study.评估智能助手在 HPV 疫苗接种方面的准确性和错误信息的反应：内容分析研究。

J Med Internet Res. 2020 Aug 3;22(8):e19018. doi: 10.2196/19018.

The accuracy of artificial intelligence-based virtual assistants in responding to routinely asked questions about orthodontics.人工智能虚拟助手在回答正畸常见问题方面的准确性。

Angle Orthod. 2023 Jul 1;93(4):427-432. doi: 10.2319/100922-691.1.

Medication Name Comprehension of Intelligent Virtual Assistants: A Comparison of Amazon Alexa, Google Assistant, and Apple Siri Between 2019 and 2021.智能虚拟助手对药物名称的理解：2019年至2021年亚马逊Alexa、谷歌助手和苹果Siri的比较

Front Digit Health. 2021 May 19;3:669971. doi: 10.3389/fdgth.2021.669971. eCollection 2021.

Reliability of Commercial Voice Assistants' Responses to Health-Related Questions in Noncommunicable Disease Management: Factorial Experiment Assessing Response Rate and Source of Information.商业语音助手在非传染性疾病管理中对健康相关问题回答的可靠性：评估回答率和信息来源的析因实验

J Med Internet Res. 2021 Dec 20;23(12):e32161. doi: 10.2196/32161.

Virtual Assistants' Response to Queries About Nicotine Replacement Therapy: A Mixed-Method Analysis.虚拟助手对尼古丁替代疗法相关问题的回应：一项混合方法分析。

Eval Health Prof. 2025 Jun;48(2):174-181. doi: 10.1177/01632787241235689. Epub 2024 Feb 26.

Voice Assistants' Responses to Questions About the COVID-19 Vaccine: National Cross-sectional Study.语音助手对关于新冠疫苗问题的回应：全国横断面研究。

JMIR Form Res. 2023 Feb 8;7:e43007. doi: 10.2196/43007.

引用本文的文献

Histological Image Classification Between Follicular Lymphoma and Reactive Lymphoid Tissue Using Deep Learning and Explainable Artificial Intelligence (XAI).使用深度学习和可解释人工智能（XAI）对滤泡性淋巴瘤和反应性淋巴组织进行组织学图像分类

Cancers (Basel). 2025 Jul 22;17(15):2428. doi: 10.3390/cancers17152428.

Navigating promise and perils: applying artificial intelligence to the perinatal mental health care cascade.应对希望与风险：将人工智能应用于围产期心理健康照护流程

Npj Health Syst. 2025;2(1):26. doi: 10.1038/s44401-025-00030-7. Epub 2025 Jul 23.

Persuasive chatbot-based interventions for depression: a list of recommendations for improving reporting standards.基于聊天机器人的抑郁症劝导干预措施：提高报告标准的建议清单

Front Psychiatry. 2025 Jun 19;16:1429304. doi: 10.3389/fpsyt.2025.1429304. eCollection 2025.

User Engagement with A Multimodal Conversational Agent for Self-Care and Chronic Disease Management: A Retrospective Analysis.用户与用于自我护理和慢性病管理的多模态对话代理的互动：一项回顾性分析。

J Med Syst. 2025 Jun 9;49(1):76. doi: 10.1007/s10916-025-02202-2.

From Command to Care: A Scoping Review on Utilization of Smart Speakers by Patients and Providers.从指令到关怀：关于患者和医疗服务提供者对智能音箱使用情况的范围综述

Mayo Clin Proc Digit Health. 2024 Apr 11;2(2):207-220. doi: 10.1016/j.mcpdig.2024.03.002. eCollection 2024 Jun.

Perceptions about the use of virtual assistants for seeking health information among caregivers of young childhood cancer survivors.关于幼儿期癌症幸存者照料者使用虚拟助手获取健康信息的认知。

Digit Health. 2025 Mar 13;11:20552076251326160. doi: 10.1177/20552076251326160. eCollection 2025 Jan-Dec.

Potential association between mobile phone usage duration and postpartum depression risk: Evidence from a Mendelian randomization study.手机使用时长与产后抑郁风险的潜在关联：基于孟德尔随机化研究的证据。

Medicine (Baltimore). 2024 Oct 11;103(41):e39973. doi: 10.1097/MD.0000000000039973.

Chatbot for Social Need Screening and Resource Sharing With Vulnerable Families: Iterative Design and Evaluation Study.用于弱势群体家庭社会需求筛查和资源共享的聊天机器人：迭代设计和评估研究。

JMIR Hum Factors. 2024 Jul 19;11:e57114. doi: 10.2196/57114.

Virtual Assistants' Response to Queries About Nicotine Replacement Therapy: A Mixed-Method Analysis.虚拟助手对尼古丁替代疗法相关问题的回应：一项混合方法分析。

Eval Health Prof. 2025 Jun;48(2):174-181. doi: 10.1177/01632787241235689. Epub 2024 Feb 26.

Redefining Virtual Assistants in Health Care: The Future With Large Language Models.重新定义医疗保健中的虚拟助手：大语言模型的未来。

J Med Internet Res. 2024 Jan 19;26:e53225. doi: 10.2196/53225.

本文引用的文献

Readiness for voice assistants to support healthcare delivery during a health crisis and pandemic.语音助手在健康危机和大流行期间支持医疗服务的准备情况。

NPJ Digit Med. 2020 Sep 16;3:122. doi: 10.1038/s41746-020-00332-0. eCollection 2020.

A scoping review of patient-facing, behavioral health interventions with voice assistant technology targeting self-management and healthy lifestyle behaviors.面向患者的、基于语音助手技术的行为健康干预措施的范围综述，旨在实现自我管理和健康生活方式行为。

Transl Behav Med. 2020 Aug 7;10(3):606-628. doi: 10.1093/tbm/ibz141.

Financial Toll of Untreated Perinatal Mood and Anxiety Disorders Among 2017 Births in the United States.美国 2017 年分娩人群中未经治疗的围产期情绪和焦虑障碍的经济代价。

Am J Public Health. 2020 Jun;110(6):888-896. doi: 10.2105/AJPH.2020.305619. Epub 2020 Apr 16.

Covid-19 and Health Care's Digital Revolution.新冠疫情与医疗保健的数字革命

N Engl J Med. 2020 Jun 4;382(23):e82. doi: 10.1056/NEJMp2005835. Epub 2020 Apr 2.

Responses of Conversational Agents to Health and Lifestyle Prompts: Investigation of Appropriateness and Presentation Structures.对话智能体对健康与生活方式提示的回应：适宜性及呈现结构研究

J Med Internet Res. 2020 Feb 9;22(2):e15823. doi: 10.2196/15823.

Responses to addiction help-seeking from Alexa, Siri, Google Assistant, Cortana, and Bixby intelligent virtual assistants.对来自Alexa、Siri、谷歌助手、Cortana和Bixby智能虚拟助手的成瘾求助响应。

NPJ Digit Med. 2020 Jan 29;3:11. doi: 10.1038/s41746-019-0215-9. eCollection 2020.

BMJ Health Care Inform. 2019 Nov;26(1). doi: 10.1136/bmjhci-2019-100075.

Do you understand the words that are comin outta my mouth? Voice assistant comprehension of medication names.你能听懂我嘴里说出的话吗？语音助手对药物名称的理解。

NPJ Digit Med. 2019 Jun 20;2:55. doi: 10.1038/s41746-019-0133-x. eCollection 2019.

Interventions to Prevent Perinatal Depression: US Preventive Services Task Force Recommendation Statement.预防围产期抑郁的干预措施：美国预防服务工作组推荐声明。

JAMA. 2019 Feb 12;321(6):580-587. doi: 10.1001/jama.2019.0007.

ACOG Committee Opinion No. 757: Screening for Perinatal Depression.美国妇产科医师学会委员会意见 No.757：围产期抑郁筛查。

Obstet Gynecol. 2018 Nov;132(5):e208-e212. doi: 10.1097/AOG.0000000000002927.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于产后抑郁的语音助手临床建议：使用苹果 Siri、亚马逊 Alexa、谷歌助手和微软 Cortana 的横断面调查

Clinical Advice by Voice Assistants on Postpartum Depression: Cross-Sectional Investigation Using Apple Siri, Amazon Alexa, Google Assistant, and Microsoft Cortana.

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献