Bérubé Caterina, Kovacs Zsolt Ferenc, Fleisch Elgar, Kowatsch Tobias
Centre for Digital Health Interventions, Department of Management, Technology, and Economics, ETH Zurich, Zurich, Switzerland.
Future Health Technologies Programme, Campus for Research Excellence and Technological Enterprise (CREATE), Singapore-ETH Centre, Singapore, Singapore.
J Med Internet Res. 2021 Dec 20;23(12):e32161. doi: 10.2196/32161.
Noncommunicable diseases (NCDs) constitute a burden on public health. These are best controlled through self-management practices, such as self-information. Fostering patients' access to health-related information through efficient and accessible channels, such as commercial voice assistants (VAs), may support the patients' ability to make health-related decisions and manage their chronic conditions.
This study aims to evaluate the reliability of the most common VAs (ie, Amazon Alexa, Apple Siri, and Google Assistant) in responding to questions about management of the main NCDs.
We generated health-related questions based on frequently asked questions from health organization, government, medical nonprofit, and other recognized health-related websites about conditions associated with Alzheimer's disease (AD), lung cancer (LCA), chronic obstructive pulmonary disease, diabetes mellitus (DM), cardiovascular disease, chronic kidney disease (CKD), and cerebrovascular accident (CVA). We then validated them with practicing medical specialists, selecting the 10 most frequent questions for each condition. Given the low average frequency of the AD-related questions, we excluded them. This resulted in a pool of 60 questions (10 for each of the 6 remaining conditions). We submitted the selected questions to VAs in a 3×3×6 fractional factorial design experiment with 3 developers (ie, Amazon, Apple, and Google), 3 modalities (ie, voice only, voice and display, display only), and 6 diseases. We assessed the rate of error-free voice responses and classified the web sources based on previous research (ie, expert, commercial, crowdsourced, or not stated).
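The design and the response-rate measure described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' analysis code: the function names (`design_cells`, `response_rate`) and the record layout (dictionaries with an `answered` flag) are assumptions introduced here, and the sketch enumerates the full 3×3×6 grid, whereas the study used a fractional design that did not necessarily run every cell.

```python
from collections import defaultdict
from itertools import product

# Factor levels as named in the abstract.
DEVELOPERS = ["Amazon", "Apple", "Google"]
MODALITIES = ["voice only", "voice and display", "display only"]
DISEASES = [
    "lung cancer",
    "chronic obstructive pulmonary disease",
    "diabetes mellitus",
    "cardiovascular disease",
    "chronic kidney disease",
    "cerebrovascular accident",
]


def design_cells():
    """Return every (developer, modality, disease) cell of the full 3x3x6 grid."""
    return list(product(DEVELOPERS, MODALITIES, DISEASES))


def response_rate(records, key):
    """Share of answered questions, grouped by one factor.

    `records` is a hypothetical layout: an iterable of dicts that each hold
    the grouping factor under `key` and a boolean under "answered".
    """
    totals = defaultdict(int)
    answered = defaultdict(int)
    for record in records:
        totals[record[key]] += 1
        answered[record[key]] += int(record["answered"])
    return {level: answered[level] / totals[level] for level in totals}
```

Grouping by `"developer"`, `"modality"`, or `"disease"` gives the per-factor rates that the results compare. The same grouping could be applied to a flag marking whether the cited source was classified as expert.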
Google showed the highest total response rate, followed by Amazon and Apple. Moreover, although Amazon and Apple showed comparable response rates in the voice-and-display and voice-only modalities, Google showed a slightly higher response rate in the voice-only modality. The same pattern was observed for the rate of expert sources. When considering the response and expert source rates across diseases, we observed that although Google's rates remained comparable across diseases, with a slight advantage for LCA and CKD, both Amazon and Apple showed their highest response rate for LCA. However, both Google and Apple most often provided expert sources for CVA, while Amazon did so for DM.
Google showed the highest response rate and the highest rate of expert sources, leading to the conclusion that Google Assistant would be the most reliable tool in responding to questions about NCD management. However, the rate of expert sources differed across diseases. We urge health organizations to collaborate with Google, Amazon, and Apple to allow their VAs to consistently provide reliable answers to health-related questions on NCD management across the different diseases.