Department of Management, Technology, and Economics, ETH Zurich, Zurich, Switzerland.
Future Health Technologies programme, Campus for Research Excellence and Technological Enterprise, Singapore-ETH Centre, Singapore.
J Med Internet Res. 2020 Sep 14;22(9):e20701. doi: 10.2196/20701.
BACKGROUND: A rising number of conversational agents or chatbots are equipped with artificial intelligence (AI) architecture. They are increasingly prevalent in health care applications such as those providing education and support to patients with chronic diseases, one of the leading causes of death in the 21st century. AI-based chatbots enable more effective and frequent interactions with such patients. OBJECTIVE: The goal of this systematic literature review is to review the characteristics, health care conditions, and AI architectures of AI-based conversational agents designed specifically for chronic diseases. METHODS: We conducted a systematic literature review using PubMed MEDLINE, EMBASE, PyscInfo, CINAHL, ACM Digital Library, ScienceDirect, and Web of Science. We applied a predefined search strategy using the terms "conversational agent," "healthcare," "artificial intelligence," and their synonyms. We updated the search results using Google alerts, and screened reference lists for other relevant articles. We included primary research studies that involved the prevention, treatment, or rehabilitation of chronic diseases, involved a conversational agent, and included any kind of AI architecture. Two independent reviewers conducted screening and data extraction, and Cohen kappa was used to measure interrater agreement.A narrative approach was applied for data synthesis. RESULTS: The literature search found 2052 articles, out of which 10 papers met the inclusion criteria. The small number of identified studies together with the prevalence of quasi-experimental studies (n=7) and prevailing prototype nature of the chatbots (n=7) revealed the immaturity of the field. The reported chatbots addressed a broad variety of chronic diseases (n=6), showcasing a tendency to develop specialized conversational agents for individual chronic conditions. However, there lacks comparison of these chatbots within and between chronic diseases. In addition, the reported evaluation measures were not standardized, and the addressed health goals showed a large range. Together, these study characteristics complicated comparability and open room for future research. While natural language processing represented the most used AI technique (n=7) and the majority of conversational agents allowed for multimodal interaction (n=6), the identified studies demonstrated broad heterogeneity, lack of depth of reported AI techniques and systems, and inconsistent usage of taxonomy of the underlying AI software, further aggravating comparability and generalizability of study results. CONCLUSIONS: The literature on AI-based conversational agents for chronic conditions is scarce and mostly consists of quasi-experimental studies with chatbots in prototype stage that use natural language processing and allow for multimodal user interaction. Future research could profit from evidence-based evaluation of the AI-based conversational agents and comparison thereof within and between different chronic health conditions. Besides increased comparability, the quality of chatbots developed for specific chronic conditions and their subsequent impact on the target patients could be enhanced by more structured development and standardized evaluation processes.
背景:越来越多的会话代理或聊天机器人配备了人工智能 (AI) 架构。它们在医疗保健应用中越来越普遍,例如为慢性病患者提供教育和支持,慢性病是 21 世纪的主要死因之一。基于 AI 的聊天机器人使与这些患者进行更有效和更频繁的交互成为可能。
目的:本系统文献综述的目的是回顾专为慢性病设计的基于 AI 的会话代理的特点、医疗保健条件和 AI 架构。
方法:我们使用 PubMed MEDLINE、EMBASE、PyscInfo、CINAHL、ACM 数字图书馆、ScienceDirect 和 Web of Science 进行了系统文献综述。我们使用术语“对话代理”、“医疗保健”、“人工智能”及其同义词应用了预定义的搜索策略。我们使用 Google 警报更新了搜索结果,并筛选了其他相关文章的参考文献列表。我们纳入了涉及慢性病预防、治疗或康复的原始研究,涉及会话代理,并包含任何类型的 AI 架构。两名独立审查员进行了筛选和数据提取,并使用 Cohen kappa 测量了评分者间的一致性。采用叙述方法进行数据综合。
结果:文献检索共发现 2052 篇文章,其中 10 篇符合纳入标准。由于识别出的研究数量较少,再加上准实验研究的流行(n=7)和聊天机器人原型的流行(n=7),表明该领域尚不成熟。报告的聊天机器人针对多种慢性疾病(n=6),展示了为个别慢性疾病开发专门的会话代理的趋势。然而,这些聊天机器人之间和之间没有进行比较。此外,报告的评估措施没有标准化,所解决的健康目标范围也很大。总的来说,这些研究特征增加了可比性,并为未来的研究留出了空间。虽然自然语言处理是使用最多的 AI 技术(n=7),并且大多数会话代理允许多模态交互(n=6),但所识别的研究表现出广泛的异质性,缺乏报告的 AI 技术和系统的深度,以及基础 AI 软件的分类法使用不一致,进一步加剧了研究结果的可比性和可推广性。
结论:关于基于 AI 的慢性病会话代理的文献很少,并且主要由处于原型阶段的基于自然语言处理且允许多模态用户交互的准实验研究组成。未来的研究可以从基于 AI 的会话代理的基于证据的评估中受益,并在不同的慢性健康状况之间进行比较。除了提高可比性外,通过更结构化的开发和标准化的评估流程,还可以提高为特定慢性疾病开发的聊天机器人的质量,并提高其对目标患者的后续影响。
J Med Internet Res. 2020-9-14
J Med Internet Res. 2023-10-3
Sensors (Basel). 2022-3-29
J Med Internet Res. 2020-10-22
J Med Internet Res. 2020-8-7
Healthcare (Basel). 2025-7-7
JMIR Mhealth Uhealth. 2025-6-6
Discov Artif Intell. 2021
Radiol Artif Intell. 2020-3-25
J Med Internet Res. 2019-11-7
J Biomed Inform. 2020-2
Future Healthc J. 2019-6
Methods Inf Med. 2019-6