Li Han, Zhang Renwen, Lee Yi-Chieh, Kraut Robert E, Mohr David C
Department of Communications and New Media, National University of Singapore, Singapore, 117416, Singapore.
Department of Computer Science, National University of Singapore, Singapore, 117416, Singapore.
NPJ Digit Med. 2023 Dec 19;6(1):236. doi: 10.1038/s41746-023-00979-5.
Conversational artificial intelligence (AI), particularly AI-based conversational agents (CAs), is gaining traction in mental health care. Despite their growing usage, there is a scarcity of comprehensive evaluations of their impact on mental health and well-being. This systematic review and meta-analysis aims to fill this gap by synthesizing evidence on the effectiveness of AI-based CAs in improving mental health and factors influencing their effectiveness and user experience. Twelve databases were searched for experimental studies of AI-based CAs' effects on mental illnesses and psychological well-being published before May 26, 2023. Out of 7834 records, 35 eligible studies were identified for systematic review, out of which 15 randomized controlled trials were included for meta-analysis. The meta-analysis revealed that AI-based CAs significantly reduce symptoms of depression (Hedge's g 0.64 [95% CI 0.17-1.12]) and distress (Hedge's g 0.7 [95% CI 0.18-1.22]). These effects were more pronounced in CAs that are multimodal, generative AI-based, integrated with mobile/instant messaging apps, and targeting clinical/subclinical and elderly populations. However, CA-based interventions showed no significant improvement in overall psychological well-being (Hedge's g 0.32 [95% CI -0.13 to 0.78]). User experience with AI-based CAs was largely shaped by the quality of human-AI therapeutic relationships, content engagement, and effective communication. These findings underscore the potential of AI-based CAs in addressing mental health issues. Future research should investigate the underlying mechanisms of their effectiveness, assess long-term effects across various mental health outcomes, and evaluate the safe integration of large language models (LLMs) in mental health care.
对话式人工智能(AI),尤其是基于AI的对话代理(CA),在心理健康护理领域正越来越受到关注。尽管它们的使用越来越广泛,但对其对心理健康和幸福感影响的全面评估却很匮乏。本系统评价和荟萃分析旨在通过综合基于AI的CA在改善心理健康方面的有效性以及影响其有效性和用户体验的因素的证据来填补这一空白。我们检索了12个数据库,以查找2023年5月26日之前发表的关于基于AI的CA对精神疾病和心理健康影响的实验研究。在7834条记录中,确定了35项符合条件的研究进行系统评价,其中15项随机对照试验被纳入荟萃分析。荟萃分析显示,基于AI的CA能显著减轻抑郁症状(Hedge's g 0.64 [95% CI 0.17 - 1.12])和痛苦程度(Hedge's g 0.7 [95% CI 0.18 - 1.22])。这些效果在多模态、基于生成式AI、与移动/即时通讯应用集成以及针对临床/亚临床和老年人群的CA中更为明显。然而,基于CA的干预措施在整体心理健康方面并未显示出显著改善(Hedge's g 0.32 [95% CI -0.13至0.78])。基于AI的CA的用户体验在很大程度上受到人机治疗关系质量、内容参与度和有效沟通的影响。这些发现强调了基于AI的CA在解决心理健康问题方面的潜力。未来的研究应调查其有效性的潜在机制,评估各种心理健康结果的长期影响,并评估大语言模型(LLM)在心理健康护理中的安全整合情况。