• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

这个聊天机器人安全且基于证据吗?呼吁对生成式人工智能心理健康聊天机器人进行批判性评估。

Is This Chatbot Safe and Evidence-Based? A Call for the Critical Evaluation of Generative AI Mental Health Chatbots.

作者信息

Parks Acacia, Travers Eoin, Perera-Delcourt Ramesh, Major Max, Economides Marcos, Mullan Phil

机构信息

Unmind Ltd, 140 Borough High St, London, SE1 1LB, United Kingdom, 1 2678798387.

出版信息

J Particip Med. 2025 May 29;17:e69534. doi: 10.2196/69534.

DOI:10.2196/69534
PMID:40440646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12140500/
Abstract

The proliferation of artificial intelligence (AI)-based mental health chatbots, such as those on platforms like OpenAI's GPT Store and Character. AI, raises issues of safety, effectiveness, and ethical use; they also raise an opportunity for patients and consumers to ensure AI tools clearly communicate how they meet their needs. While many of these tools claim to offer therapeutic advice, their unregulated status and lack of systematic evaluation create risks for users, particularly vulnerable individuals. This viewpoint article highlights the urgent need for a standardized framework to assess and demonstrate the safety, ethics, and evidence basis of AI chatbots used in mental health contexts. Drawing on clinical expertise, research, co-design experience, and the World Health Organization's guidance, the authors propose key evaluation criteria: adherence to ethical principles, evidence-based responses, conversational skills, safety protocols, and accessibility. Implementation challenges, including setting output criteria without one "right answer," evaluating multiturn conversations, and involving experts for oversight at scale, are explored. The authors advocate for greater consumer engagement in chatbot evaluation to ensure that these tools address users' needs effectively and responsibly, emphasizing the ethical obligation of developers to prioritize safety and a strong base in empirical evidence.

摘要

基于人工智能(AI)的心理健康聊天机器人不断涌现,比如OpenAI的GPT Store和Character.AI等平台上的那些聊天机器人,这引发了安全、有效性及道德使用等问题;它们也为患者和消费者提供了一个机会,以确保人工智能工具能清晰说明如何满足他们的需求。虽然这些工具中有许多声称能提供治疗建议,但其不受监管的状态以及缺乏系统评估给用户,尤其是易受伤害的个体带来了风险。这篇观点文章强调,迫切需要一个标准化框架,以评估和证明用于心理健康领域的人工智能聊天机器人的安全性、道德性及证据基础。作者借鉴临床专业知识、研究、共同设计经验以及世界卫生组织的指导意见,提出了关键评估标准:遵守道德原则、基于证据的回应、对话技巧、安全协议及可及性。探讨了实施挑战,包括在没有“正确答案”的情况下设定输出标准、评估多轮对话以及大规模引入专家进行监督等。作者主张让消费者更多地参与聊天机器人评估,以确保这些工具能有效且负责地满足用户需求,强调开发者有道德义务将安全置于首位并以坚实的实证证据为基础。

相似文献

1
Is This Chatbot Safe and Evidence-Based? A Call for the Critical Evaluation of Generative AI Mental Health Chatbots.这个聊天机器人安全且基于证据吗?呼吁对生成式人工智能心理健康聊天机器人进行批判性评估。
J Particip Med. 2025 May 29;17:e69534. doi: 10.2196/69534.
2
Evaluating the Quality of Psychotherapy Conversational Agents: Framework Development and Cross-Sectional Study.评估心理治疗对话代理的质量:框架开发与横断面研究。
JMIR Form Res. 2025 Jul 2;9:e65605. doi: 10.2196/65605.
3
Stench of Errors or the Shine of Potential: The Challenge of (Ir)Responsible Use of ChatGPT in Speech-Language Pathology.错误的恶臭还是潜力的光辉:言语病理学中(不)负责任地使用ChatGPT的挑战。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70088. doi: 10.1111/1460-6984.70088.
4
Conversational AI and Vaccine Communication: Systematic Review of the Evidence.会话式人工智能与疫苗传播:证据的系统评价。
J Med Internet Res. 2023 Oct 3;25:e42758. doi: 10.2196/42758.
5
Chatbots That Deliver Contraceptive Support: Systematic Review.提供避孕支持的聊天机器人:系统评价。
J Med Internet Res. 2024 Feb 27;26:e46758. doi: 10.2196/46758.
6
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
7
Revolutionizing e-health: the transformative role of AI-powered hybrid chatbots in healthcare solutions.变革电子健康:人工智能驱动的混合聊天机器人在医疗保健解决方案中的变革性作用。
Front Public Health. 2025 Feb 13;13:1530799. doi: 10.3389/fpubh.2025.1530799. eCollection 2025.
8
Gaps in Artificial Intelligence Research for Rural Health in the United States: A Scoping Review.美国农村卫生人工智能研究的差距:一项范围综述
medRxiv. 2025 Jun 27:2025.06.26.25330361. doi: 10.1101/2025.06.26.25330361.
9
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施:系统评价概述
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.
10
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.

引用本文的文献

1
Beyond the Bot: A Dual-Phase Framework for Evaluating AI Chatbot Simulations in Nursing Education.超越聊天机器人:护理教育中评估人工智能聊天机器人模拟的双阶段框架。
Nurs Rep. 2025 Jul 31;15(8):280. doi: 10.3390/nursrep15080280.

本文引用的文献

1
[Interpretation of the WHO's "Ethics and Governance of Artificial Intelligence for Health: Guidance on Large Multi-Modal Models" and its implications for China].[解读世界卫生组织《健康领域人工智能的伦理与治理:大型多模态模型指南》及其对中国的影响]
Zhonghua Yu Fang Yi Xue Za Zhi. 2025 Jun 6;59(6):960-969. doi: 10.3760/cma.j.cn112150-20240709-00548.
2
Towards a Multi-Stakeholder process for developing responsible AI governance in consumer health.迈向消费者健康领域负责任人工智能治理的多利益相关方进程。
Int J Med Inform. 2025 Mar;195:105713. doi: 10.1016/j.ijmedinf.2024.105713. Epub 2024 Nov 22.
3
Building robust, proportionate, and timely approaches to regulation and evaluation of digital mental health technologies.建立健全、适度且及时的数字心理健康技术监管与评估方法。
Lancet Digit Health. 2025 Jan;7(1):e89-e93. doi: 10.1016/S2589-7500(24)00215-2. Epub 2024 Nov 15.
4
The regulatory status of health apps that employ gamification.运用游戏化的健康类 APP 的监管现状。
Sci Rep. 2024 Sep 9;14(1):21016. doi: 10.1038/s41598-024-71808-2.
5
User involvement in digital mental health: approaches, potential and the need for guidelines.用户参与数字心理健康:方法、潜力及指南需求
Front Digit Health. 2024 Aug 22;6:1440660. doi: 10.3389/fdgth.2024.1440660. eCollection 2024.
6
Co-producing digital mental health interventions: A systematic review.共同制作数字心理健康干预措施:一项系统综述。
Digit Health. 2024 Apr 25;10:20552076241239172. doi: 10.1177/20552076241239172. eCollection 2024 Jan-Dec.
7
An Overview of Chatbot-Based Mobile Mental Health Apps: Insights From App Description and User Reviews.基于聊天机器人的移动心理健康应用概述:来自应用描述和用户评论的见解。
JMIR Mhealth Uhealth. 2023 May 22;11:e44838. doi: 10.2196/44838.
8
The impact of inconsistent human annotations on AI driven clinical decision making.人类标注不一致对人工智能驱动的临床决策的影响。
NPJ Digit Med. 2023 Feb 21;6(1):26. doi: 10.1038/s41746-023-00773-3.
9
Barriers to and Facilitators of User Engagement With Digital Mental Health Interventions: Systematic Review.数字心理健康干预措施中用户参与的障碍和促进因素:系统评价。
J Med Internet Res. 2021 Mar 24;23(3):e24387. doi: 10.2196/24387.
10
A review of therapist characteristics and techniques positively impacting the therapeutic alliance.对积极影响治疗联盟的治疗师特征和技术的综述。
Clin Psychol Rev. 2003 Feb;23(1):1-33. doi: 10.1016/s0272-7358(02)00146-0.