Is This Chatbot Safe and Evidence-Based? A Call for the Critical Evaluation of Generative AI Mental Health Chatbots.

Author information

Parks Acacia, Travers Eoin, Perera-Delcourt Ramesh, Major Max, Economides Marcos, Mullan Phil

Affiliations

Unmind Ltd, 140 Borough High St, London, SE1 1LB, United Kingdom, 1 2678798387.

Publication information

J Particip Med. 2025 May 29;17:e69534. doi: 10.2196/69534.

DOI:10.2196/69534

PMID:40440646

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12140500/

Abstract

The proliferation of artificial intelligence (AI)-based mental health chatbots, such as those on platforms like OpenAI's GPT Store and Character.AI, raises issues of safety, effectiveness, and ethical use; they also raise an opportunity for patients and consumers to ensure AI tools clearly communicate how they meet their needs. While many of these tools claim to offer therapeutic advice, their unregulated status and lack of systematic evaluation create risks for users, particularly vulnerable individuals. This viewpoint article highlights the urgent need for a standardized framework to assess and demonstrate the safety, ethics, and evidence basis of AI chatbots used in mental health contexts. Drawing on clinical expertise, research, co-design experience, and the World Health Organization's guidance, the authors propose key evaluation criteria: adherence to ethical principles, evidence-based responses, conversational skills, safety protocols, and accessibility. Implementation challenges, including setting output criteria without one "right answer," evaluating multiturn conversations, and involving experts for oversight at scale, are explored. The authors advocate for greater consumer engagement in chatbot evaluation to ensure that these tools address users' needs effectively and responsibly, emphasizing the ethical obligation of developers to prioritize safety and a strong base in empirical evidence.

