Evaluating Generative AI in Mental Health: Systematic Review of Capabilities and Limitations.

Authors

Wang Liying, Bhanushali Tanmay, Huang Zhuoran, Yang Jingyi, Badami Sukriti, Hightow-Weidman Lisa

Affiliations

Institute on Digital Health and Innovation, College of Nursing, Florida State University, 222 S Copeland St, Tallahassee, FL, 32306, United States, 1 (850) 644-3296.

Center of Population Sciences for Health Equity, College of Nursing, Florida State University, Tallahassee, FL, United States.

Publication information

JMIR Ment Health. 2025 May 15;12:e70014. doi: 10.2196/70014.


DOI: 10.2196/70014
PMID: 40373033
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12097452/
Abstract

BACKGROUND: The global shortage of mental health professionals, exacerbated by increasing mental health needs post COVID-19, has stimulated growing interest in leveraging large language models to address these challenges.
OBJECTIVES: This systematic review aims to evaluate the current capabilities of generative artificial intelligence (GenAI) models in the context of mental health applications.
METHODS: A comprehensive search across 5 databases yielded 1046 references, of which 8 studies met the inclusion criteria. The included studies were original research with experimental designs (eg, Turing tests, sociocognitive tasks, trials, or qualitative methods); a focus on GenAI models; and explicit measurement of sociocognitive abilities (eg, empathy and emotional awareness), mental health outcomes, and user experience (eg, perceived trust and empathy).
RESULTS: The studies, published between 2023 and 2024, primarily evaluated models such as ChatGPT-3.5 and 4.0, Bard, and Claude in tasks such as psychoeducation, diagnosis, emotional awareness, and clinical interventions. Most studies used zero-shot prompting and human evaluators to assess the AI responses, using standardized rating scales or qualitative analysis. However, these methods were often insufficient to fully capture the complexity of GenAI capabilities. The reliance on single-shot prompting techniques, limited comparisons, and task-based assessments isolated from a context may oversimplify GenAI's abilities and overlook the nuances of human-artificial intelligence interaction, especially in clinical applications that require contextual reasoning and cultural sensitivity. The findings suggest that while GenAI models demonstrate strengths in psychoeducation and emotional awareness, their diagnostic accuracy, cultural competence, and ability to engage users emotionally remain limited. Users frequently reported concerns about trustworthiness, accuracy, and the lack of emotional engagement.
CONCLUSIONS: Future research could use more sophisticated evaluation methods, such as few-shot and chain-of-thought prompting to fully uncover GenAI's potential. Longitudinal studies and broader comparisons with human benchmarks are needed to explore the effects of GenAI-integrated mental health care.
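The abstract contrasts zero-shot prompting with few-shot and chain-of-thought prompting but does not show what these look like. The sketch below is one possible illustration of how the three prompt styles differ in structure. The function names, the "Question/Answer" template, and the step-by-step instruction wording are all assumptions made for illustration; they are not taken from the review or from any of the included studies.

```python
from typing import List, Tuple


def zero_shot(question: str) -> str:
    """Zero-shot: the model sees only the question, with no worked examples."""
    return f"Question: {question}\nAnswer:"


def few_shot(question: str, examples: List[Tuple[str, str]]) -> str:
    """Few-shot: a handful of (question, answer) pairs precede the real question."""
    shots = "\n\n".join(f"Question: {q}\nAnswer: {a}" for q, a in examples)
    return f"{shots}\n\nQuestion: {question}\nAnswer:"


def chain_of_thought(question: str) -> str:
    """Chain-of-thought: the prompt asks for intermediate reasoning before the answer."""
    return (
        f"Question: {question}\n"
        "Think step by step, then give a final answer.\n"
        "Answer:"
    )
```

Calling `few_shot("Q3", [("Q1", "A1"), ("Q2", "A2")])` would place the two example pairs ahead of the target question, whereas `zero_shot("Q3")` sends the question alone. An evaluation that only uses the zero-shot form tests a narrower slice of model behavior, which is the limitation the abstract points to.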

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1da/12097452/45f6a3db8410/mental-v12-e70014-g001.jpg

Similar articles

[1]
Evaluating Generative AI in Mental Health: Systematic Review of Capabilities and Limitations.

JMIR Ment Health. 2025-5-15

[2]
Evaluating User Interactions and Adoption Patterns of Generative AI in Health Care Occupations Using Claude: Cross-Sectional Study.

J Med Internet Res. 2025-5-30

[3]
Prompts, privacy, and personalized learning: integrating AI into nursing education-a qualitative study.

BMC Nurs. 2025-4-29

[4]
Responsible Design, Integration, and Use of Generative AI in Mental Health.

JMIR Ment Health. 2025-1-20

[5]
Decoding medical educators' perceptions on generative artificial intelligence in medical education.

J Investig Med. 2024-10

[6]
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.

Pharmacoecon Open. 2025-4-29

[7]
Capacity of Generative AI to Interpret Human Emotions From Visual and Textual Data: Pilot Evaluation Study.

JMIR Ment Health. 2024-2-6

[8]
An Ethical Perspective on the Democratization of Mental Health With Generative AI.

JMIR Ment Health. 2024-10-17

[9]
Safety and User Experience of a Generative Artificial Intelligence Digital Mental Health Intervention: Exploratory Randomized Controlled Trial.

J Med Internet Res. 2025-5-23

[10]
Comparing the Efficacy and Efficiency of Human and Generative AI: Qualitative Thematic Analyses.

JMIR AI. 2024-8-2
