• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生成式人工智能在评估对负责任的自杀新闻媒体报道的依从性方面的作用:一项多地点、三种语言的研究。

The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study.

作者信息

Elyospeh Zohar, Nobile Bénédicte, Levkovich Inbar, Chancel Raphael, Courtet Philippe, Levi-Belz Yossi

机构信息

https://ror.org/02f009v59University of Haifa, Mount Carmel, Haifa, Israel.

Department of Emergency Psychiatry and Acute Care, CHU Montpellier, Montpellier, France.

出版信息

Eur Psychiatry. 2025 May 27;68(1):e81. doi: 10.1192/j.eurpsy.2025.10037.

DOI:10.1192/j.eurpsy.2025.10037
PMID:40420443
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12188334/
Abstract

BACKGROUND

Improving media adherence to World Health Organization (WHO) guidelines is crucial for preventing suicidal behaviors in the general population. However, there is currently no valid, rapid, and effective method to evaluate the adherence to these guidelines.

METHODS

This comparative effectiveness study (January-August 2024) evaluated the ability of two artificial intelligence (AI) models (Claude Opus 3 and GPT-4O) to assess the adherence of media reports to WHO suicide-reporting guidelines. A total of 120 suicide-related articles (40 in English, 40 in Hebrew, and 40 in French) published within the past 5 years were sourced from prominent newspapers. Six trained human raters (two per language) independently evaluated articles based on a WHO guideline-based questionnaire addressing aspects, such as prominence, sensationalism, and prevention. The same articles were also processed using AI models. Intraclass correlation coefficients (ICCs) and Spearman correlations were calculated to assess agreement between human raters and AI models.

RESULTS

Overall adherence to WHO guidelines was ~50% across all languages. Both AI models demonstrated strong agreement with human raters, with GPT-4O showing the highest agreement (ICC = 0.793 [0.702; 0.855]). The combined evaluations of GPT-4O and Claude Opus 3 yielded the highest reliability (ICC = 0.812 [0.731; 0.869]).

CONCLUSIONS

AI models can replicate human judgment in evaluating media adherence to WHO guidelines. However, they have limitations and should be used alongside human oversight. These findings may suggest that AI tools have the potential to enhance and promote responsible reporting practices among journalists and, thus, may support suicide prevention efforts globally.

摘要

背景

提高媒体对世界卫生组织(WHO)指南的遵循程度对于预防普通人群的自杀行为至关重要。然而,目前尚无有效、快速且高效的方法来评估对这些指南的遵循情况。

方法

这项比较有效性研究(2024年1月至8月)评估了两种人工智能(AI)模型(Claude Opus 3和GPT - 4O)评估媒体报道对WHO自杀报告指南遵循情况的能力。从知名报纸中选取了过去5年内发表的总共120篇与自杀相关的文章(40篇英文、40篇希伯来文和40篇法文)。六名经过培训的人类评分员(每种语言两名)根据一份基于WHO指南的问卷对文章进行独立评估,该问卷涉及突出性、轰动性和预防等方面。同样的文章也使用AI模型进行处理。计算组内相关系数(ICC)和斯皮尔曼相关性以评估人类评分员与AI模型之间的一致性。

结果

所有语言对WHO指南的总体遵循率约为50%。两种AI模型都与人类评分员表现出高度一致性,GPT - 4O的一致性最高(ICC = 0.793 [0.702; 0.855])。GPT - 4O和Claude Opus 3的综合评估产生了最高的可靠性(ICC = 0.812 [0.731; 0.869])。

结论

AI模型在评估媒体对WHO指南的遵循情况时能够复制人类的判断。然而,它们存在局限性,应在人类监督下使用。这些发现可能表明AI工具具有增强并促进记者进行负责任报道实践的潜力,从而可能支持全球的自杀预防工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37e/12188334/5dcce7186a53/S0924933825100370_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37e/12188334/e854d66342d3/S0924933825100370_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37e/12188334/5dcce7186a53/S0924933825100370_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37e/12188334/e854d66342d3/S0924933825100370_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37e/12188334/5dcce7186a53/S0924933825100370_fig2.jpg

相似文献

1
The role of generative artificial intelligence in evaluating adherence to responsible press media reports on suicide: A multisite, three-language study.生成式人工智能在评估对负责任的自杀新闻媒体报道的依从性方面的作用:一项多地点、三种语言的研究。
Eur Psychiatry. 2025 May 27;68(1):e81. doi: 10.1192/j.eurpsy.2025.10037.
2
Prevention of self-harm and suicide in young people up to the age of 25 in education settings.在教育环境中预防25岁及以下年轻人的自我伤害和自杀行为。
Cochrane Database Syst Rev. 2024 Dec 20;12(12):CD013844. doi: 10.1002/14651858.CD013844.pub2.
3
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
4
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
5
Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.暴力死亡监测——2022年全国暴力死亡报告系统,50个州、哥伦比亚特区和波多黎各
MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
7
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
Applications of Large Language Models in the Field of Suicide Prevention: Scoping Review.大语言模型在自杀预防领域的应用:范围综述
J Med Internet Res. 2025 Jan 23;27:e63126. doi: 10.2196/63126.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Applying language models for suicide prevention: evaluating news article adherence to WHO reporting guidelines.应用语言模型预防自杀:评估新闻文章对世界卫生组织报告指南的遵循情况。
Npj Ment Health Res. 2025 Jun 20;4(1):25. doi: 10.1038/s44184-025-00139-5.
2
Applications of Large Language Models in the Field of Suicide Prevention: Scoping Review.大语言模型在自杀预防领域的应用:范围综述
J Med Internet Res. 2025 Jan 23;27:e63126. doi: 10.2196/63126.
3
Suicide prevention strategies in South Korea: What we have learned and the way forward.
韩国的自杀预防策略:我们所学到的以及未来的方向。
Asian J Psychiatr. 2025 Feb;104:104359. doi: 10.1016/j.ajp.2025.104359. Epub 2025 Jan 8.
4
Integrating Previous Suicide Attempts, Gender, and Age Into Suicide Risk Assessment Using Advanced Artificial Intelligence Models.利用先进人工智能模型将既往自杀尝试、性别和年龄纳入自杀风险评估。
J Clin Psychiatry. 2024 Oct 2;85(4):24m15365. doi: 10.4088/JCP.24m15365.
5
Artificial intelligence, consciousness and psychiatry.人工智能、意识与精神病学。
World Psychiatry. 2024 Oct;23(3):309-310. doi: 10.1002/wps.21222.
6
Evaluating large language models for health-related text classification tasks with public social media data.利用公共社交媒体数据评估用于健康相关文本分类任务的大型语言模型。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2181-2189. doi: 10.1093/jamia/ocae210.
7
The impact of media reporting of suicides on subsequent suicides in Asia: A systematic review.媒体对自杀事件的报道对亚洲后续自杀事件的影响:系统评价。
Ann Acad Med Singap. 2024 Mar 27;53(3):152-169. doi: 10.47102/annals-acadmedsg.2023237.
8
The impact of history of depression and access to weapons on suicide risk assessment: a comparison of ChatGPT-3.5 and ChatGPT-4.抑郁史和武器获取对自杀风险评估的影响:ChatGPT-3.5 和 ChatGPT-4 的比较。
PeerJ. 2024 May 29;12:e17468. doi: 10.7717/peerj.17468. eCollection 2024.
9
A critical assessment of using ChatGPT for extracting structured data from clinical notes.对使用ChatGPT从临床记录中提取结构化数据的批判性评估。
NPJ Digit Med. 2024 May 1;7(1):106. doi: 10.1038/s41746-024-01079-8.
10
Generative artificial intelligence in mental health care: potential benefits and current challenges.心理健康护理中的生成式人工智能:潜在益处与当前挑战。
World Psychiatry. 2024 Feb;23(1):1-2. doi: 10.1002/wps.21148.