使用自然语言处理的自杀遗书分类：一项内容分析

Suicide Note Classification Using Natural Language Processing: A Content Analysis.

作者信息

Pestian John, Nasrallah Henry, Matykiewicz Pawel, Bennett Aurora, Leenaars Antoon

机构信息

Department of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA.

出版信息

Biomed Inform Insights. 2010 Aug 4;2010(3):19-28. doi: 10.4137/bii.s4706.

DOI:10.4137/bii.s4706

PMID:21643548

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3107011/

Abstract

Suicide is the second leading cause of death among 25-34 year olds and the third leading cause of death among 15-25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes.

摘要

在美国，自杀是25至34岁人群中的第二大死因，是15至25岁人群中的第三大死因。在急诊室，经常会有自杀患者前来就诊，而对其再次自杀风险的评估通常依靠临床判断。本文是我们第二次尝试确定计算算法在理解自杀患者想法（以自杀遗书为代表）方面的作用。我们专注于开发自然语言处理方法，以区分真实的和诱导产生的自杀遗书。我们假设机器学习算法在对自杀遗书进行分类方面能够与心理健康专业人员和精神科医师实习生做得一样好。所使用的数据包括33例自杀身亡者的自杀遗书，并与健康对照组成员的33份诱导遗书相匹配。11名心理健康专业人员和31名精神科实习生被要求判断一份遗书是真实的还是诱导产生的。他们的判断结果与9种不同的机器学习算法进行了比较。结果表明，实习生准确分类遗书的概率为49%，心理健康专业人员为63%，而最佳机器学习算法准确分类遗书的概率为78%。这是朝着开发基于证据的再次自杀风险预测指标迈出的重要一步，因为它表明自然语言处理有助于区分不同类别的自杀遗书。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc80/7492862/fc2d1cd87355/10.4137_BII.S4706-fig1.jpg

相似文献

Suicide Note Classification Using Natural Language Processing: A Content Analysis.使用自然语言处理的自杀遗书分类：一项内容分析

Biomed Inform Insights. 2010 Aug 4;2010(3):19-28. doi: 10.4137/bii.s4706.

Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records.使用电子健康记录的自然语言处理和机器学习识别精神科住院青少年的自杀行为。

PLoS One. 2019 Feb 19;14(2):e0211116. doi: 10.1371/journal.pone.0211116. eCollection 2019.

Natural language processing of clinical mental health notes may add predictive value to existing suicide risk models.临床心理健康记录的自然语言处理可能为现有自杀风险模型增加预测价值。

Psychol Med. 2021 Jun;51(8):1382-1391. doi: 10.1017/S0033291720000173. Epub 2020 Feb 17.

Detection of self-harm and suicidal ideation in emergency department triage notes.在急诊分诊记录中检测自我伤害和自杀意念。

J Am Med Inform Assoc. 2022 Jan 29;29(3):472-480. doi: 10.1093/jamia/ocab261.

Applications of Aspect-based Sentiment Analysis on Psychiatric Clinical Notes to Study Suicide in Youth.基于方面的情感分析在精神科临床记录中的应用，以研究青年人群中的自杀问题。

AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:229-237. eCollection 2021.

Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation.利用弱监督和深度学习对临床记录进行分类，以识别当前的自杀意念。

J Psychiatr Res. 2021 Apr;136:95-102. doi: 10.1016/j.jpsychires.2021.01.052. Epub 2021 Feb 2.

Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing.使用自然语言处理技术在精神科临床研究数据库中识别自杀意念和自杀企图。

Sci Rep. 2018 May 9;8(1):7426. doi: 10.1038/s41598-018-25773-2.

A Natural Language Processing Pipeline based on the Columbia-Suicide Severity Rating Scale.基于哥伦比亚自杀严重程度评定量表的自然语言处理管道。

medRxiv. 2024 Dec 20:2024.12.19.24319352. doi: 10.1101/2024.12.19.24319352.

Prediction of suicidal ideation in children and adolescents using machine learning and deep learning algorithm: A case study in South Korea where suicide is the leading cause of death.使用机器学习和深度学习算法预测儿童和青少年的自杀意念：以自杀是韩国主要死因的国家为例的案例研究。

Asian J Psychiatr. 2023 Oct;88:103725. doi: 10.1016/j.ajp.2023.103725. Epub 2023 Aug 6.

Identifying Suicidal Ideation and Attempt From Clinical Notes Within a Large Integrated Health Care System.从大型综合医疗保健系统的临床记录中识别自杀意念和企图。

Perm J. 2022 Apr 5;26(1):85-93. doi: 10.7812/TPP/21.102.

引用本文的文献

Considerations and Challenges When Using Clinical and Vital Record Review for Suicide Research.使用临床和生命记录回顾进行自杀研究时的注意事项与挑战。

J Patient Saf. 2025 Apr 1;21(3):e8-e17. doi: 10.1097/PTS.0000000000001325. Epub 2025 Feb 11.

Enhancing Suicide Attempt Risk Prediction Models with Temporal Clinical Note Features.利用时间性临床记录特征增强自杀未遂风险预测模型

Appl Clin Inform. 2024 Oct;15(5):1107-1120. doi: 10.1055/a-2411-5796. Epub 2024 Sep 9.

Identifying Reddit Users at a High Risk of Suicide and Their Linguistic Features During the COVID-19 Pandemic: Growth-Based Trajectory Model.识别新冠疫情期间有高自杀风险的Reddit用户及其语言特征：基于增长的轨迹模型

J Med Internet Res. 2024 Aug 8;26:e48907. doi: 10.2196/48907.

Exploring the Role of First-Person Singular Pronouns in Detecting Suicidal Ideation: A Machine Learning Analysis of Clinical Transcripts.探究第一人称单数代词在检测自杀意念中的作用：临床记录的机器学习分析

Behav Sci (Basel). 2024 Mar 11;14(3):225. doi: 10.3390/bs14030225.

A multimodal dialog approach to mental state characterization in clinically depressed, anxious, and suicidal populations.一种用于临床抑郁症、焦虑症和自杀倾向人群心理状态特征描述的多模态对话方法。

Front Psychol. 2023 Sep 11;14:1135469. doi: 10.3389/fpsyg.2023.1135469. eCollection 2023.

Virtually screening adults for depression, anxiety, and suicide risk using machine learning and language from an open-ended interview.利用机器学习和开放式访谈中的语言对成年人进行抑郁症、焦虑症和自杀风险的虚拟筛查。

Front Psychiatry. 2023 Jun 12;14:1143175. doi: 10.3389/fpsyt.2023.1143175. eCollection 2023.

Indirect Identification of Perinatal Psychosocial Risks from Natural Language.从自然语言中间接识别围产期心理社会风险

IEEE Trans Affect Comput. 2023 Apr-Jun;14(2):1506-1519. doi: 10.1109/TAFFC.2021.3079282. Epub 2021 May 11.

Identification of maternal depression risk from natural language collected in a mobile health app.从移动健康应用程序中收集的自然语言识别孕产妇抑郁风险。

Procedia Comput Sci. 2022;206:132-140. doi: 10.1016/j.procs.2022.09.092. Epub 2022 Sep 21.

Developing an Automated Assessment of In-session Patient Activation for Psychological Therapy: Codevelopment Approach.开发心理治疗中患者即时激活的自动评估：共同开发方法。

JMIR Med Inform. 2022 Nov 8;10(11):e38168. doi: 10.2196/38168.

Machine learning prediction of suicidal ideation, planning, and attempt among Korean adults: A population-based study.韩国成年人自杀意念、计划及企图的机器学习预测：一项基于人群的研究。

SSM Popul Health. 2022 Sep 14;19:101231. doi: 10.1016/j.ssmph.2022.101231. eCollection 2022 Sep.

本文引用的文献

A comparison of suicide notes written by men and women.男性与女性所写遗书的比较。

Death Stud. 2016;40(3):201-3. doi: 10.1080/07481187.2015.1086449. Epub 2015 Sep 1.

Electrophysiological evidence of interaction between contextual expectation and semantic integration during the processing of collocations.搭配处理过程中语境预期与语义整合相互作用的电生理学证据。

Biol Psychol. 2010 Mar;83(3):176-90. doi: 10.1016/j.biopsycho.2009.12.006. Epub 2009 Dec 22.

A new readability yardstick.一种新的可读性衡量标准。

J Appl Psychol. 1948 Jun;32(3):221-33. doi: 10.1037/h0057532.

Challenges in assessing intent to die: can suicide attempters be trusted?评估死亡意图的挑战：自杀未遂者是否可信？

Omega (Westport). 2007;55(1):57-70. doi: 10.2190/5867-6510-3388-3517.

The content of suicide notes from attempters and completers.自杀未遂者和自杀成功者遗书的内容。

Crisis. 2007;28(2):102-4. doi: 10.1027/0227-5910.28.2.102.

A review of feature selection techniques in bioinformatics.生物信息学中特征选择技术综述。

Bioinformatics. 2007 Oct 1;23(19):2507-17. doi: 10.1093/bioinformatics/btm344. Epub 2007 Aug 24.

The development and validation of statistical prediction rules for discriminating between genuine and simulated suicide notes.

Arch Suicide Res. 2007;11(2):219-33. doi: 10.1080/13811110701250176.

Suicide notes in Mexico: what do they tell us?墨西哥的遗书：它们告诉了我们什么？

Suicide Life Threat Behav. 2006 Dec;36(6):709-15. doi: 10.1521/suli.2006.36.6.709.

Writing characteristics of suicidal people on the Internet: a psychological investigation of emerging social environments.

Suicide Life Threat Behav. 2005 Oct;35(5):507-24. doi: 10.1521/suli.2005.35.5.507.

Suicide note themes and suicide prevention.遗书主题与自杀预防。

Int J Psychiatry Med. 2003;33(4):323-31. doi: 10.2190/T210-E2V5-A5M0-QLJU.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用自然语言处理的自杀遗书分类：一项内容分析

Suicide Note Classification Using Natural Language Processing: A Content Analysis.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献