Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation.

Authors

Cusick Marika, Adekkanattu Prakash, Campion Thomas R, Sholle Evan T, Myers Annie, Banerjee Samprit, Alexopoulos George, Wang Yanshan, Pathak Jyotishman

Affiliations

Department of Information and Technology Services, Weill Cornell Medicine, New York, USA; Department of Population Health Sciences, Weill Cornell Medicine, New York, USA.

Department of Information and Technology Services, Weill Cornell Medicine, New York, USA.

Publication information

J Psychiatr Res. 2021 Apr;136:95-102. doi: 10.1016/j.jpsychires.2021.01.052. Epub 2021 Feb 2.

DOI:10.1016/j.jpsychires.2021.01.052

PMID:33581461

Original article link: https://pmc.ncbi.nlm.nih.gov/articles/PMC8009838/

Abstract

Mental health concerns, such as suicidal thoughts, are frequently documented by providers in clinical notes rather than in structured coded data. In this study, we evaluated weakly supervised methods for detecting "current" suicidal ideation from unstructured clinical notes in electronic health record (EHR) systems. Weakly supervised machine learning methods leverage imperfect labels for training, alleviating the burden of creating a large manually annotated dataset. After identifying a cohort of 600 patients at risk for suicidal ideation, we used a rule-based natural language processing (NLP) approach to label the training and validation notes (n = 17,978). Using this large corpus of clinical notes, we trained several statistical machine learning models (a logistic classifier, support vector machines (SVM), and a Naive Bayes classifier) and one deep learning model, namely a text classification convolutional neural network (CNN), and evaluated them on a manually reviewed test set (n = 837). The CNN model outperformed all other methods, achieving an overall accuracy of 94% and an F1-score of 0.82 on documents with "current" suicidal ideation. This algorithm correctly identified an additional 42 encounters and 9 patients indicative of suicidal ideation but missing a structured diagnosis code. When applied to a random subset of 5,000 clinical notes, the algorithm classified 0.46% (n = 23) as showing "current" suicidal ideation, of which 87% were truly indicative on manual review. Implementation of this approach for large-scale document screening may play an important role in point-of-care clinical information systems for targeted suicide prevention interventions and may improve research on the pathways from ideation to attempt.
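
The weak supervision workflow described in the abstract (rule-based labels, a statistical classifier trained on those labels, and F1 evaluation) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the regular expressions, the example notes, and the helper names (`weak_label`, `train_naive_bayes`, `predict`, `f1_score`) are all hypothetical, and a small Naive Bayes model stands in for the baselines; the CNN is not reproduced here.

```python
import math
import re
from collections import Counter

# Hypothetical rule patterns, for illustration only; the abstract does not
# list the actual rules used by the study.
IDEATION = re.compile(r"\b(suicidal ideation|suicidal thoughts|wants to die)\b", re.I)
NEGATED = re.compile(r"\b(denies|no|without)\b[^.]*\b(suicidal ideation|suicidal thoughts)\b", re.I)
HISTORICAL = re.compile(r"\b(history of|past|previous)\b[^.]*\b(suicidal ideation|suicidal thoughts)\b", re.I)


def weak_label(note):
    """Return 1 if any sentence mentions ideation that is neither negated nor historical."""
    for sentence in re.split(r"[.!?]", note):
        if (IDEATION.search(sentence)
                and not NEGATED.search(sentence)
                and not HISTORICAL.search(sentence)):
            return 1
    return 0


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train_naive_bayes(notes, labels):
    """Collect per-class word counts from (imperfect) weak labels."""
    word_counts = {0: Counter(), 1: Counter()}
    class_counts = Counter(labels)
    for note, label in zip(notes, labels):
        word_counts[label].update(tokenize(note))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, class_counts, vocab


def predict(model, note):
    """Multinomial Naive Bayes prediction with add-one smoothing."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    scores = {}
    for c in (0, 1):
        denom = sum(word_counts[c].values()) + len(vocab)
        score = math.log((class_counts[c] + 1) / (total + 2))
        for tok in tokenize(note):
            if tok in vocab:  # ignore words never seen in training
                score += math.log((word_counts[c][tok] + 1) / denom)
        scores[c] = score
    return max(scores, key=scores.get)


def f1_score(y_true, y_pred):
    """F1 for the positive ("current" ideation) class."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up training notes; labels come from the rules, not from human review.
train_notes = [
    "Patient reports suicidal ideation today.",
    "Patient endorses suicidal thoughts with a plan.",
    "Patient denies suicidal ideation.",
    "Follow-up visit for hypertension, no acute concerns.",
]
weak_labels = [weak_label(n) for n in train_notes]
model = train_naive_bayes(train_notes, weak_labels)
```

In the study's setup, the model trained on weak labels is then scored against a separate, manually reviewed test set, which is what `f1_score` would be applied to here.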

Similar articles

Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation.

J Psychiatr Res. 2021 Apr;136:95-102. doi: 10.1016/j.jpsychires.2021.01.052. Epub 2021 Feb 2.

A clinical text classification paradigm using weak supervision and deep representation.

BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.

Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.

J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.

Identifying Suicidal Ideation and Attempt From Clinical Notes Within a Large Integrated Health Care System.

Perm J. 2022 Apr 5;26(1):85-93. doi: 10.7812/TPP/21.102.

Prediction of suicidal ideation in children and adolescents using machine learning and deep learning algorithm: A case study in South Korea where suicide is the leading cause of death.

Asian J Psychiatr. 2023 Oct;88:103725. doi: 10.1016/j.ajp.2023.103725. Epub 2023 Aug 6.

Detecting Suicidal Ideation on Forums: Proof-of-Concept Study.

J Med Internet Res. 2018 Jun 21;20(6):e215. doi: 10.2196/jmir.9840.

Detecting and Analyzing Suicidal Ideation on Social Media Using Deep Learning and Machine Learning Models.

Int J Environ Res Public Health. 2022 Oct 3;19(19):12635. doi: 10.3390/ijerph191912635.

Convolutional Neural Network-Based Deep Learning Model for Predicting Differential Suicidality in Depressive Patients Using Brain Generalized q-Sampling Imaging.

J Clin Psychiatry. 2021 Feb 23;82(2):19m13225. doi: 10.4088/JCP.19m13225.

Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.

J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.

Identification of asthma control factor in clinical notes using a hybrid deep learning model.

BMC Med Inform Decis Mak. 2021 Nov 9;21(Suppl 7):272. doi: 10.1186/s12911-021-01633-4.

Cited by

Multi-Label Classification with Generative AI Models in Healthcare: A Case Study of Suicidality and Risk Factors.

ArXiv. 2025 Jul 22:arXiv:2507.17009v1.

Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models.

AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:260-269. eCollection 2025.

Logic-driven Indirect Supervision: An Application to Crisis Counseling.

Proc Conf Assoc Comput Linguist Meet. 2023 Jul;2023:11704-11722. doi: 10.18653/v1/2023.acl-long.654.

Leveraging large language models for knowledge-free weak supervision in clinical natural language processing.

Sci Rep. 2025 Mar 10;15(1):8241. doi: 10.1038/s41598-024-68168-2.

Artificial Intelligence in Psychiatry: A Review of Biological and Behavioral Data Analyses.

Diagnostics (Basel). 2025 Feb 11;15(4):434. doi: 10.3390/diagnostics15040434.

Validation of a machine learning model for indirect screening of suicidal ideation in the general population.

Sci Rep. 2025 Feb 24;15(1):6579. doi: 10.1038/s41598-025-90718-5.

Post-discharge suicide prediction among US veterans using natural language processing-enriched social and behavioral determinants of health.

Npj Ment Health Res. 2025 Feb 22;4(1):8. doi: 10.1038/s44184-025-00120-2.

Using Natural Language Processing Methods to Predict Topics Included in 2019 Ohio Syphilis Disease Intervention Specialist Records.

Sex Transm Dis. 2025 Jun 1;52(6):356-363. doi: 10.1097/OLQ.0000000000002135. Epub 2025 Feb 11.

The Goldilocks Zone: Finding the right balance of user and institutional risk for suicide-related generative AI queries.

PLOS Digit Health. 2025 Jan 8;4(1):e0000711. doi: 10.1371/journal.pdig.0000711. eCollection 2025 Jan.

Automatically extracting social determinants of health for suicide: a narrative literature review.

Npj Ment Health Res. 2024 Nov 6;3(1):51. doi: 10.1038/s44184-024-00087-6.
