深度学习和机器学习在使用文本挖掘方法识别心境障碍患者自杀企图时的比较。

Comparisons of deep learning and machine learning while using text mining methods to identify suicide attempts of patients with mood disorders.

机构信息

Department of Epidemiology and Health Statistics, School of Public Health, Capital Medical University, 10 Xitoutiao, Youanmen Wai, Beijing 100069, China.

Capital Medical University Affiliated Beijing Anding Hospital, Beijing, China.

出版信息

J Affect Disord. 2022 Nov 15;317:107-113. doi: 10.1016/j.jad.2022.08.054. Epub 2022 Aug 24.

DOI:10.1016/j.jad.2022.08.054

PMID:36029873

Abstract

BACKGROUND

Suicide attempt is one of the most severe consequences for patients with mood disorders. This study aimed to perform deep learning and machine learning while using text mining to identify patients with suicide attempts and to compare their effectiveness.

METHODS

A total of 13,100 patients with mood disorders were selected. Two traditional text mining methods, logistic regression and Support vector machine (SVM), and one deep learning model (Convolutional neural network, CNN) were adopted to perform overall analysis and gender-specific subgroup analysis of patients to identify suicide attempts. The classification effectiveness of these models was evaluated by accuracy, F1-value, precision, recall, and the area under Receiver operator characteristic curve (ROC).

RESULTS

CNN's results were greater than the other two for all indicators except recall which was slightly smaller than SVM in male subgroup analysis. The accuracy values of the CNN were 98.4 %, 98.2 %, and 98.5 % in the overall analysis and the subgroup analysis for males and females, respectively. The results of McNemar's test showed that CNN and SVM models' predictions were statistically different from the logistic regression model's predictions in the overall analysis and the subgroup analysis for females (P < 0.050).

LIMITATIONS

A fixed number of features were selected based on document frequency to train models; this was a single-site study.

CONCLUSIONS

CNN model was a better way to detect suicide attempts in patients with mood disorders prior to hospital admission, saving time and resources in recognizing high-risk patients and preventing suicide.

摘要

背景

自杀企图是心境障碍患者最严重的后果之一。本研究旨在通过文本挖掘进行深度学习和机器学习，以识别有自杀企图的患者，并比较其效果。

方法

共纳入 13100 例心境障碍患者。采用两种传统的文本挖掘方法（逻辑回归和支持向量机）和一种深度学习模型（卷积神经网络）对患者进行总体分析和性别亚组分析，以识别自杀企图。通过准确性、F1 值、精度、召回率和受试者工作特征曲线（ROC）下面积评估这些模型的分类效果。

结果

在所有指标中，除了男性亚组分析中召回率稍低于 SVM 外，CNN 的结果均大于其他两种方法。CNN 在整体分析和男性、女性亚组分析中的准确率分别为 98.4%、98.2%和 98.5%。McNemar 检验结果显示，在整体分析和女性亚组分析中，CNN 和 SVM 模型的预测与逻辑回归模型的预测有统计学差异（P<0.050）。