Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, TRW, 4th Floor, Room 4D66, 3280 Hospital Drive NW, Calgary, Alberta, Canada.
Hotchkiss Brain Institute, Department of Psychiatry, Cumming School of Medicine, University of Calgary, TRW, 4th Floor, Room 4D67, 3280 Hospital Drive NW, Calgary, Alberta, Canada.
J Affect Disord. 2020 Mar 1;264:107-114. doi: 10.1016/j.jad.2019.12.024. Epub 2019 Dec 14.
Suicide is a leading cause of death, particularly in younger persons, and this results in tremendous years of life lost.
To compare the performance of recurrent neural networks, one-dimensional convolutional neural networks, and gradient boosted trees, with logistic regression and feedforward neural networks.
The modeling dataset contained 3548 persons that died by suicide and 35,480 persons that did not die by suicide between 2000 and 2016. 101 predictors were selected, and these were assembled for each of the 40 quarters (10 years) prior to the quarter of death, resulting in 4040 predictors in total for each person. Model configurations were evaluated using 10-fold cross-validation.
The optimal recurrent neural network model configuration (AUC: 0.8407), one-dimensional convolutional neural network configuration (AUC: 0.8419), and XGB model configuration (AUC: 0.8493) all outperformed logistic regression (AUC: 0.8179). In addition to superior discrimination, the optimal XGB model configuration also achieved superior calibration.
Although the models developed in this study showed promise, further research is needed to determine the performance limits of statistical and machine learning models that quantify suicide risk, and to develop prediction models optimized for implementation in clinical settings. It appears that the XGB model class is the most promising in terms of discrimination, calibration, and computational expense.
Many important predictors are not available in administrative data and this likely places a limit on how well prediction models developed with administrative data can perform.
自杀是导致死亡的主要原因,尤其是在年轻人中,这导致了大量生命的丧失。
比较递归神经网络、一维卷积神经网络和梯度提升树与逻辑回归和前馈神经网络的性能。
建模数据集包含 2000 年至 2016 年间自杀死亡的 3548 人和未自杀的 35480 人。选择了 101 个预测因子,并为死亡季度前的每 10 年的 40 个季度(40 个季度)组装了这些预测因子,从而为每个人总共组装了 4040 个预测因子。使用 10 折交叉验证评估模型配置。
最佳递归神经网络模型配置(AUC:0.8407)、一维卷积神经网络配置(AUC:0.8419)和 XGB 模型配置(AUC:0.8493)均优于逻辑回归(AUC:0.8179)。除了优越的区分能力外,最佳 XGB 模型配置还实现了优越的校准。
尽管本研究开发的模型显示出了一定的前景,但仍需要进一步研究,以确定量化自杀风险的统计和机器学习模型的性能极限,并开发优化用于临床环境实施的预测模型。XGB 模型类别在区分、校准和计算成本方面似乎最有前途。
许多重要的预测因子在行政数据中不可用,这可能限制了使用行政数据开发的预测模型的性能。