Suppr超能文献

利用医疗保健系统数据预测自杀死亡:递归神经网络、一维卷积神经网络和梯度提升树模型能否提高预测性能?

Predicting death by suicide using administrative health care system data: Can recurrent neural network, one-dimensional convolutional neural network, and gradient boosted trees models improve prediction performance?

机构信息

Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, TRW, 4th Floor, Room 4D66, 3280 Hospital Drive NW, Calgary, Alberta, Canada.

Hotchkiss Brain Institute, Department of Psychiatry, Cumming School of Medicine, University of Calgary, TRW, 4th Floor, Room 4D67, 3280 Hospital Drive NW, Calgary, Alberta, Canada.

出版信息

J Affect Disord. 2020 Mar 1;264:107-114. doi: 10.1016/j.jad.2019.12.024. Epub 2019 Dec 14.

Abstract

BACKGROUND

Suicide is a leading cause of death, particularly in younger persons, and this results in tremendous years of life lost.

OBJECTIVE

To compare the performance of recurrent neural networks, one-dimensional convolutional neural networks, and gradient boosted trees, with logistic regression and feedforward neural networks.

METHODS

The modeling dataset contained 3548 persons that died by suicide and 35,480 persons that did not die by suicide between 2000 and 2016. 101 predictors were selected, and these were assembled for each of the 40 quarters (10 years) prior to the quarter of death, resulting in 4040 predictors in total for each person. Model configurations were evaluated using 10-fold cross-validation.

RESULTS

The optimal recurrent neural network model configuration (AUC: 0.8407), one-dimensional convolutional neural network configuration (AUC: 0.8419), and XGB model configuration (AUC: 0.8493) all outperformed logistic regression (AUC: 0.8179). In addition to superior discrimination, the optimal XGB model configuration also achieved superior calibration.

CONCLUSIONS

Although the models developed in this study showed promise, further research is needed to determine the performance limits of statistical and machine learning models that quantify suicide risk, and to develop prediction models optimized for implementation in clinical settings. It appears that the XGB model class is the most promising in terms of discrimination, calibration, and computational expense.

LIMITATIONS

Many important predictors are not available in administrative data and this likely places a limit on how well prediction models developed with administrative data can perform.

摘要

背景

自杀是导致死亡的主要原因,尤其是在年轻人中,这导致了大量生命的丧失。

目的

比较递归神经网络、一维卷积神经网络和梯度提升树与逻辑回归和前馈神经网络的性能。

方法

建模数据集包含 2000 年至 2016 年间自杀死亡的 3548 人和未自杀的 35480 人。选择了 101 个预测因子,并为死亡季度前的每 10 年的 40 个季度(40 个季度)组装了这些预测因子,从而为每个人总共组装了 4040 个预测因子。使用 10 折交叉验证评估模型配置。

结果

最佳递归神经网络模型配置(AUC:0.8407)、一维卷积神经网络配置(AUC:0.8419)和 XGB 模型配置(AUC:0.8493)均优于逻辑回归(AUC:0.8179)。除了优越的区分能力外,最佳 XGB 模型配置还实现了优越的校准。

结论

尽管本研究开发的模型显示出了一定的前景,但仍需要进一步研究,以确定量化自杀风险的统计和机器学习模型的性能极限,并开发优化用于临床环境实施的预测模型。XGB 模型类别在区分、校准和计算成本方面似乎最有前途。

局限性

许多重要的预测因子在行政数据中不可用,这可能限制了使用行政数据开发的预测模型的性能。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验