Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, Cornell, USA.
Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, Cornell, USA.
Transl Psychiatry. 2024 Jul 31;14(1):316. doi: 10.1038/s41398-024-03034-3.
Machine Learning models trained from real-world data have demonstrated promise in predicting suicide attempts in adolescents. However, their transportability, namely the performance of a model trained on one dataset and applied to different data, is largely unknown, hindering the clinical adoption of these models. Here we developed different machine learning-based suicide prediction models based on real-world data collected in different contexts (inpatient, outpatient, and all encounters) with varying purposes (administrative claims and electronic health records), and compared their cross-data performance. The three datasets used were the All-Payer Claims Database in Connecticut, the Hospital Inpatient Discharge Database in Connecticut, and the Electronic Health Records data provided by the Kansas Health Information Network. We included 285,320 patients among whom we identified 3389 (1.2%) suicide attempters and 66% of the suicide attempters were female. Different machine learning models were evaluated on source datasets where models were trained and then applied to target datasets. More complex models, particularly deep long short-term memory neural network models, did not outperform simpler regularized logistic regression models in terms of both local and transported performance. Transported models exhibited varying performance, showing drops or even improvements compared to their source performance. While they can achieve satisfactory transported performance, they are usually upper-bounded by the best performance of locally developed models, and they can identify additional new cases in target data. Our study uncovers complex transportability patterns and could facilitate the development of suicide prediction models with better performance and generalizability.
基于真实世界数据训练的机器学习模型在预测青少年自杀尝试方面表现出了一定的潜力。然而,这些模型的可迁移性(即在一个数据集上训练的模型在不同数据上的性能)在很大程度上是未知的,这阻碍了这些模型在临床中的应用。在这里,我们基于在不同背景(住院、门诊和所有就诊)下收集的、具有不同目的(行政索赔和电子健康记录)的真实世界数据开发了不同的基于机器学习的自杀预测模型,并比较了它们在跨数据上的性能。使用的三个数据集是康涅狄格州的全民支付索赔数据库、康涅狄格州的医院住院数据库和堪萨斯州健康信息网络提供的电子健康记录数据。我们纳入了 285320 名患者,其中 3389 名(1.2%)为自杀未遂者,66%的自杀未遂者为女性。不同的机器学习模型在模型训练的源数据集上进行评估,然后应用于目标数据集。在本地和迁移性能方面,更复杂的模型,特别是深度长短期记忆神经网络模型,并不优于简单的正则化逻辑回归模型。迁移模型表现出不同的性能,与源性能相比,出现了下降甚至提高。虽然它们可以实现令人满意的迁移性能,但它们通常受到本地开发模型最佳性能的限制,并且可以在目标数据中识别出更多的新病例。我们的研究揭示了复杂的可迁移性模式,这有助于开发具有更好性能和通用性的自杀预测模型。