Brain and Mind Centre, University of Sydney, 94 Mallett Street, Camperdown, Sydney, NSW, 2042, Australia.
Discipline of Psychiatry, University of New South Wales, Sydney, Australia.
Soc Psychiatry Psychiatr Epidemiol. 2023 Jun;58(6):893-905. doi: 10.1007/s00127-022-02415-7. Epub 2023 Feb 28.
Machine learning (ML) has shown promise in modelling future self-harm but is yet to be applied to key questions facing clinical services. In a cohort of young people accessing primary mental health care, this study aimed to establish (1) the performance of models predicting deliberate self-harm (DSH) compared to suicide attempt (SA), (2) the performance of models predicting new-onset or repeat behaviour, and (3) the relative importance of factors predicting these outcomes.
802 young people aged 12-25 years attending primary mental health services had detailed social and clinical assessments at baseline and 509 completed 12-month follow-up. Four ML algorithms, as well as logistic regression, were applied to build four distinct models.
The mean performance of models predicting SA (AUC: 0.82) performed better than the models predicting DSH (AUC: 0.72), with mean positive predictive values (PPV) approximately twice that of the prevalence (SA prevalence 14%, PPV: 0.32, DSH prevalence 22%, PPV: 0.40). All ML models outperformed standard logistic regression. The most frequently selected variable in both models was a history of DSH via cutting.
History of DSH and clinical symptoms of common mental disorders, rather than social and demographic factors, were the most important variables in modelling future behaviour. The performance of models predicting outcomes in key sub-cohorts, those with new-onset or repetition of DSH or SA during follow-up, was poor. These findings may indicate that the performance of models of future DSH or SA may depend on knowledge of the individual's recent history of either behaviour.
机器学习(ML)在预测未来的自我伤害方面显示出了前景,但尚未应用于临床服务所面临的关键问题。在一个接受初级心理健康护理的年轻人队列中,本研究旨在确定:(1)预测故意自我伤害(DSH)与自杀企图(SA)的模型的表现,(2)预测新发病或重复行为的模型的表现,以及(3)预测这些结果的因素的相对重要性。
802 名年龄在 12-25 岁之间的年轻人在基线时接受了详细的社会和临床评估,其中 509 人完成了 12 个月的随访。应用了四种 ML 算法和逻辑回归来构建四个不同的模型。
预测 SA(AUC:0.82)的模型的平均性能优于预测 DSH(AUC:0.72)的模型,其平均阳性预测值(PPV)约为流行率的两倍(SA 流行率为 14%,PPV:0.32,DSH 流行率为 22%,PPV:0.40)。所有 ML 模型均优于标准逻辑回归。两种模型中最常被选中的变量都是通过切割导致的 DSH 病史。
DSH 病史和常见精神障碍的临床症状,而不是社会和人口统计学因素,是建模未来行为的最重要变量。在预测关键亚队列(随访期间新发病或重复 DSH 或 SA)结果的模型中,表现不佳。这些发现可能表明,未来 DSH 或 SA 模型的性能可能取决于对个体最近 DSH 或行为史的了解。