Crowley Ryan, Parkin Katherine, Rocheteau Emma, Massou Efthalia, Friedmann Yasmin, John Ann, Sippy Rachel, Liò Pietro, Moore Anna
New York University Grossman School of Medicine, New York, US.
Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
BJPsych Open. 2025 Apr 11;11(3):e86. doi: 10.1192/bjo.2025.32.
Rates of childhood mental health problems are increasing in the UK. Early identification of childhood mental health problems is challenging but critical to children's future psychosocial development. This is particularly important for children with social care contact because earlier identification can facilitate earlier intervention. Clinical prediction tools could improve these early intervention efforts.
Characterise a novel cohort consisting of children in social care and develop effective machine learning models for prediction of childhood mental health problems.
We used linked, de-identified data from the Secure Anonymised Information Linkage Databank to create a cohort of 26 820 children in Wales, UK, receiving social care services. Integrating health, social care and education data, we developed several machine learning models aimed at predicting childhood mental health problems. We assessed the performance, interpretability and fairness of these models.
Risk factors strongly associated with childhood mental health problems included age, substance misuse and being a looked after child. The best-performing model, a gradient boosting classifier, achieved an area under the receiver operating characteristic curve of 0.75 (95% CI 0.73-0.78). Assessments of algorithmic fairness showed potential biases within these models.
Machine learning performance on this prediction task was promising. Predictive performance in social care settings can be bolstered by linking diverse routinely collected data-sets, making available a range of heterogenous risk factors relating to clinical, social and environmental exposures.
英国儿童心理健康问题的发生率正在上升。儿童心理健康问题的早期识别具有挑战性,但对儿童未来的心理社会发展至关重要。对于有社会护理接触的儿童来说,这一点尤为重要,因为早期识别可以促进早期干预。临床预测工具可以改善这些早期干预措施。
描述一个由接受社会护理的儿童组成的新队列,并开发有效的机器学习模型来预测儿童心理健康问题。
我们使用来自安全匿名信息链接数据库的关联、去识别化数据,创建了一个由英国威尔士26820名接受社会护理服务的儿童组成的队列。整合健康、社会护理和教育数据,我们开发了几个旨在预测儿童心理健康问题的机器学习模型。我们评估了这些模型的性能、可解释性和公平性。
与儿童心理健康问题密切相关的风险因素包括年龄、药物滥用和受照料儿童身份。表现最佳的模型是梯度提升分类器,其受试者操作特征曲线下面积为0.75(95%CI 0.73-0.78)。算法公平性评估显示这些模型存在潜在偏差。
机器学习在这项预测任务上的表现很有前景。通过链接各种常规收集的数据集,提供一系列与临床、社会和环境暴露相关的异质风险因素,可以提高社会护理环境中的预测性能。