识别面临家庭外安置风险增加儿童的机器学习算法比较：开发与实际考量

Comparison of Machine Learning Algorithms Identifying Children at Increased Risk of Out-of-Home Placement: Development and Practical Considerations.

作者信息

Gorham Tyler J, Hardy Rose Y, Ciccone David, Chisolm Deena J

机构信息

IT Research & Innovation, The Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio, USA.

Center for Child Health Equity and Outcomes Research, The Abigail Wexner Research Institute at Nationwide Children's Hospital, Columbus, Ohio, USA.

出版信息

Health Serv Res. 2025 Aug;60(4):e14601. doi: 10.1111/1475-6773.14601. Epub 2025 Mar 6.

DOI:10.1111/1475-6773.14601

PMID:40047796

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12277119/

Abstract

OBJECTIVE

To develop a machine learning (ML) algorithm capable of identifying children at risk of out-of-home placement among a Medicaid-insured population.

STUDY SETTING AND DESIGN

The study population includes children enrolled in a Medicaid accountable care organization between 2018 and 2022 in two nonurban Ohio counties served by the Centers for Medicare and Medicaid Services-funded Integrated Care for Kids Model. Using a retrospective cohort, we developed and compared a set of ML algorithms to identify children at risk of out-of-home placement within one year. ML algorithms tested include least absolute shrinkage and selection operator (LASSO)-regularized logistic regression and eXtreme gradient-boosted trees (XGBoost). We compared both modeling approaches with and without race as a candidate predictor. Performance metrics included the area under the receiver operating characteristic curve (AUROC) and the corrected partial AUROC at specificities ≥ 90% (pAUROC). Algorithmic bias was tested by comparing pAUROC across each model between Black and White children.

DATA SOURCES AND ANALYTIC SAMPLE

The modeling dataset was comprised of Medicaid claims and patient demographics data from Partners For Kids, a pediatric accountable care organization.

PRINCIPAL FINDINGS

Overall, XGBoost models outperformed LASSO models. When race was included in the model, XGBoost had an AUROC of 0.78 (95% confidence interval [CI]: 0.77-0.79) while the LASSO model had an AUROC of 0.75 (95% CI: 0.74-0.77). When race was excluded from the model, XGBoost had an AUROC of 0.76 (95% CI: 0.74-0.77) while LASSO had an AUROC of 0.73 (95% CI: 0.72-0.74).

CONCLUSIONS

The more complex XGBoost outperformed the simpler LASSO in predicting out-of-home placement and had less evidence of racial bias. This study highlights the complexities of developing predictive models in systems with known racial disparities and illustrates what can be accomplished when ML developers and policy leaders collaborate to maximize data to meet the needs of children and families.

摘要

目的

开发一种机器学习（ML）算法，能够在医疗补助参保人群中识别有家庭外安置风险的儿童。

研究背景与设计

研究人群包括2018年至2022年期间在俄亥俄州两个非城市县参加医疗补助责任医疗组织的儿童，这些县由医疗保险和医疗补助服务中心资助的儿童综合护理模式提供服务。我们采用回顾性队列研究，开发并比较了一组ML算法，以识别一年内有家庭外安置风险的儿童。测试的ML算法包括最小绝对收缩和选择算子（LASSO）正则化逻辑回归和极端梯度提升树（XGBoost）。我们比较了将种族作为候选预测变量和不将种族作为候选预测变量的两种建模方法。性能指标包括受试者操作特征曲线下面积（AUROC）和特异性≥90%时的校正部分AUROC（pAUROC）。通过比较黑人和白人儿童在每个模型中的pAUROC来测试算法偏差。

数据来源与分析样本

建模数据集由儿科责任医疗组织“儿童伙伴”的医疗补助索赔和患者人口统计学数据组成。

主要发现

总体而言，XGBoost模型优于LASSO模型。当模型中纳入种族因素时，XGBoost的AUROC为0.78（95%置信区间[CI]：0.77 - 0.79），而LASSO模型的AUROC为0.75（95% CI：0.74 - 0.77）。当模型中排除种族因素时，XGBoost的AUROC为0.76（95% CI：0.74 - 0.77），而LASSO的AUROC为0.73（95% CI：0.72 - 0.74）。

结论

在预测家庭外安置方面，更复杂的XGBoost优于更简单的LASSO，且种族偏差证据较少。本研究突出了在存在已知种族差异的系统中开发预测模型的复杂性，并说明了当ML开发者和政策领导者合作以最大化数据来满足儿童和家庭需求时所能取得的成果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c703/12277119/b72cffe43a42/HESR-60-0-g001.jpg

相似文献

Comparison of Machine Learning Algorithms Identifying Children at Increased Risk of Out-of-Home Placement: Development and Practical Considerations.识别面临家庭外安置风险增加儿童的机器学习算法比较：开发与实际考量

Health Serv Res. 2025 Aug;60(4):e14601. doi: 10.1111/1475-6773.14601. Epub 2025 Mar 6.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

Idiographic Lapse Prediction With State Space Modeling: Algorithm Development and Validation Study.基于状态空间模型的个性化失误预测：算法开发与验证研究

JMIR Form Res. 2025 Jun 3;9:e73265. doi: 10.2196/73265.

Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证

BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.

Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能？开发一种互联网应用算法。

Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.

Comparing the performance of screening surveys versus predictive models in identifying patients in need of health-related social need services in the emergency department.比较筛查调查与预测模型在识别急诊科需要健康相关社会需求服务的患者方面的表现。

PLoS One. 2024 Nov 20;19(11):e0312193. doi: 10.1371/journal.pone.0312193. eCollection 2024.

Predicting mortality risk following major lower extremity amputation using machine learning.使用机器学习预测下肢大截肢后的死亡风险。

J Vasc Surg. 2025 May 1. doi: 10.1016/j.jvs.2025.03.198.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Are Detailed, Patient-level Social Determinant of Health Factors Associated With Physical Function and Mental Health at Presentation Among New Patients With Orthopaedic Conditions?详细的患者层面的健康社会决定因素是否与新骨科患者就诊时的身体功能和心理健康相关？

Clin Orthop Relat Res. 2023 May 1;481(5):912-921. doi: 10.1097/CORR.0000000000002446. Epub 2022 Oct 6.

本文引用的文献

Predicting Successful Placements for Youth in Child Welfare with Machine Learning.利用机器学习预测儿童福利领域青少年的成功安置情况。

Child Youth Serv Rev. 2023 Oct;153. doi: 10.1016/j.childyouth.2023.107117. Epub 2023 Aug 4.

Easy to use and validated predictive models to identify beneficiaries experiencing homelessness in Medicaid administrative data.易于使用且经过验证的预测模型，可用于识别医疗补助管理数据中无家可归的受益人群。

Health Serv Res. 2023 Aug;58(4):882-893. doi: 10.1111/1475-6773.14143. Epub 2023 Feb 28.

Ethical artificial intelligence in paediatrics.儿科中的伦理人工智能

Lancet Child Adolesc Health. 2022 Dec;6(12):833-835. doi: 10.1016/S2352-4642(22)00243-7. Epub 2022 Sep 7.

Construction of the Ohio Children's Opportunity Index.俄亥俄州儿童机会指数的构建。

Front Public Health. 2022 Jul 22;10:734105. doi: 10.3389/fpubh.2022.734105. eCollection 2022.

Eliminating Race-Based Medicine.消除基于种族的医学。

Pediatrics. 2022 Jul 1;150(1). doi: 10.1542/peds.2022-057998.

Understanding the bias in machine learning systems for cardiovascular disease risk assessment: The first of its kind review.理解机器学习系统在心血管疾病风险评估中的偏差：首例此类综述。

Comput Biol Med. 2022 Mar;142:105204. doi: 10.1016/j.compbiomed.2021.105204. Epub 2022 Jan 4.

Long-term Health and Social Outcomes in Children and Adolescents Placed in Out-of-Home Care.儿童和青少年被安置在家庭之外的长期健康和社会结果。

JAMA Pediatr. 2022 Jan 1;176(1):e214324. doi: 10.1001/jamapediatrics.2021.4324. Epub 2022 Jan 4.

A Unifying Approach for GFR Estimation: Recommendations of the NKF-ASN Task Force on Reassessing the Inclusion of Race in Diagnosing Kidney Disease.一种统一的肾小球滤过率估计方法：NKF-ASN 工作组关于重新评估种族在诊断肾脏疾病中的纳入的建议。

Am J Kidney Dis. 2022 Feb;79(2):268-288.e1. doi: 10.1053/j.ajkd.2021.08.003. Epub 2021 Sep 23.

The importance of child abuse and neglect in adult medicine.儿童虐待和忽视在成人医学中的重要性。

Pharmacol Biochem Behav. 2021 Dec;211:173268. doi: 10.1016/j.pbb.2021.173268. Epub 2021 Sep 7.

Predicting Future Care Requirements Using Machine Learning for Pediatric Intensive and Routine Care Inpatients.使用机器学习预测儿科重症和常规护理住院患者未来的护理需求。

Crit Care Explor. 2021 Aug 10;3(8):e0505. doi: 10.1097/CCE.0000000000000505. eCollection 2021 Aug.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

识别面临家庭外安置风险增加儿童的机器学习算法比较：开发与实际考量

Comparison of Machine Learning Algorithms Identifying Children at Increased Risk of Out-of-Home Placement: Development and Practical Considerations.

作者信息

机构信息

出版信息

OBJECTIVE

STUDY SETTING AND DESIGN

DATA SOURCES AND ANALYTIC SAMPLE

PRINCIPAL FINDINGS

CONCLUSIONS

目的

研究背景与设计

数据来源与分析样本

主要发现

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献