School of Social Policy & Practice, University of Pennsylvania, Philadelphia, PA, United States.
Columbia University Irving Medical Center, Department of Emergency Medicine, New York, NY, United States.
Stud Health Technol Inform. 2024 Aug 22;316:1652-1656. doi: 10.3233/SHTI240740.
Emergency departments (EDs) are pivotal in detecting child abuse and neglect, but this task is often complex. Our study developed a machine learning model using structured and unstructured electronic health record (EHR) data to predict when children in EDs might need intervention from child protective services. We used a case-control study design, analyzing data from a pediatric ED. Clinical notes were processed with natural language processing (NLP) techniques to identify suspected cases and matched in a 1:9 ratio to ensure dataset balance. The features from these notes were combined with structured EHR data to construct a model using the XGBoost algorithm. The model achieved a precision of 0.95, recall of 0.88, and F1-score of 0.92, with improvements seen from integrating NLP-derived data. Key indicators for abuse included hospital admissions, extended ED stays, and specific clinical orders. The model's accuracy and the utility of NLP suggest the potential for EDs to better identify at-risk children. Future work should validate the model further and explore additional features while considering ethical implications to aid healthcare providers in safeguarding children.
急诊科在发现儿童虐待和忽视方面起着关键作用,但这项任务通常很复杂。我们的研究使用结构化和非结构化电子健康记录(EHR)数据开发了一种机器学习模型,以预测急诊科的儿童何时可能需要儿童保护服务的干预。我们使用病例对照研究设计,分析儿科急诊科的数据。使用自然语言处理(NLP)技术处理临床记录,以识别疑似病例,并以 1:9 的比例匹配,以确保数据集平衡。这些记录中的特征与结构化 EHR 数据相结合,使用 XGBoost 算法构建模型。该模型的精度为 0.95,召回率为 0.88,F1 得分为 0.92,通过整合 NLP 衍生数据可以看到改进。虐待的关键指标包括住院、延长急诊停留时间和特定的临床医嘱。该模型的准确性和 NLP 的实用性表明,急诊科有可能更好地识别高危儿童。未来的工作应该进一步验证该模型,并在考虑到伦理影响的情况下探索其他功能,以帮助医疗保健提供者保护儿童。