Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, United States of America.
Pediatrics-Public Health, Baylor College of Medicine, Houston, TX, United States of America.
PLoS One. 2021 Feb 26;16(2):e0247404. doi: 10.1371/journal.pone.0247404. eCollection 2021.
Child physical abuse is a leading cause of traumatic injury and death in children. In 2017, child abuse was responsible for 1688 fatalities in the United States, of 3.5 million children referred to Child Protection Services and 674,000 substantiated victims. While large referral hospitals maintain teams trained in Child Abuse Pediatrics, smaller community hospitals often do not have such dedicated resources to evaluate patients for potential abuse. Moreover, identification of abuse has a low margin of error, as false positive identifications lead to unwarranted separations, while false negatives allow dangerous situations to continue. This context makes the consistent detection of and response to abuse difficult, particularly given subtle signs in young, non-verbal patients. Here, we describe the development of artificial intelligence algorithms that use unstructured free-text in the electronic medical record-including notes from physicians, nurses, and social workers-to identify children who are suspected victims of physical abuse. Importantly, only the notes from time of first encounter (e.g.: birth, routine visit, sickness) to the last record before child protection team involvement were used. This allowed us to develop an algorithm using only information available prior to referral to the specialized child protection team. The study was performed in a multi-center referral pediatric hospital on patients screened for abuse within five different locations between 2015 and 2019. Of 1123 patients, 867 records were available after data cleaning and processing, and 55% were abuse-positive as determined by a multi-disciplinary team of clinical professionals. These electronic medical records were encoded with three natural language processing (NLP) algorithms-Bag of Words (BOW), Word Embeddings (WE), and Rules-Based (RB)-and used to train multiple neural network architectures. The BOW and WE encodings utilize the full free-text, while RB selects crucial phrases as identified by physicians. The best architecture was selected by average classification accuracy for the best performing model from each train-test split of a cross-validation experiment. Natural language processing coupled with neural networks detected cases of likely child abuse using only information available to clinicians prior to child protection team referral with average accuracy of 0.90±0.02 and average area under the receiver operator characteristic curve (ROC-AUC) 0.93±0.02 for the best performing Bag of Words models. The best performing rules-based models achieved average accuracy of 0.77±0.04 and average ROC-AUC 0.81±0.05, while a Word Embeddings strategy was severely limited by lack of representative embeddings. Importantly, the best performing model had a false positive rate of 8%, as compared to rates of 20% or higher in previously reported studies. This artificial intelligence approach can help screen patients for whom an abuse concern exists and streamline the identification of patients who may benefit from referral to a child protection team. Furthermore, this approach could be applied to develop computer-aided-diagnosis platforms for the challenging and often intractable problem of reliably identifying pediatric patients suffering from physical abuse.
儿童身体虐待是导致儿童创伤性损伤和死亡的主要原因。2017 年,儿童虐待在美国造成 1688 人死亡,在被儿童保护服务机构转介的 350 万名儿童中,有 674000 人被证实为受害者。虽然大型转诊医院拥有接受过儿童虐待儿科培训的团队,但较小的社区医院通常没有专门的资源来评估潜在虐待的患者。此外,虐待的识别错误率很低,因为错误的阳性识别会导致不必要的分离,而错误的阴性识别会让危险情况继续存在。这种情况使得虐待的持续检测和应对变得困难,特别是对于年轻、无法言语的患者来说,虐待的迹象更为微妙。在这里,我们描述了人工智能算法的开发,该算法使用电子病历中的非结构化自由文本——包括医生、护士和社会工作者的笔记——来识别疑似遭受身体虐待的儿童。重要的是,只使用儿童保护团队介入前的首次就诊(例如:出生、常规就诊、生病)到儿童保护团队介入前的最后一次就诊的笔记。这使得我们能够仅使用在转介给专门的儿童保护团队之前可获得的信息来开发算法。该研究在一家多中心转诊儿科医院进行,在 2015 年至 2019 年期间,对五个不同地点筛查出的虐待儿童患者进行了研究。在 1123 名患者中,经过数据清理和处理后,有 867 份记录可用,其中 55%被多学科临床专业人员确定为虐待阳性。这些电子病历分别使用三种自然语言处理(NLP)算法——词袋(BOW)、词嵌入(WE)和基于规则(RB)进行编码,并用于训练多个神经网络架构。BOW 和 WE 编码使用完整的自由文本,而 RB 则选择医生识别的关键短语。通过交叉验证实验的每个训练-测试分割的最佳表现模型的平均分类准确性来选择最佳架构。自然语言处理与神经网络相结合,仅使用儿童保护团队转介前临床医生可获得的信息,使用平均准确率为 0.90±0.02 和平均接收者操作特征曲线(ROC-AUC)为 0.93±0.02 的词袋模型检测可能的儿童虐待病例。表现最好的基于规则的模型的平均准确率为 0.77±0.04,平均 ROC-AUC 为 0.81±0.05,而词嵌入策略受到代表性嵌入缺失的严重限制。重要的是,与之前报道的研究中 20%或更高的比率相比,该人工智能方法的假阳性率为 8%。这种人工智能方法可以帮助筛选出存在虐待问题的患者,并简化识别可能受益于转介给儿童保护团队的患者的过程。此外,该方法可应用于开发用于可靠识别遭受身体虐待的儿科患者的计算机辅助诊断平台,这些患者的情况具有挑战性,且通常难以解决。