• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于自然语言处理和深度学习的方法,用于从儿科电子病历中识别儿童虐待。

A natural language processing and deep learning approach to identify child abuse from pediatric electronic medical records.

机构信息

Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, United States of America.

Pediatrics-Public Health, Baylor College of Medicine, Houston, TX, United States of America.

出版信息

PLoS One. 2021 Feb 26;16(2):e0247404. doi: 10.1371/journal.pone.0247404. eCollection 2021.

DOI:10.1371/journal.pone.0247404
PMID:33635890
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7909689/
Abstract

Child physical abuse is a leading cause of traumatic injury and death in children. In 2017, child abuse was responsible for 1688 fatalities in the United States, of 3.5 million children referred to Child Protection Services and 674,000 substantiated victims. While large referral hospitals maintain teams trained in Child Abuse Pediatrics, smaller community hospitals often do not have such dedicated resources to evaluate patients for potential abuse. Moreover, identification of abuse has a low margin of error, as false positive identifications lead to unwarranted separations, while false negatives allow dangerous situations to continue. This context makes the consistent detection of and response to abuse difficult, particularly given subtle signs in young, non-verbal patients. Here, we describe the development of artificial intelligence algorithms that use unstructured free-text in the electronic medical record-including notes from physicians, nurses, and social workers-to identify children who are suspected victims of physical abuse. Importantly, only the notes from time of first encounter (e.g.: birth, routine visit, sickness) to the last record before child protection team involvement were used. This allowed us to develop an algorithm using only information available prior to referral to the specialized child protection team. The study was performed in a multi-center referral pediatric hospital on patients screened for abuse within five different locations between 2015 and 2019. Of 1123 patients, 867 records were available after data cleaning and processing, and 55% were abuse-positive as determined by a multi-disciplinary team of clinical professionals. These electronic medical records were encoded with three natural language processing (NLP) algorithms-Bag of Words (BOW), Word Embeddings (WE), and Rules-Based (RB)-and used to train multiple neural network architectures. The BOW and WE encodings utilize the full free-text, while RB selects crucial phrases as identified by physicians. The best architecture was selected by average classification accuracy for the best performing model from each train-test split of a cross-validation experiment. Natural language processing coupled with neural networks detected cases of likely child abuse using only information available to clinicians prior to child protection team referral with average accuracy of 0.90±0.02 and average area under the receiver operator characteristic curve (ROC-AUC) 0.93±0.02 for the best performing Bag of Words models. The best performing rules-based models achieved average accuracy of 0.77±0.04 and average ROC-AUC 0.81±0.05, while a Word Embeddings strategy was severely limited by lack of representative embeddings. Importantly, the best performing model had a false positive rate of 8%, as compared to rates of 20% or higher in previously reported studies. This artificial intelligence approach can help screen patients for whom an abuse concern exists and streamline the identification of patients who may benefit from referral to a child protection team. Furthermore, this approach could be applied to develop computer-aided-diagnosis platforms for the challenging and often intractable problem of reliably identifying pediatric patients suffering from physical abuse.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/ddfb5e03af0c/pone.0247404.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/00b4e064c80d/pone.0247404.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/e2ee55526852/pone.0247404.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/10dba9073e4e/pone.0247404.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/9da9f8616418/pone.0247404.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/ddfb5e03af0c/pone.0247404.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/00b4e064c80d/pone.0247404.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/e2ee55526852/pone.0247404.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/10dba9073e4e/pone.0247404.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/9da9f8616418/pone.0247404.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7de6/7909689/ddfb5e03af0c/pone.0247404.g005.jpg
摘要

儿童身体虐待是导致儿童创伤性损伤和死亡的主要原因。2017 年,儿童虐待在美国造成 1688 人死亡,在被儿童保护服务机构转介的 350 万名儿童中,有 674000 人被证实为受害者。虽然大型转诊医院拥有接受过儿童虐待儿科培训的团队,但较小的社区医院通常没有专门的资源来评估潜在虐待的患者。此外,虐待的识别错误率很低,因为错误的阳性识别会导致不必要的分离,而错误的阴性识别会让危险情况继续存在。这种情况使得虐待的持续检测和应对变得困难,特别是对于年轻、无法言语的患者来说,虐待的迹象更为微妙。在这里,我们描述了人工智能算法的开发,该算法使用电子病历中的非结构化自由文本——包括医生、护士和社会工作者的笔记——来识别疑似遭受身体虐待的儿童。重要的是,只使用儿童保护团队介入前的首次就诊(例如:出生、常规就诊、生病)到儿童保护团队介入前的最后一次就诊的笔记。这使得我们能够仅使用在转介给专门的儿童保护团队之前可获得的信息来开发算法。该研究在一家多中心转诊儿科医院进行,在 2015 年至 2019 年期间,对五个不同地点筛查出的虐待儿童患者进行了研究。在 1123 名患者中,经过数据清理和处理后,有 867 份记录可用,其中 55%被多学科临床专业人员确定为虐待阳性。这些电子病历分别使用三种自然语言处理(NLP)算法——词袋(BOW)、词嵌入(WE)和基于规则(RB)进行编码,并用于训练多个神经网络架构。BOW 和 WE 编码使用完整的自由文本,而 RB 则选择医生识别的关键短语。通过交叉验证实验的每个训练-测试分割的最佳表现模型的平均分类准确性来选择最佳架构。自然语言处理与神经网络相结合,仅使用儿童保护团队转介前临床医生可获得的信息,使用平均准确率为 0.90±0.02 和平均接收者操作特征曲线(ROC-AUC)为 0.93±0.02 的词袋模型检测可能的儿童虐待病例。表现最好的基于规则的模型的平均准确率为 0.77±0.04,平均 ROC-AUC 为 0.81±0.05,而词嵌入策略受到代表性嵌入缺失的严重限制。重要的是,与之前报道的研究中 20%或更高的比率相比,该人工智能方法的假阳性率为 8%。这种人工智能方法可以帮助筛选出存在虐待问题的患者,并简化识别可能受益于转介给儿童保护团队的患者的过程。此外,该方法可应用于开发用于可靠识别遭受身体虐待的儿科患者的计算机辅助诊断平台,这些患者的情况具有挑战性,且通常难以解决。

相似文献

1
A natural language processing and deep learning approach to identify child abuse from pediatric electronic medical records.一种基于自然语言处理和深度学习的方法,用于从儿科电子病历中识别儿童虐待。
PLoS One. 2021 Feb 26;16(2):e0247404. doi: 10.1371/journal.pone.0247404. eCollection 2021.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
4
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.
5
Using deep learning and natural language processing models to detect child physical abuse.利用深度学习和自然语言处理模型检测儿童身体虐待。
J Pediatr Surg. 2021 Dec;56(12):2326-2332. doi: 10.1016/j.jpedsurg.2021.03.007. Epub 2021 Mar 18.
6
Natural language processing of admission notes to predict severe maternal morbidity during the delivery encounter.入院记录的自然语言处理预测分娩时严重产妇发病率。
Am J Obstet Gynecol. 2022 Sep;227(3):511.e1-511.e8. doi: 10.1016/j.ajog.2022.04.008. Epub 2022 Apr 14.
7
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
8
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
9
Identifying epilepsy surgery referral candidates with natural language processing in an Australian context.在澳大利亚语境下,利用自然语言处理技术识别癫痫手术转诊候选人。
Epilepsia Open. 2024 Apr;9(2):635-642. doi: 10.1002/epi4.12901. Epub 2024 Jan 23.
10
Initial development of tools to identify child abuse and neglect in pediatric primary care.儿科初级保健中识别儿童虐待和忽视工具的初步开发。
BMC Med Inform Decis Mak. 2023 Nov 17;23(1):266. doi: 10.1186/s12911-023-02361-7.

引用本文的文献

1
Leveraging AI to Investigate Child Maltreatment Text Narratives: Promising Benefits and Addressable Risks.利用人工智能研究虐待儿童文本叙述:潜在益处与可应对风险
JMIR Pediatr Parent. 2025 Jul 24;8:e73579. doi: 10.2196/73579.
2
Understanding patterns of loneliness in older long-term care users using natural language processing with free text case notes.利用带有自由文本病例记录的自然语言处理技术,了解老年长期护理使用者的孤独模式。
PLoS One. 2025 Apr 2;20(4):e0319745. doi: 10.1371/journal.pone.0319745. eCollection 2025.
3
Conversational AI in Pediatric Mental Health: A Narrative Review.

本文引用的文献

1
Automated Organ-Level Classification of Free-Text Pathology Reports to Support a Radiology Follow-up Tracking Engine.用于支持放射学随访跟踪引擎的自由文本病理报告的自动器官水平分类
Radiol Artif Intell. 2019 Aug 7;1(5):e180052. doi: 10.1148/ryai.2019180052. eCollection 2019 Sep.
2
Efficient prediction of drug-drug interaction using deep learning models.利用深度学习模型实现药物-药物相互作用的高效预测。
IET Syst Biol. 2020 Aug;14(4):211-216. doi: 10.1049/iet-syb.2019.0116.
3
Novel Efficient RNN and LSTM-Like Architectures: Recurrent and Gated Broad Learning Systems and Their Applications for Text Classification.
儿科心理健康中的对话式人工智能:叙事性综述
Children (Basel). 2025 Mar 14;12(3):359. doi: 10.3390/children12030359.
4
Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.利用人工智能改善临床文档记录:一项系统综述。
Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.
5
Identifying Risk Factors of Children Who Suffered Physical Abuse: A Systematic Review.识别遭受身体虐待儿童的风险因素:一项系统综述
J Am Acad Orthop Surg Glob Res Rev. 2025 Jan 16;9(1). doi: 10.5435/JAAOSGlobal-D-24-00163. eCollection 2025 Jan 1.
6
Estimating the Prevalence of Child Abuse and Neglect Among Adolescents in Primary Care Through Diagnoses Codes and Free-Text EHR Clinical Notes.通过诊断编码和自由文本电子健康记录临床笔记估算初级保健中青少年遭受虐待和忽视的患病率
J Pediatr Health Care. 2025 Mar-Apr;39(2):189-197. doi: 10.1016/j.pedhc.2024.10.016. Epub 2024 Nov 22.
7
Community-engaged artificial intelligence research: A scoping review.社区参与式人工智能研究:一项范围综述。
PLOS Digit Health. 2024 Aug 23;3(8):e0000561. doi: 10.1371/journal.pdig.0000561. eCollection 2024 Aug.
8
Artificial Intelligence in Pediatrics: Learning to Walk Together.儿科学中的人工智能:携手共进。
Turk Arch Pediatr. 2024 Mar;59(2):121-130. doi: 10.5152/TurkArchPediatr.2024.24002.
9
Integrating Artificial Intelligence in Pediatric Healthcare: Parental Perceptions and Ethical Implications.将人工智能整合到儿科医疗保健中:家长的看法和伦理影响。
Children (Basel). 2024 Feb 14;11(2):240. doi: 10.3390/children11020240.
10
Stressful life events in electronic health records: a scoping review.电子健康记录中的应激性生活事件:范围综述。
J Am Med Inform Assoc. 2024 Apr 3;31(4):1025-1035. doi: 10.1093/jamia/ocae023.
新型高效 RNN 和类似 LSTM 的架构:递归和门控广义学习系统及其在文本分类中的应用。
IEEE Trans Cybern. 2021 Mar;51(3):1586-1597. doi: 10.1109/TCYB.2020.2969705. Epub 2021 Feb 17.
4
Comparison of Word Embeddings for Extraction from Medical Records.从病历中提取的词嵌入比较。
Int J Environ Res Public Health. 2019 Nov 8;16(22):4360. doi: 10.3390/ijerph16224360.
5
Machine learning algorithm validation with a limited sample size.机器学习算法在有限样本量下的验证。
PLoS One. 2019 Nov 7;14(11):e0224365. doi: 10.1371/journal.pone.0224365. eCollection 2019.
6
Scalable and accurate deep learning with electronic health records.借助电子健康记录实现可扩展且准确的深度学习。
NPJ Digit Med. 2018 May 8;1:18. doi: 10.1038/s41746-018-0029-1. eCollection 2018.
7
Prevalence and Correlates of the Co-Occurrence of Family Violence: A Meta-Analysis on Family Polyvictimization.家庭暴力共病的流行率及相关因素:家庭多受害者化的荟萃分析。
Trauma Violence Abuse. 2021 Apr;22(2):289-305. doi: 10.1177/1524838019841601. Epub 2019 May 8.
8
Clinical text classification with rule-based features and knowledge-guided convolutional neural networks.基于规则特征和知识引导卷积神经网络的临床文本分类。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):71. doi: 10.1186/s12911-019-0781-4.
9
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.电子健康记录中自由文本叙述的症状的自然语言处理:系统评价。
J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173.
10
A guide to deep learning in healthcare.深度学习在医疗保健中的应用指南。
Nat Med. 2019 Jan;25(1):24-29. doi: 10.1038/s41591-018-0316-z. Epub 2019 Jan 7.