• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

意大利基于无编码急诊入院记录的儿科伤害监测:基于机器学习的文本挖掘方法。

Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach.

机构信息

Department of Environmental and Preventive Sciences, University of Ferrara, Ferrara, Italy.

Department of Women's and Children's Health, University of Padova, Padua, Italy.

出版信息

JMIR Public Health Surveill. 2023 Jul 12;9:e44467. doi: 10.2196/44467.

DOI:10.2196/44467
PMID:37436799
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10372563/
Abstract

BACKGROUND

Unintentional injury is the leading cause of death in young children. Emergency department (ED) diagnoses are a useful source of information for injury epidemiological surveillance purposes. However, ED data collection systems often use free-text fields to report patient diagnoses. Machine learning techniques (MLTs) are powerful tools for automatic text classification. The MLT system is useful to improve injury surveillance by speeding up the manual free-text coding tasks of ED diagnoses.

OBJECTIVE

This research aims to develop a tool for automatic free-text classification of ED diagnoses to automatically identify injury cases. The automatic classification system also serves for epidemiological purposes to identify the burden of pediatric injuries in Padua, a large province in the Veneto region in the Northeast Italy.

METHODS

The study includes 283,468 pediatric admissions between 2007 and 2018 to the Padova University Hospital ED, a large referral center in Northern Italy. Each record reports a diagnosis by free text. The records are standard tools for reporting patient diagnoses. An expert pediatrician manually classified a randomly extracted sample of approximately 40,000 diagnoses. This study sample served as the gold standard to train an MLT classifier. After preprocessing, a document-term matrix was created. The machine learning classifiers, including decision tree, random forest, gradient boosting method (GBM), and support vector machine (SVM), were tuned by 4-fold cross-validation. The injury diagnoses were classified into 3 hierarchical classification tasks, as follows: injury versus noninjury (task A), intentional versus unintentional injury (task B), and type of unintentional injury (task C), according to the World Health Organization classification of injuries.

RESULTS

The SVM classifier achieved the highest performance accuracy (94.14%) in classifying injury versus noninjury cases (task A). The GBM method produced the best results (92% accuracy) for the unintentional and intentional injury classification task (task B). The highest accuracy for the unintentional injury subclassification (task C) was achieved by the SVM classifier. The SVM, random forest, and GBM algorithms performed similarly against the gold standard across different tasks.

CONCLUSIONS

This study shows that MLTs are promising techniques for improving epidemiological surveillance, allowing for the automatic classification of pediatric ED free-text diagnoses. The MLTs revealed a suitable classification performance, especially for general injuries and intentional injury classification. This automatic classification could facilitate the epidemiological surveillance of pediatric injuries by also reducing the health professionals' efforts in manually classifying diagnoses for research purposes.

摘要

背景

意外伤害是导致儿童死亡的主要原因。急诊科(ED)诊断是伤害流行病学监测的有用信息来源。然而,ED 数据收集系统通常使用自由文本字段报告患者诊断。机器学习技术(MLT)是自动文本分类的强大工具。该 MLT 系统通过加快 ED 诊断的手动自由文本编码任务,有助于改进伤害监测。

目的

本研究旨在开发一种用于自动分类 ED 诊断的自由文本的工具,以自动识别伤害病例。自动分类系统还可用于流行病学目的,以确定意大利东北部威尼托地区一个大省帕多瓦的儿科伤害负担。

方法

该研究包括 2007 年至 2018 年期间帕多瓦大学医院 ED 收治的 283468 例儿科住院患者,这是一家大型转诊中心。每个记录都通过自由文本报告一个诊断。这些记录是报告患者诊断的标准工具。一名儿科专家对大约 40000 个诊断进行了随机抽取样本的手动分类。该研究样本作为训练 MLT 分类器的金标准。经过预处理,创建了一个文档-术语矩阵。机器学习分类器包括决策树、随机森林、梯度提升方法(GBM)和支持向量机(SVM),通过 4 折交叉验证进行调整。根据世界卫生组织的伤害分类,将伤害诊断分为 3 个层次分类任务,如下所示:伤害与非伤害(任务 A)、故意伤害与非故意伤害(任务 B)和非故意伤害类型(任务 C)。

结果

SVM 分类器在分类伤害与非伤害病例(任务 A)方面表现出最高的性能准确性(94.14%)。GBM 方法在非故意伤害和故意伤害分类任务(任务 B)中产生了最佳结果(准确率 92%)。SVM 分类器在非故意伤害亚分类(任务 C)方面达到了最高的准确性。SVM、随机森林和 GBM 算法在不同任务中对金标准的表现相似。

结论

本研究表明,MLT 是改进流行病学监测的有前途的技术,允许对儿科 ED 自由文本诊断进行自动分类。MLT 表现出了合适的分类性能,特别是对于一般伤害和故意伤害分类。这种自动分类可以通过减少卫生专业人员在手动分类诊断以用于研究目的方面的工作,促进儿科伤害的流行病学监测。

相似文献

1
Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach.意大利基于无编码急诊入院记录的儿科伤害监测:基于机器学习的文本挖掘方法。
JMIR Public Health Surveill. 2023 Jul 12;9:e44467. doi: 10.2196/44467.
2
Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.小儿创伤性脑损伤计算机断层扫描成像报告的自动结果分类
Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.
3
Analysis of Unstructured Text-Based Data Using Machine Learning Techniques: The Case of Pediatric Emergency Department Records in Nicaragua.基于机器学习技术的非结构化文本数据分析:以尼加拉瓜儿科急诊记录为例。
Med Care Res Rev. 2021 Apr;78(2):138-145. doi: 10.1177/1077558719844123. Epub 2019 Apr 29.
4
Text mining approach to predict hospital admissions using early medical records from the emergency department.利用急诊科早期医疗记录预测住院情况的文本挖掘方法。
Int J Med Inform. 2017 Apr;100:1-8. doi: 10.1016/j.ijmedinf.2017.01.001. Epub 2017 Jan 5.
5
Automated Classification of Selected Data Elements from Free-text Diagnostic Reports for Clinical Research.用于临床研究的自由文本诊断报告中选定数据元素的自动分类
Methods Inf Med. 2016 Aug 5;55(4):373-80. doi: 10.3414/ME15-02-0019. Epub 2016 Jul 13.
6
Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能:一种估计诊断率的工具
AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.
7
Construction accident narrative classification: An evaluation of text mining techniques.建筑事故叙述分类:文本挖掘技术评估
Accid Anal Prev. 2017 Nov;108:122-130. doi: 10.1016/j.aap.2017.08.026. Epub 2017 Sep 1.
8
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.
9
Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records.自然语言处理和机器学习可实现从电子病历中自动提取和分类患者的吸烟状况。
Ups J Med Sci. 2020 Nov;125(4):316-324. doi: 10.1080/03009734.2020.1792010. Epub 2020 Jul 22.
10
Application of a Machine Learning-Based Decision Support Tool to Improve an Injury Surveillance System Workflow.基于机器学习的决策支持工具在改进伤害监测系统工作流程中的应用。
Appl Clin Inform. 2022 May;13(3):700-710. doi: 10.1055/a-1863-7176. Epub 2022 May 29.

引用本文的文献

1
AI-Driven Injury Reporting in Pediatric Emergency Departments.儿科急诊科中由人工智能驱动的损伤报告
JAMA Netw Open. 2025 Jul 1;8(7):e2524154. doi: 10.1001/jamanetworkopen.2025.24154.
2
Use of a Large Language Model to Identify and Classify Injuries With Free-Text Emergency Department Data.使用大语言模型通过急诊部自由文本数据识别和分类损伤情况。
JAMA Netw Open. 2024 May 1;7(5):e2413208. doi: 10.1001/jamanetworkopen.2024.13208.
3
Development of machine learning-based predictors for early diagnosis of hepatocellular carcinoma.

本文引用的文献

1
Epidemiology and Trends over Time of Foreign Body Injuries in the Pediatric Emergency Department.儿科急诊科异物损伤的流行病学及随时间变化趋势
Children (Basel). 2021 Oct 19;8(10):938. doi: 10.3390/children8100938.
2
Monitoring Public Perception of Health Risks in Brazil and Italy: Cross-Cultural Research on the Risk Perception of Choking in Children.监测巴西和意大利公众对健康风险的认知:关于儿童窒息风险认知的跨文化研究。
Children (Basel). 2021 Jun 24;8(7):541. doi: 10.3390/children8070541.
3
Analysis of Unstructured Text-Based Data Using Machine Learning Techniques: The Case of Pediatric Emergency Department Records in Nicaragua.
基于机器学习的肝细胞癌早期诊断预测因子的研究进展。
Sci Rep. 2024 Mar 4;14(1):5274. doi: 10.1038/s41598-024-51265-7.
基于机器学习技术的非结构化文本数据分析:以尼加拉瓜儿科急诊记录为例。
Med Care Res Rev. 2021 Apr;78(2):138-145. doi: 10.1177/1077558719844123. Epub 2019 Apr 29.
4
Extending PubMed searches to ClinicalTrials.gov through a machine learning approach for systematic reviews.通过机器学习方法扩展 PubMed 检索以用于系统评价:ClinicalTrials.gov 的应用。
J Clin Epidemiol. 2018 Nov;103:22-30. doi: 10.1016/j.jclinepi.2018.06.015. Epub 2018 Jul 5.
5
The Epidemiology of Unintentional and Violence-Related Injury Morbidity and Mortality among Children and Adolescents in the United States.美国儿童和青少年意外伤害和暴力相关伤害发病率和死亡率的流行病学。
Int J Environ Res Public Health. 2018 Mar 28;15(4):616. doi: 10.3390/ijerph15040616.
6
Development of a Machine Learning Algorithm for the Surveillance of Autism Spectrum Disorder.一种用于监测自闭症谱系障碍的机器学习算法的开发
PLoS One. 2016 Dec 21;11(12):e0168224. doi: 10.1371/journal.pone.0168224. eCollection 2016.
7
Foreign body injuries in children: a review.儿童异物损伤:综述
Acta Otorhinolaryngol Ital. 2015 Oct;35(4):265-71.
8
Our Shrinking Globe: Implications for Child Unintentional Injuries.我们日益缩小的地球:对儿童意外伤害的影响。
Pediatr Clin North Am. 2016 Feb;63(1):167-81. doi: 10.1016/j.pcl.2015.08.009.
9
Knowledge, attitudes, and practices of family physicians and nurses regarding unintentional injuries among children under 15 years in Cairo, Egypt.埃及开罗家庭医生和护士关于15岁以下儿童意外伤害的知识、态度和行为。
Int J Inj Contr Saf Promot. 2017 Mar;24(1):24-31. doi: 10.1080/17457300.2015.1056808. Epub 2015 Jul 15.
10
Leading causes of unintentional and intentional injury mortality: United States, 2000-2009.导致美国 2000-2009 年非故意伤害和故意伤害死亡的主要原因。
Am J Public Health. 2012 Nov;102(11):e84-92. doi: 10.2105/AJPH.2012.300960. Epub 2012 Sep 20.