• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然语言处理与国际疾病分类编码在构建跌倒损伤患者登记册中的性能:回顾性分析

Performance of Natural Language Processing versus International Classification of Diseases Codes in Building Registries for Patients With Fall Injury: Retrospective Analysis.

作者信息

Taseh Atta, Sasanfar Souri, Chan Michelle, Sirls Evan, Nazarian Ara, Batmanghelich Kayhan, Bean Jonathan F, Ashkani-Esfahani Soheil

机构信息

Foot & Ankle Research and Innovations Laboratory (FARIL), Department of Orthopaedic Surgery, Mass General Brigham, Harvard Medical School, 158 Boston Post Road, Weston, MA, 02493, United States, 1 7818279613.

Musculoskeletal Translational Innovation Initiative, Carl J. Shapiro Department of Orthopaedic Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States.

出版信息

JMIR Med Inform. 2025 Jul 14;13:e66973. doi: 10.2196/66973.

DOI:10.2196/66973
PMID:40658984
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12279314/
Abstract

BACKGROUND

Standardized registries, such as the International Classification of Diseases (ICD) codes, are commonly built using administrative codes assigned to patient encounters. However, patients with fall injury are often coded using subsequent injury codes, such as hip fractures. This necessitates manual screening to ensure the accuracy of data registries.

OBJECTIVE

This study aimed to automate the extraction of fall incidents and mechanisms using natural language processing (NLP) and compare this approach with the ICD method.

METHODS

Clinical notes for patients with fall-induced hip fractures were retrospectively reviewed by medical experts. Fall incidences were detected, annotated, and classified among patients who had a fall-induced hip fracture (case group). The control group included patients with hip fractures without any evidence of falls. NLP models were developed using the annotated notes of the study groups to fulfill two separate tasks: fall occurrence detection and fall mechanism classification. The performances of the models were compared using accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1-score, and area under the receiver operating characteristic curve.

RESULTS

A total of 1769 clinical notes were included in the final analysis for the fall occurrence task, and 783 clinical notes were analyzed for the fall mechanism classification task. The highest F1-score using NLP for fall occurrence was 0.97 (specificity=0.96; sensitivity=0.97), and for fall mechanism classification was 0.61 (specificity=0.56; sensitivity=0.62). Natural language processing could detect up to 98% of the fall occurrences and 65% of the fall mechanisms accurately, compared to 26% and 12%, respectively, by ICD codes.

CONCLUSIONS

Our findings showed promising performance with higher accuracy of NLP algorithms compared to the conventional method for detecting fall occurrence and mechanism in developing disease registries using clinical notes. Our approach can be introduced to other registries that are based on large data and are in need of accurate annotation and classification.

摘要

背景

标准化登记系统,如国际疾病分类(ICD)编码,通常是使用分配给患者就诊的管理代码构建的。然而,跌倒受伤患者通常使用后续的损伤代码进行编码,如髋部骨折。这就需要人工筛查以确保数据登记的准确性。

目的

本研究旨在使用自然语言处理(NLP)自动提取跌倒事件及机制,并将该方法与ICD方法进行比较。

方法

医学专家对跌倒导致髋部骨折患者的临床记录进行回顾性审查。在跌倒导致髋部骨折的患者(病例组)中检测、标注并分类跌倒发生率。对照组包括无任何跌倒证据的髋部骨折患者。利用研究组的标注记录开发NLP模型,以完成两项独立任务:跌倒发生检测和跌倒机制分类。使用准确率、灵敏度、特异度、阳性预测值、阴性预测值、F1分数和受试者工作特征曲线下面积比较模型性能。

结果

最终分析纳入了1769份用于跌倒发生任务的临床记录,783份用于跌倒机制分类任务的临床记录。使用NLP进行跌倒发生检测的最高F1分数为0.97(特异度=0.96;灵敏度=0.97),跌倒机制分类为0.61(特异度=0.56;灵敏度=0.62)。与ICD编码分别为26%和12%相比,自然语言处理能够准确检测高达98%的跌倒发生率和65%的跌倒机制。

结论

我们的研究结果显示,在使用临床记录开发疾病登记系统时,与传统方法相比,NLP算法在检测跌倒发生和机制方面具有更高的准确性,表现出良好的性能。我们的方法可引入到其他基于大数据且需要准确标注和分类的登记系统中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/fcc8616523a8/medinform-v13-e66973-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/fda56facf81a/medinform-v13-e66973-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/c981a2b51ac6/medinform-v13-e66973-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/fcc8616523a8/medinform-v13-e66973-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/fda56facf81a/medinform-v13-e66973-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/c981a2b51ac6/medinform-v13-e66973-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1210/12279314/fcc8616523a8/medinform-v13-e66973-g003.jpg

相似文献

1
Performance of Natural Language Processing versus International Classification of Diseases Codes in Building Registries for Patients With Fall Injury: Retrospective Analysis.自然语言处理与国际疾病分类编码在构建跌倒损伤患者登记册中的性能:回顾性分析
JMIR Med Inform. 2025 Jul 14;13:e66973. doi: 10.2196/66973.
2
Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis.用于识别老年人住院记录中跌倒事件的基于规则的自然语言处理算法的开发与验证:回顾性分析
JMIR Aging. 2025 Jul 8;8:e65195. doi: 10.2196/65195.
3
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
4
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
De Novo Natural Language Processing Algorithm Accurately Identifies Myxofibrosarcoma From Pathology Reports.全新自然语言处理算法可从病理报告中准确识别黏液纤维肉瘤。
Clin Orthop Relat Res. 2025 Jan 1;483(1):80-87. doi: 10.1097/CORR.0000000000003270. Epub 2024 Oct 2.
8
Validation of administrative health data for the identification of endometriosis diagnosis.用于识别子宫内膜异位症诊断的行政健康数据验证
Hum Reprod. 2025 Feb 1;40(2):289-295. doi: 10.1093/humrep/deae281.
9
Extraction of sleep information from clinical notes of Alzheimer's disease patients using natural language processing.使用自然语言处理从阿尔茨海默病患者的临床记录中提取睡眠信息。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2217-2227. doi: 10.1093/jamia/ocae177.
10
Language Models for Multilabel Document Classification of Surgical Concepts in Exploratory Laparotomy Operative Notes: Algorithm Development Study.用于探索性剖腹手术记录中手术概念多标签文档分类的语言模型:算法开发研究
JMIR Med Inform. 2025 Jul 9;13:e71176. doi: 10.2196/71176.

本文引用的文献

1
BERT-Based Neural Network for Inpatient Fall Detection From Electronic Medical Records: Retrospective Cohort Study.基于BERT的神经网络用于从电子病历中检测住院患者跌倒:回顾性队列研究
JMIR Med Inform. 2024 Jan 30;12:e48995. doi: 10.2196/48995.
2
Constructing a disease database and using natural language processing to capture and standardize free text clinical information.构建疾病数据库并使用自然语言处理技术来捕获和规范自由文本临床信息。
Sci Rep. 2023 May 26;13(1):8591. doi: 10.1038/s41598-023-35482-0.
3
Automated deidentification of radiology reports combining transformer and "hide in plain sight" rule-based methods.
基于 Transformer 和“隐藏在明处”规则的放射学报告自动去识别化。
J Am Med Inform Assoc. 2023 Jan 18;30(2):318-328. doi: 10.1093/jamia/ocac219.
4
Administrative Data Use in National Registry Efforts: Blessing or Curse?国家登记工作中行政数据的使用:是福还是祸?
J Bone Joint Surg Am. 2022 Oct 19;104(Suppl 3):39-46. doi: 10.2106/JBJS.22.00565.
5
A hybrid model to identify fall occurrence from electronic health records.一种从电子健康记录中识别跌倒事件发生情况的混合模型。
Int J Med Inform. 2022 Mar 7;162:104736. doi: 10.1016/j.ijmedinf.2022.104736.
6
Validity of Using Billing Codes From Electronic Health Records to Estimate Skin Cancer Counts.利用电子健康记录中的计费代码估算皮肤癌计数的有效性。
JAMA Dermatol. 2021 Sep 1;157(9):1089-1094. doi: 10.1001/jamadermatol.2021.2856.
7
Evaluating resampling methods and structured features to improve fall incident report identification by the severity level.评估重采样方法和结构化特征,以按严重程度改进跌倒事件报告识别。
J Am Med Inform Assoc. 2021 Jul 30;28(8):1756-1764. doi: 10.1093/jamia/ocab048.
8
Trends in Nonfatal Falls and Fall-Related Injuries Among Adults Aged ≥65 Years - United States, 2012-2018.2012-2018 年美国≥65 岁老年人非致命性跌倒和与跌倒相关伤害的趋势。
MMWR Morb Mortal Wkly Rep. 2020 Jul 10;69(27):875-881. doi: 10.15585/mmwr.mm6927a5.
9
Predicting Inpatient Falls Using Natural Language Processing of Nursing Records Obtained From Japanese Electronic Medical Records: Case-Control Study.利用从日本电子病历中获取的护理记录进行自然语言处理预测住院患者跌倒:病例对照研究。
JMIR Med Inform. 2020 Apr 22;8(4):e16970. doi: 10.2196/16970.
10
Mechanisms of accidental fall injuries and involved injury factors: a registry-based study.意外跌倒损伤的机制及相关损伤因素:一项基于登记处的研究。
Inj Epidemiol. 2020 Mar 16;7(1):8. doi: 10.1186/s40621-020-0234-7.