• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 ICD 对临床文档进行诊断术语编码的机器学习方法。

Machine Learning Approaches on Diagnostic Term Encoding With the ICD for Clinical Documentation.

出版信息

IEEE J Biomed Health Inform. 2018 Jul;22(4):1323-1329. doi: 10.1109/JBHI.2017.2743824. Epub 2017 Aug 24.

DOI:10.1109/JBHI.2017.2743824
PMID:28858819
Abstract

This work focuses on data mining applied to the clinical documentation domain. Diagnostic terms (DTs) are used as keywords to retrieve valuable information from electronic health records. Indeed, they are encoded manually by experts following the International Classification of Diseases (ICD). The goal of this work is to explore the aid of text mining on DT encoding. From the machine learning (ML) perspective, this is a high-dimensional classification task, as it comprises thousands of codes. This work delves into a robust representation of the instances to improve ML results. The proposed system is able to find the right ICD code among more than 1500 possible ICD codes with 92% precision for the main disease (primary class) and 88% for the main disease together with the nonessential modifiers (fully specified class). The methodology employed is simple and portable. According to the experts from public hospitals, the system is very useful in particular for documentation and pharmacosurveillance services. In fact, they reported an accuracy of 91.2% on a small randomly extracted test. Hence, together with this paper, we made the software publicly available in order to help the clinical and research community.

摘要

这项工作专注于应用于临床文档领域的数据挖掘。诊断术语 (DT) 被用作从电子健康记录中检索有价值信息的关键字。实际上,它们是由专家根据国际疾病分类 (ICD) 手动编码的。这项工作旨在探索文本挖掘在 DT 编码方面的辅助作用。从机器学习 (ML) 的角度来看,这是一项高维分类任务,因为它包含数千个代码。这项工作深入研究了实例的稳健表示,以提高 ML 结果。所提出的系统能够在超过 1500 种可能的 ICD 代码中找到正确的 ICD 代码,对于主要疾病 (主要类别) 的准确率为 92%,对于主要疾病和非必要修饰符 (完全指定类别) 的准确率为 88%。所采用的方法简单且可移植。根据公立医院的专家的说法,该系统对于文档和药物监测服务特别有用。事实上,他们在一个小的随机提取测试中报告了 91.2%的准确率。因此,我们与本文一起将该软件公开提供,以帮助临床和研究界。

相似文献

1
Machine Learning Approaches on Diagnostic Term Encoding With the ICD for Clinical Documentation.基于 ICD 对临床文档进行诊断术语编码的机器学习方法。
IEEE J Biomed Health Inform. 2018 Jul;22(4):1323-1329. doi: 10.1109/JBHI.2017.2743824. Epub 2017 Aug 24.
2
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
3
Supervised Text Classification System Detects Fontan Patients in Electronic Records With Higher Accuracy Than Codes.监督式文本分类系统在电子病历中的 Fontan 患者检测准确率高于编码。
J Am Heart Assoc. 2023 Jul 4;12(13):e030046. doi: 10.1161/JAHA.123.030046. Epub 2023 Jun 22.
4
Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers.利用 ICD 层级结构通过多任务转换器对西班牙电子病历进行分类。
IEEE J Biomed Health Inform. 2022 Mar;26(3):1374-1383. doi: 10.1109/JBHI.2021.3112130. Epub 2022 Mar 7.
5
Interpretable deep learning to map diagnostic texts to ICD-10 codes.可解释的深度学习将诊断文本映射到 ICD-10 代码。
Int J Med Inform. 2019 Sep;129:49-59. doi: 10.1016/j.ijmedinf.2019.05.015. Epub 2019 May 22.
6
Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning.利用自然语言处理和机器学习有效识别国家规定的应报告癌症病例
J Am Med Inform Assoc. 2016 Nov;23(6):1077-1084. doi: 10.1093/jamia/ocw006. Epub 2016 Mar 28.
7
Cardiology record multi-label classification using latent Dirichlet allocation.使用潜在狄利克雷分配进行心脏病学记录的多标签分类。
Comput Methods Programs Biomed. 2018 Oct;164:111-119. doi: 10.1016/j.cmpb.2018.07.002. Epub 2018 Jul 17.
8
Building a common pipeline for rule-based document classification.构建用于基于规则的文档分类的通用管道。
Stud Health Technol Inform. 2013;192:1211.
9
Tool-supported Interactive Correction and Semantic Annotation of Narrative Clinical Reports.叙事性临床报告的工具支持交互式校正与语义标注
Methods Inf Med. 2017 May 18;56(3):217-229. doi: 10.3414/ME16-01-0083. Epub 2017 Apr 28.
10
Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text.深度学习模型在自由文本中进行 ICD-10 死亡证明和尸检报告编码。
J Biomed Inform. 2018 Apr;80:64-77. doi: 10.1016/j.jbi.2018.02.011. Epub 2018 Feb 26.

引用本文的文献

1
Hybrid natural language processing tool for semantic annotation of medical texts in Spanish.用于西班牙语医学文本语义标注的混合自然语言处理工具。
BMC Bioinformatics. 2025 Jan 8;26(1):7. doi: 10.1186/s12859-024-05949-6.
2
Digitalising the past decades: automated ICD-10 coding of unstructured free text dermatological diagnoses.数字化过去几十年:非结构化自由文本皮肤科诊断的自动化 ICD-10 编码。
BMC Health Serv Res. 2024 Oct 29;24(1):1297. doi: 10.1186/s12913-024-11761-y.
3
Evaluating the Prevalence of Burnout Among Health Care Professionals Related to Electronic Health Record Use: Systematic Review and Meta-Analysis.
评估与电子健康记录使用相关的医疗保健专业人员职业倦怠的患病率:系统评价与荟萃分析
JMIR Med Inform. 2024 Jun 12;12:e54811. doi: 10.2196/54811.
4
Classification of user queries according to a hierarchical medical procedure encoding system using an ensemble classifier.使用集成分类器根据分层医疗程序编码系统对用户查询进行分类。
Front Artif Intell. 2022 Nov 4;5:1000283. doi: 10.3389/frai.2022.1000283. eCollection 2022.
5
Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies.自然语言处理算法在将临床文本片段映射到本体概念上的应用:系统评价及对未来研究的建议。
J Biomed Semantics. 2020 Nov 16;11(1):14. doi: 10.1186/s13326-020-00231-z.
6
Construction of a semi-automatic ICD-10 coding system.构建一个半自动 ICD-10 编码系统。
BMC Med Inform Decis Mak. 2020 Apr 15;20(1):67. doi: 10.1186/s12911-020-1085-4.
7
Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.2019年国际医学信息学协会健康信息管理年鉴章节的研究结果。
Yearb Med Inform. 2019 Aug;28(1):65-68. doi: 10.1055/s-0039-1677941. Epub 2019 Aug 16.
8
Automated Billing Code Retrieval from MRI Scanner Log Data.从 MRI 扫描仪日志数据中自动提取计费代码。
J Digit Imaging. 2019 Dec;32(6):1103-1111. doi: 10.1007/s10278-019-00241-z.