• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques.使用基于知识和抽取式文本摘要技术从电子病历中无监督提取诊断代码
Adv Artif Intell. 2013 May;7884:77-88. doi: 10.1007/978-3-642-38457-8_7.
2
An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.对监督学习方法在为电子病历分配诊断代码中的实证评估。
Artif Intell Med. 2015 Oct;65(2):155-66. doi: 10.1016/j.artmed.2015.04.007. Epub 2015 May 15.
3
Neural transfer learning for assigning diagnosis codes to EMRs.将诊断编码分配给电子病历的神经迁移学习。
Artif Intell Med. 2019 May;96:116-122. doi: 10.1016/j.artmed.2019.04.002. Epub 2019 Apr 12.
4
Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding.电子病历中诊断代码的监督提取:特征选择、数据选择和概率阈值处理的作用
Proc (IEEE Int Conf Healthc Inform). 2013 Sep;2013:66-73. doi: 10.1109/ICHI.2013.15. Epub 2013 Dec 12.
5
Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach.增强临床文本的 ICD-10 编码分配:基于总结的方法。
Artif Intell Med. 2024 Oct;156:102967. doi: 10.1016/j.artmed.2024.102967. Epub 2024 Aug 20.
6
Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records.电子病历中索赔数据对成人先天性心脏病的识别和分类的准确性。
Arch Cardiovasc Dis. 2019 Jan;112(1):31-43. doi: 10.1016/j.acvd.2018.07.002. Epub 2019 Jan 3.
7
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.自动ICD - 10编码与训练系统:基于监督学习的深度神经网络
JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.
8
An explainable CNN approach for medical codes prediction from clinical text.一种用于从临床文本预测医疗编码的可解释 CNN 方法。
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):256. doi: 10.1186/s12911-021-01615-6.
9
Using SNOMED CT-encoded problems to improve ICD-10-CM coding-A randomized controlled experiment.使用 SNOMED CT 编码问题提高 ICD-10-CM 编码:一项随机对照试验。
Int J Med Inform. 2019 Jun;126:19-25. doi: 10.1016/j.ijmedinf.2019.03.002. Epub 2019 Mar 5.
10
Automated ICD-10 code assignment of nonstandard diagnoses via a two-stage framework.通过两阶段框架对非标准诊断进行自动ICD-10编码分配
Artif Intell Med. 2020 Aug;108:101939. doi: 10.1016/j.artmed.2020.101939. Epub 2020 Aug 15.

引用本文的文献

1
Automated ICD coding via unsupervised knowledge integration (UNITE).基于无监督知识集成的 ICD 编码自动化(UNITE)。
Int J Med Inform. 2020 Jul;139:104135. doi: 10.1016/j.ijmedinf.2020.104135. Epub 2020 Apr 4.
2
Constructing a knowledge-based heterogeneous information graph for medical health status classification.构建用于医疗健康状况分类的基于知识的异构信息图。
Health Inf Sci Syst. 2020 Feb 14;8(1):10. doi: 10.1007/s13755-020-0100-6. eCollection 2020 Dec.
3
A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation.一种通过语义相似性估计自动编码中文诊断的分层方法。
BMC Med Inform Decis Mak. 2016 Mar 3;16:30. doi: 10.1186/s12911-016-0269-4.
4
Application of clinical text data for phenome-wide association studies (PheWASs).临床文本数据在表型全基因组关联研究(PheWAS)中的应用。
Bioinformatics. 2015 Jun 15;31(12):1981-7. doi: 10.1093/bioinformatics/btv076. Epub 2015 Feb 4.
5
Applying MetaMap to Medline for identifying novel associations in a large clinical dataset: a feasibility analysis.应用 MetaMap 对 Medline 进行分析,以在大型临床数据集识别新的关联:可行性分析。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):925-37. doi: 10.1136/amiajnl-2014-002767. Epub 2014 Jun 13.

本文引用的文献

1
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
2
Three approaches to automatic assignment of ICD-9-CM codes to radiology reports.将ICD-9-CM编码自动分配到放射学报告的三种方法。
AMIA Annu Symp Proc. 2007 Oct 11;2007:279-83.
3
Automatic construction of rule-based ICD-9-CM coding systems.基于规则的ICD-9-CM编码系统的自动构建。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S10. doi: 10.1186/1471-2105-9-S3-S10.
4
Automating the assignment of diagnosis codes to patient encounters using example-based and machine learning techniques.使用基于示例和机器学习技术实现患者就诊诊断代码分配的自动化。
J Am Med Inform Assoc. 2006 Sep-Oct;13(5):516-25. doi: 10.1197/jamia.M2077. Epub 2006 Jun 23.
5
Beyond synonymy: exploiting the UMLS semantics in mapping vocabularies.超越同义词:在映射词汇表中利用统一医学语言系统语义
Proc AMIA Symp. 1998:815-9.
6
Development and evaluation of a computerized admission diagnoses encoding system.计算机化入院诊断编码系统的开发与评估
Comput Biomed Res. 1996 Oct;29(5):351-72. doi: 10.1006/cbmr.1996.0026.

使用基于知识和抽取式文本摘要技术从电子病历中无监督提取诊断代码

Unsupervised Extraction of Diagnosis Codes from EMRs Using Knowledge-Based and Extractive Text Summarization Techniques.

作者信息

Kavuluru Ramakanth, Han Sifei, Harris Daniel

机构信息

Division of Biomedical Informatics, Department of Biostatistics, University of Kentucky, Lexington, KY.

Department of Computer Science, University of Kentucky, Lexington, KY.

出版信息

Adv Artif Intell. 2013 May;7884:77-88. doi: 10.1007/978-3-642-38457-8_7.

DOI:10.1007/978-3-642-38457-8_7
PMID:28748227
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5524149/
Abstract

Diagnosis codes are extracted from medical records for billing and reimbursement and for secondary uses such as quality control and cohort identification. In the US, these codes come from the standard terminology ICD-9-CM derived from the international classification of diseases (ICD). ICD-9 codes are generally extracted by trained human coders by reading all artifacts available in a patient's medical record following specific coding guidelines. To assist coders in this manual process, this paper proposes an unsupervised ensemble approach to automatically extract ICD-9 diagnosis codes from textual narratives included in electronic medical records (EMRs). Earlier attempts on automatic extraction focused on individual documents such as radiology reports and discharge summaries. Here we use a more realistic dataset and extract ICD-9 codes from EMRs of 1000 inpatient visits at the University of Kentucky Medical Center. Using named entity recognition (NER), graph-based concept-mapping of medical concepts, and extractive text summarization techniques, we achieve an example based average recall of 0.42 with average precision 0.47; compared with a baseline of using only NER, we notice a 12% improvement in recall with the graph-based approach and a 7% improvement in precision using the extractive text summarization approach. Although diagnosis codes are complex concepts often expressed in text with significant long range non-local dependencies, our present work shows the potential of unsupervised methods in extracting a portion of codes. As such, our findings are especially relevant for code extraction tasks where obtaining large amounts of training data is difficult.

摘要

诊断代码从医疗记录中提取,用于计费和报销以及质量控制和队列识别等二次用途。在美国,这些代码来自源自国际疾病分类(ICD)的标准术语ICD-9-CM。ICD-9代码通常由经过培训的人工编码员按照特定的编码指南阅读患者医疗记录中的所有文档来提取。为了在这个手动过程中帮助编码员,本文提出了一种无监督集成方法,用于从电子病历(EMR)中的文本叙述中自动提取ICD-9诊断代码。早期的自动提取尝试集中在单个文档上,如放射学报告和出院小结。在这里,我们使用了一个更现实的数据集,并从肯塔基大学医学中心1000次住院就诊的电子病历中提取ICD-9代码。使用命名实体识别(NER)、基于图的医学概念映射和提取式文本摘要技术,我们实现了基于示例的平均召回率为0.42,平均精确率为0.47;与仅使用NER的基线相比,我们注意到基于图的方法召回率提高了12%,使用提取式文本摘要方法精确率提高了7%。尽管诊断代码是复杂的概念,通常在文本中表达,具有显著的长距离非局部依赖性,但我们目前的工作显示了无监督方法在提取一部分代码方面的潜力。因此,我们的发现对于难以获得大量训练数据的代码提取任务尤其相关。