基于电子病历的疾病本体结构的诊断识别与预测方法的统一。

Unifying Diagnosis Identification and Prediction Method Embedding the Disease Ontology Structure From Electronic Medical Records.

机构信息

Health Management Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China.

School of Economics and Management, Institute of Systems Engineering, Dalian University of Technology, Dalian, China.

出版信息

Front Public Health. 2022 Jan 20;9:793801. doi: 10.3389/fpubh.2021.793801. eCollection 2021.

DOI:10.3389/fpubh.2021.793801

PMID:35127624

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8811031/

Abstract

OBJECTIVE

The reasonable classification of a large number of distinct diagnosis codes can clarify patient diagnostic information and help clinicians to improve their ability to assign and target treatment for primary diseases. Our objective is to identify and predict a unifying diagnosis (UD) from electronic medical records (EMRs).

METHODS

We screened 4,418 sepsis patients from a public MIMIC-III database and extracted their diagnostic information for UD identification, their demographic information, laboratory examination information, chief complaint, and history of present illness information for UD prediction. We proposed a data-driven UD identification and prediction method (UDIPM) embedding the disease ontology structure. First, we designed a set similarity measure method embedding the disease ontology structure to generate a patient similarity matrix. Second, we applied affinity propagation clustering to divide patients into different clusters, and extracted a typical diagnosis code co-occurrence pattern from each cluster. Furthermore, we identified a UD by fusing visual analysis and a conditional co-occurrence matrix. Finally, we trained five classifiers in combination with feature fusion and feature selection method to unify the diagnosis prediction.

RESULTS

The experimental results on a public electronic medical record dataset showed that the UDIPM could extracted a typical diagnosis code co-occurrence pattern effectively, identified and predicted a UD based on patients' diagnostic and admission information, and outperformed other fusion methods overall.

CONCLUSIONS

The accurate identification and prediction of the UD from a large number of distinct diagnosis codes and multi-source heterogeneous patient admission information in EMRs can provide a data-driven approach to assist better coding integration of diagnosis.

摘要

目的

对大量不同的诊断代码进行合理分类，可以阐明患者的诊断信息，帮助临床医生提高对主要疾病进行分类和治疗的能力。我们的目标是从电子病历（EMR）中识别和预测统一诊断（UD）。

方法

我们从公共的 MIMIC-III 数据库中筛选了 4418 例脓毒症患者，并提取了他们的诊断信息，用于 UD 识别、人口统计学信息、实验室检查信息、主要诉求和现病史信息，用于 UD 预测。我们提出了一种基于数据驱动的 UD 识别和预测方法（UDIPM），嵌入了疾病本体结构。首先，我们设计了一种带有疾病本体结构的集合相似度测量方法，生成患者相似度矩阵。其次，我们应用亲和传播聚类将患者分为不同的簇，并从每个簇中提取典型的诊断代码共现模式。此外，我们通过融合视觉分析和条件共现矩阵来识别 UD。最后，我们结合特征融合和特征选择方法训练了五个分类器来统一诊断预测。

结果

在公共电子病历数据集上的实验结果表明，UDIPM 可以有效地提取典型的诊断代码共现模式，根据患者的诊断和入院信息识别和预测 UD，并且总体上优于其他融合方法。

结论

从 EMR 中大量不同的诊断代码和多源异构的患者入院信息中准确识别和预测 UD，可以提供一种数据驱动的方法来辅助更好地整合诊断编码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7add/8811031/5b7314516ef4/fpubh-09-793801-g0001.jpg

相似文献

Unifying Diagnosis Identification and Prediction Method Embedding the Disease Ontology Structure From Electronic Medical Records.

Front Public Health. 2022 Jan 20;9:793801. doi: 10.3389/fpubh.2021.793801. eCollection 2021.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.

Artif Intell Med. 2015 Oct;65(2):155-66. doi: 10.1016/j.artmed.2015.04.007. Epub 2015 May 15.

A fusion framework to extract typical treatment patterns from electronic medical records.

Artif Intell Med. 2020 Mar;103:101782. doi: 10.1016/j.artmed.2019.101782. Epub 2019 Dec 28.

A data-driven framework of typical treatment process extraction and evaluation.

J Biomed Inform. 2018 Jul;83:178-195. doi: 10.1016/j.jbi.2018.06.004. Epub 2018 Jun 15.

Selecting relevant features from the electronic health record for clinical code prediction.

J Biomed Inform. 2017 Oct;74:92-103. doi: 10.1016/j.jbi.2017.09.004. Epub 2017 Sep 14.

Utility of linking primary care electronic medical records with Canadian census data to study the determinants of chronic disease: an example based on socioeconomic status and obesity.

BMC Med Inform Decis Mak. 2016 Mar 11;16:32. doi: 10.1186/s12911-016-0272-9.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

Sequential Data-Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development.

J Med Internet Res. 2022 Jan 6;24(1):e30720. doi: 10.2196/30720.

[A customized method for information extraction from unstructured text data in the electronic medical records].

Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.

引用本文的文献

From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases.

BioData Min. 2024 Oct 22;17(1):42. doi: 10.1186/s13040-024-00396-8.

Paradigm shift required for translational research on the brain.

Exp Mol Med. 2024 May;56(5):1043-1054. doi: 10.1038/s12276-024-01218-x. Epub 2024 May 1.

Towards Transparent Healthcare: Advancing Local Explanation Methods in Explainable Artificial Intelligence.

Bioengineering (Basel). 2024 Apr 12;11(4):369. doi: 10.3390/bioengineering11040369.

本文引用的文献

Integrating multidimensional data for clustering analysis with applications to cancer patient data.

J Am Stat Assoc. 2021;116(533):14-26. doi: 10.1080/01621459.2020.1730853. Epub 2020 Mar 19.

Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention.

Int J Med Inform. 2022 Jan;157:104615. doi: 10.1016/j.ijmedinf.2021.104615. Epub 2021 Oct 29.

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning.

JMIR Med Inform. 2021 Aug 31;9(8):e23230. doi: 10.2196/23230.

Automated ICD coding for primary diagnosis via clinically interpretable machine learning.

Int J Med Inform. 2021 Sep;153:104543. doi: 10.1016/j.ijmedinf.2021.104543. Epub 2021 Jul 27.

Machine Learning Prediction Models for Mechanically Ventilated Patients: Analyses of the MIMIC-III Database.

Front Med (Lausanne). 2021 Jul 1;8:662340. doi: 10.3389/fmed.2021.662340. eCollection 2021.

Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults.

Front Public Health. 2021 Jun 29;9:626331. doi: 10.3389/fpubh.2021.626331. eCollection 2021.

Health information technology and digital innovation for national learning health and care systems.

Lancet Digit Health. 2021 Jun;3(6):e383-e396. doi: 10.1016/S2589-7500(21)00005-4. Epub 2021 May 6.

Predicting mortality of patients with acute kidney injury in the ICU using XGBoost model.

PLoS One. 2021 Feb 4;16(2):e0246306. doi: 10.1371/journal.pone.0246306. eCollection 2021.

Intracranial mesenchymal tumor with FET-CREB fusion-A unifying diagnosis for the spectrum of intracranial myxoid mesenchymal tumors and angiomatoid fibrous histiocytoma-like neoplasms.

Brain Pathol. 2021 Jul;31(4):e12918. doi: 10.1111/bpa.12918. Epub 2021 Jan 28.

Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU.

BMC Med Inform Decis Mak. 2020 Oct 2;20(1):251. doi: 10.1186/s12911-020-01271-2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于电子病历的疾病本体结构的诊断识别与预测方法的统一。

Unifying Diagnosis Identification and Prediction Method Embedding the Disease Ontology Structure From Electronic Medical Records.

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献