通过整合不同电子健康数据资源并应用机器学习策略优化晚期慢性肾脏病及无肾脏疾病的识别

Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies.

作者信息

Weber Christoph, Röschke Lena, Modersohn Luise, Lohr Christina, Kolditz Tobias, Hahn Udo, Ammon Danny, Betz Boris, Kiehntopf Michael

机构信息

Department of Clinical Chemistry and Laboratory Diagnostics and Integrated Biobank Jena (IBBJ), Jena University Hospital, 07747 Jena, Germany.

Jena University Language & Information Engineering (JULIE) Lab, Friedrich Schiller University Jena, 07743 Jena, Germany.

出版信息

J Clin Med. 2020 Sep 12;9(9):2955. doi: 10.3390/jcm9092955.

DOI:10.3390/jcm9092955

PMID:32932685

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7563476/

Abstract

Automated identification of advanced chronic kidney disease (CKD ≥ III) and of no known kidney disease (NKD) can support both clinicians and researchers. We hypothesized that identification of CKD and NKD can be improved, by combining information from different electronic health record (EHR) resources, comprising laboratory values, discharge summaries and ICD-10 billing codes, compared to using each component alone. We included EHRs from 785 elderly multimorbid patients, hospitalized between 2010 and 2015, that were divided into a training and a test (n = 156) dataset. We used both the area under the receiver operating characteristic (AUROC) and under the precision-recall curve (AUCPR) with a 95% confidence interval for evaluation of different classification models. In the test dataset, the combination of EHR components as a simple classifier identified CKD ≥ III (AUROC 0.96[0.93-0.98]) and NKD (AUROC 0.94[0.91-0.97]) better than laboratory values (AUROC CKD 0.85[0.79-0.90], NKD 0.91[0.87-0.94]), discharge summaries (AUROC CKD 0.87[0.82-0.92], NKD 0.84[0.79-0.89]) or ICD-10 billing codes (AUROC CKD 0.85[0.80-0.91], NKD 0.77[0.72-0.83]) alone. Logistic regression and machine learning models improved recognition of CKD ≥ III compared to the simple classifier if only laboratory values were used (AUROC 0.96[0.92-0.99] vs. 0.86[0.81-0.91], < 0.05) and improved recognition of NKD if information from previous hospital stays was used (AUROC 0.99[0.98-1.00] vs. 0.95[0.92-0.97]], < 0.05). Depending on the availability of data, correct automated identification of CKD ≥ III and NKD from EHRs can be improved by generating classification models based on the combination of different EHR components.

摘要

自动识别晚期慢性肾脏病（CKD≥III期）和无已知肾脏疾病（NKD）的情况，可为临床医生和研究人员提供帮助。我们假设，与单独使用每个组件相比，通过整合来自不同电子健康记录（EHR）资源的信息（包括实验室检查值、出院小结和ICD-10计费代码），可以改善对CKD和NKD的识别。我们纳入了2010年至2015年间住院的785例老年多病患者的电子健康记录，并将其分为训练数据集和测试数据集（n = 156）。我们使用了受试者工作特征曲线下面积（AUROC）和精确召回率曲线下面积（AUCPR）以及95%置信区间来评估不同的分类模型。在测试数据集中，作为简单分类器的EHR组件组合对CKD≥III期（AUROC 0.96[0.93 - 0.98]）和NKD（AUROC 0.94[0.91 - 0.97]）的识别，优于单独使用实验室检查值（CKD的AUROC 0.85[0.79 - 0.90]，NKD的AUROC 0.91[0.87 - 0.94]）、出院小结（CKD的AUROC 0.87[0.82 - 0.92]，NKD的AUROC 0.84[0.79 - 0.89]）或ICD-10计费代码（CKD的AUROC 0.85[0.80 - 0.91]，NKD的AUROC 0.77[0.72 - 0.83]）。如果仅使用实验室检查值，逻辑回归和机器学习模型与简单分类器相比，对CKD≥III期的识别有所改善（AUROC 0.96[0.92 - 0.99]对0.86[0.81 - 0.91]，P < 0.05）；如果使用既往住院信息，则对NKD的识别有所改善（AUROC 0.99[0.98 - 1.00]对0.95[0.92 - 0.97]，P < 0.05）。根据数据的可用性，通过基于不同EHR组件的组合生成分类模型，可以改善从EHR中正确自动识别CKD≥III期和NKD的情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a560/7563476/bcd6394fc767/jcm-09-02955-g001.jpg

相似文献

Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies.通过整合不同电子健康数据资源并应用机器学习策略优化晚期慢性肾脏病及无肾脏疾病的识别

J Clin Med. 2020 Sep 12;9(9):2955. doi: 10.3390/jcm9092955.

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.基于 FHIR 的电子健康记录表型框架的开发：以从出院小结中识别肥胖且伴有多种合并症的患者为例。

J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.

Early Detection of Septic Shock Onset Using Interpretable Machine Learners.使用可解释机器学习算法早期检测脓毒症休克发作

J Clin Med. 2021 Jan 15;10(2):301. doi: 10.3390/jcm10020301.

Electronic Phenotype for Advanced Chronic Kidney Disease in a Veteran Health Care System Clinical Database: Systems-Based Strategy for Model Development and Evaluation.退伍军人医疗系统临床数据库中晚期慢性肾病的电子表型：基于系统的模型开发与评估策略

Interact J Med Res. 2023 Jul 24;12:e43384. doi: 10.2196/43384.

Predicting Postoperative Mortality With Deep Neural Networks and Natural Language Processing: Model Development and Validation.使用深度神经网络和自然语言处理预测术后死亡率：模型开发与验证

JMIR Med Inform. 2022 May 10;10(5):e38241. doi: 10.2196/38241.

CKD Progression Prediction in a Diverse US Population: A Machine-Learning Model.美国多样化人群中慢性肾脏病进展预测：一种机器学习模型

Kidney Med. 2023 Jun 24;5(9):100692. doi: 10.1016/j.xkme.2023.100692. eCollection 2023 Sep.

Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study.机器学习在类风湿性关节炎患者电子健康记录识别中的应用：算法流程开发与验证研究。

JMIR Med Inform. 2020 Nov 30;8(11):e23930. doi: 10.2196/23930.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

Relational machine learning for electronic health record-driven phenotyping.用于电子健康记录驱动的表型分析的关系机器学习。

J Biomed Inform. 2014 Dec;52:260-70. doi: 10.1016/j.jbi.2014.07.007. Epub 2014 Jul 15.

Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients.可公开获取的机器学习模型，用于从住院患者的临床记录中识别阿片类药物滥用。

BMC Med Inform Decis Mak. 2020 Apr 29;20(1):79. doi: 10.1186/s12911-020-1099-y.

引用本文的文献

Variational quantum classifier-based early identification and classification of chronic kidney disease using sparse autoencoder and LASSO shrinkage.基于变分量子分类器，利用稀疏自编码器和套索收缩法对慢性肾脏病进行早期识别和分类

PeerJ Comput Sci. 2025 Apr 17;11:e2789. doi: 10.7717/peerj-cs.2789. eCollection 2025.

Machine-learning-based identification of patients with IgA nephropathy using a computerized medical billing database.利用计算机化医疗计费数据库，基于机器学习识别IgA肾病患者。

PLoS One. 2024 Dec 5;19(12):e0312915. doi: 10.1371/journal.pone.0312915. eCollection 2024.

Imbalanced class distribution and performance evaluation metrics: A systematic review of prediction accuracy for determining model performance in healthcare systems.不均衡的类别分布与性能评估指标：关于医疗系统中用于确定模型性能的预测准确性的系统综述

PLOS Digit Health. 2023 Nov 30;2(11):e0000290. doi: 10.1371/journal.pdig.0000290. eCollection 2023 Nov.

Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review.利用机器学习预测、诊断和治疗慢性肾脏病：系统文献回顾。

J Nephrol. 2023 May;36(4):1101-1117. doi: 10.1007/s40620-023-01573-4. Epub 2023 Feb 14.

A Hybrid Risk Factor Evaluation Scheme for Metabolic Syndrome and Stage 3 Chronic Kidney Disease Based on Multiple Machine Learning Techniques.基于多种机器学习技术的代谢综合征和3期慢性肾脏病混合风险因素评估方案

Healthcare (Basel). 2022 Dec 9;10(12):2496. doi: 10.3390/healthcare10122496.

Invasive Versus Medical Management in Patients With Chronic Kidney Disease and Non-ST-Segment-Elevation Myocardial Infarction.慢性肾脏病合并非ST段抬高型心肌梗死患者的侵入性治疗与药物治疗对比

J Am Heart Assoc. 2022 Jun 17;11(12):e025205. doi: 10.1161/JAHA.121.025205.

Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records.基于电子病历的机器学习预测糖尿病肾病 3 年风险。

J Transl Med. 2022 Mar 26;20(1):143. doi: 10.1186/s12967-022-03339-1.

Prediction of early clinical response in patients receiving tofacitinib in the OCTAVE Induction 1 and 2 studies.在OCTAVE诱导1期和2期研究中接受托法替布治疗的患者早期临床反应的预测

Therap Adv Gastroenterol. 2021 Nov 29;14:17562848211054710. doi: 10.1177/17562848211054710. eCollection 2021.

本文引用的文献

Defining Early Recovery of Acute Kidney Injury.急性肾损伤早期恢复的定义

Clin J Am Soc Nephrol. 2020 Sep 7;15(9):1358-1360. doi: 10.2215/CJN.13381019. Epub 2020 Apr 1.

Lab-based and diagnosis-based chronic kidney disease recognition and staging concordance.基于实验室和基于诊断的慢性肾脏病识别和分期的一致性。

BMC Nephrol. 2019 Sep 14;20(1):357. doi: 10.1186/s12882-019-1551-3.

Real world evidence in cardiovascular medicine: ensuring data validity in electronic health record-based studies.心血管医学中的真实世界证据：确保电子健康记录研究中的数据有效性。

J Am Med Inform Assoc. 2019 Nov 1;26(11):1189-1194. doi: 10.1093/jamia/ocz119.

Projection Word Embedding Model With Hybrid Sampling Training for Classifying ICD-10-CM Codes: Longitudinal Observational Study.用于对ICD-10-CM编码进行分类的混合采样训练投影词嵌入模型：纵向观察研究

JMIR Med Inform. 2019 Jul 23;7(3):e14499. doi: 10.2196/14499.

Can billing codes accurately identify rapidly progressing stage 3 and stage 4 chronic kidney disease patients: a diagnostic test study.能否通过计费代码准确识别快速进展的 3 期和 4 期慢性肾脏病患者：一项诊断性测试研究。

BMC Nephrol. 2019 Jul 12;20(1):260. doi: 10.1186/s12882-019-1429-4.

Intelligent Diagnostic Prediction and Classification System for Chronic Kidney Disease.智能慢性肾脏病诊断预测与分类系统。

Sci Rep. 2019 Jul 3;9(1):9583. doi: 10.1038/s41598-019-46074-2.

Deep Learning on Electronic Health Records to Improve Disease Coding Accuracy.基于电子健康记录的深度学习以提高疾病编码准确性。

AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:620-629. eCollection 2019.

Concordance between the Clinical Definition of Polypathological Patient versus Automated Detection by Means of Combined Identification through ICD-9-CM Codes.通过ICD-9-CM编码联合识别实现的多病理患者临床定义与自动检测之间的一致性。

J Clin Med. 2019 May 6;8(5):613. doi: 10.3390/jcm8050613.

Neural network and support vector machine for the prediction of chronic kidney disease: A comparative study.神经网络和支持向量机在慢性肾脏病预测中的比较研究。

Comput Biol Med. 2019 Jun;109:101-111. doi: 10.1016/j.compbiomed.2019.04.017. Epub 2019 Apr 25.

Comparison and development of machine learning tools in the prediction of chronic kidney disease progression.机器学习工具在慢性肾脏病进展预测中的比较与发展。

J Transl Med. 2019 Apr 11;17(1):119. doi: 10.1186/s12967-019-1860-0.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过整合不同电子健康数据资源并应用机器学习策略优化晚期慢性肾脏病及无肾脏疾病的识别

Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献