Suppr超能文献

机器学习在继发性高血压病因诊断中的应用:基于电子病历的回顾性研究

An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records.

作者信息

Diao Xiaolin, Huo Yanni, Yan Zhanzheng, Wang Haibin, Yuan Jing, Wang Yuxin, Cai Jun, Zhao Wei

机构信息

Department of Information Center, Fuwai Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.

Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.

出版信息

JMIR Med Inform. 2021 Jan 25;9(1):e19739. doi: 10.2196/19739.

Abstract

BACKGROUND

Secondary hypertension is a kind of hypertension with a definite etiology and may be cured. Patients with suspected secondary hypertension can benefit from timely detection and treatment and, conversely, will have a higher risk of morbidity and mortality than those with primary hypertension.

OBJECTIVE

The aim of this study was to develop and validate machine learning (ML) prediction models of common etiologies in patients with suspected secondary hypertension.

METHODS

The analyzed data set was retrospectively extracted from electronic medical records of patients discharged from Fuwai Hospital between January 1, 2016, and June 30, 2019. A total of 7532 unique patients were included and divided into 2 data sets by time: 6302 patients in 2016-2018 as the training data set for model building and 1230 patients in 2019 as the validation data set for further evaluation. Extreme Gradient Boosting (XGBoost) was adopted to develop 5 models to predict 4 etiologies of secondary hypertension and occurrence of any of them (named as composite outcome), including renovascular hypertension (RVH), primary aldosteronism (PA), thyroid dysfunction, and aortic stenosis. Both univariate logistic analysis and Gini Impurity were used for feature selection. Grid search and 10-fold cross-validation were used to select the optimal hyperparameters for each model.

RESULTS

Validation of the composite outcome prediction model showed good performance with an area under the receiver-operating characteristic curve (AUC) of 0.924 in the validation data set, while the 4 prediction models of RVH, PA, thyroid dysfunction, and aortic stenosis achieved AUC of 0.938, 0.965, 0.959, and 0.946, respectively, in the validation data set. A total of 79 clinical indicators were identified in all and finally used in our prediction models. The result of subgroup analysis on the composite outcome prediction model demonstrated high discrimination with AUCs all higher than 0.890 among all age groups of adults.

CONCLUSIONS

The ML prediction models in this study showed good performance in detecting 4 etiologies of patients with suspected secondary hypertension; thus, they may potentially facilitate clinical diagnosis decision making of secondary hypertension in an intelligent way.

摘要

背景

继发性高血压是一种病因明确的高血压,有可能治愈。疑似继发性高血压患者可从及时检测和治疗中获益,相反,与原发性高血压患者相比,他们的发病和死亡风险更高。

目的

本研究旨在开发并验证疑似继发性高血压患者常见病因的机器学习(ML)预测模型。

方法

分析数据集是从2016年1月1日至2019年6月30日期间从阜外医院出院患者的电子病历中回顾性提取的。共纳入7532例患者,并按时间分为2个数据集:2016 - 2018年的6302例患者作为模型构建的训练数据集,2019年的1230例患者作为进一步评估的验证数据集。采用极端梯度提升(XGBoost)开发5个模型,以预测继发性高血压的4种病因以及其中任何一种病因的发生情况(称为复合结局),包括肾血管性高血压(RVH)、原发性醛固酮增多症(PA)、甲状腺功能障碍和主动脉狭窄。单因素逻辑分析和基尼不纯度均用于特征选择。网格搜索和10折交叉验证用于为每个模型选择最佳超参数。

结果

复合结局预测模型在验证数据集中的受试者工作特征曲线下面积(AUC)为0.924,表现良好,而RVH、PA、甲状腺功能障碍和主动脉狭窄的4个预测模型在验证数据集中的AUC分别为0.938、0.965、0.959和0.946。总共识别出79项临床指标,最终用于我们的预测模型。复合结局预测模型的亚组分析结果表明,在所有成人年龄组中,其区分度较高,AUC均高于0.890。

结论

本研究中的ML预测模型在检测疑似继发性高血压患者的4种病因方面表现良好;因此,它们可能潜在地以智能方式促进继发性高血压的临床诊断决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ae0/7870351/6de395e95fb0/medinform_v9i1e19739_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验