Gong Lingmin, Chen Shiyu, Yang Yuhui, Hu Weiwei, Cai Jiaxin, Liu Sitong, Zhao Yaling, Pei Leilei, Ma Jiaojiao, Chen Fangyao
Department of Epidemiology and Biostatistics, School of Public Health, Xi'an Jiaotong University Health Science Center, Xi'an, Shaanxi, China.
Department of Neurology, Xi'an Gaoxin Hospital, Xi'an, Shaanxi, China.
Digit Health. 2024 Oct 8;10:20552076241288833. doi: 10.1177/20552076241288833. eCollection 2024 Jan-Dec.
Ischemic stroke (IS) accounts large amount of stroke incidence. The aim of this study was to discover the risk and prognostic factors that affecting the occurrence of IS in hypertensive patients.
Study data were obtained from the Medical Information Mart for Intensive Care (MIMIC)-IV database. To avoid biased factors selection process, several approaches were studied including logistic regression, elastic net regression, random forest, correlation analysis, and multifactor logistic regression methods. And seven different machine-learning methods are used to construct predictive models. The performance of the developed models was evaluated using AUC (Area Under the Curve), prediction accuracy, precision, recall, F1 score, PPV (Positive Predictive Value) and NPV (Negative Predictive Value). Interaction analysis was conducted to explore potential relationships between influential factors.
The study included 92,514 hypertensive patients, of which 1746 hypertensive patients experienced IS. The Gradient Boosted Decision Tree (GBDT) model outperformed the other prediction model terms of prediction accuracy and AUC values in both ischemic and prognosis cases. By using the SHapley Additive exPlanations (SHAP), we found that a range of factors and corresponding interactions between factors are important risk factors for IS and its prognosis in hypertensive patients.
The study identified factors that increase the risk of IS and poor prognosis in hypertensive patients, which may provide guidance for clinical diagnosis and treatment.
缺血性中风(IS)占中风发病率的很大一部分。本研究的目的是发现影响高血压患者发生IS的风险和预后因素。
研究数据来自重症监护医学信息集市(MIMIC)-IV数据库。为避免选择过程中的偏倚因素,研究了多种方法,包括逻辑回归、弹性网回归、随机森林、相关分析和多因素逻辑回归方法。使用七种不同的机器学习方法构建预测模型。使用曲线下面积(AUC)、预测准确性、精确率、召回率、F1分数、阳性预测值(PPV)和阴性预测值(NPV)评估所开发模型的性能。进行交互分析以探索影响因素之间的潜在关系。
该研究纳入了92514名高血压患者,其中1746名高血压患者发生了IS。在缺血和预后病例中,梯度提升决策树(GBDT)模型在预测准确性和AUC值方面均优于其他预测模型。通过使用SHapley加法解释(SHAP),我们发现一系列因素以及因素之间的相应相互作用是高血压患者IS及其预后的重要危险因素。
该研究确定了增加高血压患者IS风险和不良预后的因素,这可能为临床诊断和治疗提供指导。