Qu Fang Zhou, Ding Jiang, An Xi Feng, Peng Rui, He Ni, Liu Sheng, Jiang Xin
Medical School, Xizang Minzu University, Xianyang, People's Republic of China.
Institute of Electrical Power Systems, Graz University of Technology, Graz, Austria.
Int J Gen Med. 2024 Dec 28;17:6523-6534. doi: 10.2147/IJGM.S493789. eCollection 2024.
Heart failure (HF) is a clinical syndrome in which structural or functional abnormalities of the heart result in impaired ventricular filling or ejection capacity. In order to improve the adaptability of models to different patient populations and data situations. This study aims to develop predictive models for HF risk using six machine learning algorithms, providing valuable insights into the early assessment and recognition of HF by clinical features.
The present study focused on clinical characteristics that significantly differed between groups with left ventricular ejection fractions (LVEF) [≤40% and >40%]. Following the elimination of features with significant missing values, the remaining features were utilized to construct predictive models employing six machine learning algorithms. The optimal model was selected based on various performance metrics, including the area under the curve (AUC), accuracy, precision, recall, and F1 score. Utilizing the optimal model, the significance of clinical features was assessed, and those with importance values exceeding 0.8 were identified as crucial to the study. Finally, a correlation analysis was conducted to examine the relationships between these features and other significant clinical features.
The logistic regression (LR) model was determined to be the optimal machine learning algorithm in this study, achieving an accuracy of 0.64, a precision of 0.45, a recall of 0.72, an F1 score of 0.51, and an AUC of 0.81 in the training set and 0.91 in the testing set. In addition, the analysis of feature importance indicated that blood calcium, angiotensin-converting enzyme inhibitors (ACEI) dosage, mean hemoglobin concentration, and survival duration were critical to the study, each possessing importance values exceeding 0.8. Furthermore, correlation analysis revealed a strong relationship between blood calcium and ionized calcium (|cor|=0.99), as well as a significant association between ACEI dosage (|cor|=0.68) and left ventricular metrics (|cor|=0.58); on the other hand, no correlations were observed between mean hemoglobin levels and other clinical characteristics.
The present study identified LR as the most effective risk prediction model for patients with HF, highlighting blood calcium, ACEI dosage, and mean hemoglobin level as significant predictors. These findings provide significant insights for the clinical prevention and early intervention of HF.
心力衰竭(HF)是一种临床综合征,其中心脏的结构或功能异常导致心室充盈或射血能力受损。为了提高模型对不同患者群体和数据情况的适应性。本研究旨在使用六种机器学习算法开发HF风险预测模型,通过临床特征为HF的早期评估和识别提供有价值的见解。
本研究重点关注左心室射血分数(LVEF)[≤40%和>40%]组之间有显著差异的临床特征。在消除具有大量缺失值的特征后,利用其余特征构建采用六种机器学习算法的预测模型。基于包括曲线下面积(AUC)、准确性、精确性、召回率和F1分数等各种性能指标选择最佳模型。利用最佳模型评估临床特征的重要性,将重要性值超过0.8的特征确定为对该研究至关重要的特征。最后,进行相关性分析以检查这些特征与其他重要临床特征之间的关系。
在本研究中,逻辑回归(LR)模型被确定为最佳机器学习算法,在训练集中准确率为0.64,精确率为0.45,召回率为0.72,F1分数为0.51,AUC为0.81,在测试集中AUC为0.91。此外,特征重要性分析表明血钙、血管紧张素转换酶抑制剂(ACEI)剂量、平均血红蛋白浓度和生存时间对该研究至关重要,每个特征的重要性值都超过0.8。此外,相关性分析显示血钙与离子钙之间存在强相关性(|cor| = 0.99),以及ACEI剂量(|cor| = 0.68)与左心室指标(|cor| = 0.58)之间存在显著关联;另一方面,未观察到平均血红蛋白水平与其他临床特征之间存在相关性。
本研究确定LR是HF患者最有效的风险预测模型,突出了血钙、ACEI剂量和平均血红蛋白水平作为重要预测指标。这些发现为HF的临床预防和早期干预提供了重要见解。