Pu Jianchen, Yao Yimin, Wang Xiaochun
Medical Laboratory, Nanxun Hospital of Traditional Chinese Medicine, Huzhou, China.
Medical Laboratory, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Chinese Medicine), Hangzhou, China.
Front Cardiovasc Med. 2025 Mar 17;12:1539966. doi: 10.3389/fcvm.2025.1539966. eCollection 2025.
Heart failure (HF), a core component of cardiovascular diseases, is characterized by high morbidity and mortality worldwide. By collecting and analyzing routine blood data, machine learning models were built to identify the patterns of changes in blood indicators related to HF.
We conducted a statistical analysis of routine blood data from 226 patients who visited Zhejiang Provincial Hospital of Traditional Chinese Medicine (Hubin) between May 1, 2024, and June 30, 2024. The patients were divided into an experimental group (HF patients) and a normal control group. Additionally, 211 patients from the Qiantang and Xixi centers formed an independent external validation cohort. This study used both univariate and multivariate analyses to identify the risk factors associated with HF. Variables associated with HF were selected using LASSO regression analysis. In addition, eight different machine learning algorithms were applied for prediction, and the prediction performances of these algorithms were comprehensively evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis and confusion matrix.
Using LASSO regression analysis, leukocyte, neutrophil, red blood cell, hemoglobin, platelet, and monocyte-to-lymphocyte ratios were identified as risk factors for HF. Among the evaluated models, the random forest model exhibited the best performance. In the validation cohort, the area under the curve (AUC) of the model was 0.948, while that of the test cohort was 1.000. The calibration curve revealed good agreement between the actual and predicted probabilities, whereas the decision curve showed the significant clinical application of the model. Additionally, the AUC of the model in the external independent test cohort was 0.945.
We used an online predictive tool to develop a predictive machine-learning model. The main purpose of this model was to predict the probability of developing HF in the future. This prediction can provide strong support and references for clinicians when making decisions. This online forecasting tool not only processes a large amount of data but also continuously optimizes and adjusts the accuracy of the model according to the latest medical research and clinical data. We hope to identify high-risk patients for early intervention to reduce the incidence of HF and improve their quality of life.
心力衰竭(HF)是心血管疾病的核心组成部分,在全球范围内具有高发病率和高死亡率的特点。通过收集和分析常规血液数据,构建机器学习模型以识别与HF相关的血液指标变化模式。
我们对2024年5月1日至2024年6月30日期间就诊于浙江省中医院(湖滨院区)的226例患者的常规血液数据进行了统计分析。将患者分为实验组(HF患者)和正常对照组。此外,来自钱塘和西溪中心的211例患者组成独立的外部验证队列。本研究采用单因素和多因素分析来确定与HF相关的危险因素。使用LASSO回归分析选择与HF相关的变量。此外,应用八种不同的机器学习算法进行预测,并使用受试者工作特征曲线、曲线下面积(AUC)、校准曲线分析、决策曲线分析和混淆矩阵对这些算法的预测性能进行综合评估。
通过LASSO回归分析,确定白细胞、中性粒细胞、红细胞、血红蛋白、血小板和单核细胞与淋巴细胞比值为HF的危险因素。在评估的模型中,随机森林模型表现最佳。在验证队列中,该模型的曲线下面积(AUC)为0.948,而在测试队列中为1.000。校准曲线显示实际概率与预测概率之间具有良好的一致性,而决策曲线显示该模型具有显著的临床应用价值。此外,该模型在外部独立测试队列中的AUC为0.945。
我们使用在线预测工具开发了一个预测性机器学习模型。该模型的主要目的是预测未来发生HF的概率。这一预测可为临床医生决策提供有力支持和参考。这种在线预测工具不仅能处理大量数据,还能根据最新的医学研究和临床数据不断优化和调整模型的准确性。我们希望识别出高危患者以便进行早期干预,从而降低HF的发病率并提高患者的生活质量。