Chen Jianhui, Huang Minghuan, Xu Rongbin, Lin Yanya, Chen Shijun, Xie Ying, Hu Jianxiong
Department of Critical Care Medicine, Affiliated Hospital of Putian University Putian 351100, Fujian, China.
Department of Nephrology, Affiliated Hospital of Putian University Putian 351100, Fujian, China.
Am J Transl Res. 2025 Apr 15;17(4):2678-2689. doi: 10.62347/MMNQ1017. eCollection 2025.
To explore the correlation between Blood Routine Indicators (BRI) and sepsis using machine learning algorithms (MLAs) and evaluate their application in early sepsis for prognosis assessment.
A total of 4,558 blood routine data (BRD) samples were collected, including 149 sepsis patients and 186 patients with common infections (CI). A binary logistic regression model (BLRM) was constructed to predict sepsis based on BRI. Additionally, MLAs were applied, including support vector machines, neural networks, Bayesian classifiers, k-nearest neighbors), decision trees, and random forest classification models (RFCM). The performance of these seven predictive models was evaluated.
The RFCM demonstrated the best predictive performance among the MLAs, with accuracy of 86.97%, precision of 87.02%, recall of 86.97%, and F1 score of 0.87. These metrics were significantly higher than those of the BLRM (accuracy: 68.77%, precision PRE: 71.45%, recall: 69.47%, F1 Score: 0.70). In the random forest model, red blood cell volume distribution width (RDW) was identified as the most significant feature, with RDW-coefficient of variation contributing 6.98% and RDW-standard deviation contributing 5.32%.
Combining blood routine indicators (BRI) with MLA has considerable potential in predicting sepsis. The RFCM showed the highest predictive value, and RDW may play a crucial role in sepsis prediction.
运用机器学习算法(MLA)探索血常规指标(BRI)与脓毒症之间的相关性,并评估其在早期脓毒症预后评估中的应用。
共收集4558份血常规数据(BRD)样本,其中包括149例脓毒症患者和186例普通感染(CI)患者。构建二元逻辑回归模型(BLRM)以基于BRI预测脓毒症。此外,还应用了MLA,包括支持向量机、神经网络、贝叶斯分类器、k近邻、决策树和随机森林分类模型(RFCM)。评估这七种预测模型的性能。
RFCM在MLA中表现出最佳的预测性能,准确率为86.97%,精确率为87.02%,召回率为86.97%,F1分数为0.87。这些指标显著高于BLRM(准确率:68.77%,精确率PRE:71.45%,召回率:69.47%,F1分数:0.70)。在随机森林模型中,红细胞体积分布宽度(RDW)被确定为最显著的特征,RDW变异系数贡献了6.98%,RDW标准差贡献了5.32%。
将血常规指标(BRI)与MLA相结合在预测脓毒症方面具有相当大的潜力。RFCM显示出最高的预测价值,并且RDW可能在脓毒症预测中起关键作用。