Comparison between traditional logistic regression and machine learning for predicting mortality in adult sepsis patients.

Author information

Wu Hongsheng, Liao Biling, Ji Tengfei, Ma Keqiang, Luo Yumei, Zhang Shengmin

Affiliation

Hepatobiliary Pancreatic Surgery Department, Huadu District People's Hospital of Guangzhou, Guangzhou, China.

Publication information

Front Med (Lausanne). 2025 Jan 6;11:1496869. doi: 10.3389/fmed.2024.1496869. eCollection 2024.

DOI:10.3389/fmed.2024.1496869

PMID:39835102

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11743956/

Abstract

BACKGROUND

Sepsis is a life-threatening disease associated with a high mortality rate, emphasizing the need for the exploration of novel models to predict the prognosis of this patient population. This study compared the performance of traditional logistic regression and machine learning models in predicting adult sepsis mortality.

OBJECTIVE

To develop an optimal model for predicting mortality in adult sepsis patients by comparing traditional logistic regression with machine learning methods.

METHODS

A retrospective analysis was conducted on 606 adult inpatients with sepsis treated at our medical center between January 2020 and December 2022. Patients were randomly divided into training and validation sets in a 7:3 ratio. Traditional logistic regression and machine learning methods were used to predict mortality in adult sepsis. Univariate analysis was used to identify risk factors for the logistic regression model, while Least Absolute Shrinkage and Selection Operator (LASSO) regression performed variable shrinkage and selection for the machine learning model. Among several machine learning models, including Bagged Tree, the one with the maximum area under the curve (AUC) was chosen for model construction. Model validation and comparison with the Sequential Organ Failure Assessment (SOFA) and the Acute Physiology and Chronic Health Evaluation (APACHE) scores were performed in the validation set using receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis (DCA) curves.
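The evaluation steps named above (a 7:3 random split, AUC from the ROC curve, and the net benefit that a decision curve plots) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names, the fixed seed, and the toy labels and scores are all hypothetical, and the abstract does not say which software was used.

```python
import random


def split_7_3(records, seed=0):
    """Shuffle a copy of the records and split it 70/30 (training, validation)."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # seed is an assumption, for reproducibility
    cut = round(0.7 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def auc(labels, scores):
    """AUC as the probability that a random positive case outscores a random
    negative case; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def net_benefit(labels, scores, threshold):
    """Decision-curve net benefit at one threshold probability pt:
    TP/n - FP/n * pt / (1 - pt)."""
    n = len(labels)
    tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


# 606 patient indices stand in for the cohort (hypothetical placeholder data).
train, valid = split_7_3(range(606))
print(len(train), len(valid))

# Toy labels (1 = died) and predicted risks, invented for illustration only.
toy_labels = [1, 1, 0, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.2, 0.6, 0.1]
print(auc(toy_labels, toy_scores))
print(net_benefit(toy_labels, toy_scores, 0.5))
```

A decision curve is obtained by evaluating `net_benefit` over a grid of thresholds and comparing the result with the "treat all" and "treat none" strategies.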

RESULTS

Univariate analysis was employed to assess 17 variables, namely gender, history of coronary heart disease (CHD), systolic pressure, white blood cell (WBC), neutrophil count (NEUT), lymphocyte count (LYMP), lactic acid, neutrophil-to-lymphocyte ratio (NLR), red blood cell distribution width (RDW), interleukin-6 (IL-6), prothrombin time (PT), international normalized ratio (INR), fibrinogen (FBI), D-dimer, aspartate aminotransferase (AST), total bilirubin (Tbil), and lung infection. Significant differences (P < 0.05) between the survival and non-survival groups were observed for these variables. Backward stepwise regression identified systolic pressure, lactic acid, NLR, RDW, IL-6, PT, and Tbil as independent risk factors. These factors were then incorporated into a logistic regression model, chosen for its minimum Akaike Information Criterion (AIC) value (98.65). Machine learning techniques were also applied, and the random forest (RF) model, which had the maximum AUC (0.999), was selected. LASSO regression with the lambda.1SE criterion, validated through ten-fold cross-validation, identified systolic pressure, lactic acid, NEUT, RDW, IL-6, INR, and Tbil as variables for constructing the RF model. The RF model was then validated and compared with the traditional logistic model and with SOFA and APACHE scoring.
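For readers unfamiliar with the AIC criterion mentioned above, the sketch below shows how it is computed for a binary-outcome model and how a minimum-AIC candidate would be picked. It is an illustration only: the candidate names, parameter counts, and predicted probabilities are hypothetical and are not taken from the study.

```python
import math


def log_likelihood(labels, probs):
    """Bernoulli log-likelihood of observed 0/1 labels under predicted probabilities."""
    return sum(math.log(p) if y == 1 else math.log(1 - p)
               for y, p in zip(labels, probs))


def aic(labels, probs, n_params):
    """Akaike Information Criterion: 2k - 2 ln(L), where k counts fitted parameters."""
    return 2 * n_params - 2 * log_likelihood(labels, probs)


def min_aic_model(labels, candidates):
    """Return the name of the candidate with the lowest AIC.

    `candidates` maps a model name to (predicted probabilities, parameter count)."""
    return min(candidates, key=lambda name: aic(labels, *candidates[name]))


# Hypothetical outcomes and two hypothetical candidate fits.
outcomes = [1, 0, 1, 0]
candidates = {
    "intercept_only": ([0.5, 0.5, 0.5, 0.5], 1),
    "two_predictors": ([0.9, 0.1, 0.8, 0.2], 3),
}
print(aic([1, 0], [0.5, 0.5], 1))
print(min_aic_model(outcomes, candidates))
```

Backward stepwise selection repeats this comparison: at each step it drops the variable whose removal lowers AIC the most and stops when no removal improves it.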

CONCLUSION

Based on deep machine learning principles, the RF model demonstrates advantages over traditional logistic regression models in predicting adult sepsis prognosis. The RF model holds significant potential for clinical surveillance and interventions to enhance outcomes for sepsis patients.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1168/11743956/274467e603d9/fmed-11-1496869-g001.jpg
