随机森林算法在预测重症急性胰腺炎中的应用。

Usefulness of Random Forest Algorithm in Predicting Severe Acute Pancreatitis.

机构信息

Department of Gastroenterology and Hepatology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.

School of the First Clinical Medical Sciences, Wenzhou Medical University, Wenzhou, China.

出版信息

Front Cell Infect Microbiol. 2022 Jun 10;12:893294. doi: 10.3389/fcimb.2022.893294. eCollection 2022.

DOI:10.3389/fcimb.2022.893294

PMID:35755843

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9226542/

Abstract

BACKGROUND AND AIMS

This study aimed to develop an interpretable random forest model for predicting severe acute pancreatitis (SAP).

METHODS

Clinical and laboratory data of 648 patients with acute pancreatitis were retrospectively reviewed and randomly assigned to the training set and test set in a 3:1 ratio. Univariate analysis was used to select candidate predictors for the SAP. Random forest (RF) and logistic regression (LR) models were developed on the training sample. The prediction models were then applied to the test sample. The performance of the risk models was measured by calculating the area under the receiver operating characteristic (ROC) curves (AUC) and area under precision recall curve. We provide visualized interpretation by using local interpretable model-agnostic explanations (LIME).

RESULTS

The LR model was developed to predict SAP as the following function: -1.10-0.13×albumin (g/L) + 0.016 × serum creatinine (μmol/L) + 0.14 × glucose (mmol/L) + 1.63 × pleural effusion (0/1)(No/Yes). The coefficients of this formula were utilized to build a nomogram. The RF model consists of 16 variables identified by univariate analysis. It was developed and validated by a tenfold cross-validation on the training sample. Variables importance analysis suggested that blood urea nitrogen, serum creatinine, albumin, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, calcium, and glucose were the most important seven predictors of SAP. The AUCs of RF model in tenfold cross-validation of the training set and the test set was 0.89 and 0.96, respectively. Both the area under precision recall curve and the diagnostic accuracy of the RF model were higher than that of both the LR model and the BISAP score. LIME plots were used to explain individualized prediction of the RF model.

CONCLUSIONS

An interpretable RF model exhibited the highest discriminatory performance in predicting SAP. Interpretation with LIME plots could be useful for individualized prediction in a clinical setting. A nomogram consisting of albumin, serum creatinine, glucose, and pleural effusion was useful for prediction of SAP.

摘要

背景与目的

本研究旨在开发一种用于预测重症急性胰腺炎（SAP）的可解释随机森林模型。

方法

回顾性分析了 648 例急性胰腺炎患者的临床和实验室数据，并按 3:1 的比例随机分配到训练集和测试集中。采用单因素分析筛选 SAP 的候选预测因子。在训练样本上建立随机森林（RF）和逻辑回归（LR）模型。然后将预测模型应用于测试样本。通过计算受试者工作特征（ROC）曲线下面积（AUC）和精度召回曲线下面积来衡量风险模型的性能。我们通过使用局部可解释模型不可知解释（LIME）提供可视化解释。

结果

建立了预测 SAP 的 LR 模型，其函数如下：-1.10-0.13×白蛋白（g/L）+0.016×血清肌酐（μmol/L）+0.14×血糖（mmol/L）+1.63×胸腔积液（0/1）（无/有）。该公式的系数用于构建列线图。RF 模型由单因素分析确定的 16 个变量组成。它是在训练样本上通过十折交叉验证开发和验证的。变量重要性分析表明，血尿素氮、血清肌酐、白蛋白、高密度脂蛋白胆固醇、低密度脂蛋白胆固醇、钙和血糖是 SAP 最重要的七个预测因子。在训练集和测试集的十折交叉验证中，RF 模型的 AUC 分别为 0.89 和 0.96。RF 模型的精度召回曲线下面积和诊断准确性均高于 LR 模型和 BISAP 评分。使用 LIME 图解释 RF 模型的个体化预测。

结论

可解释的 RF 模型在预测 SAP 方面表现出最高的判别性能。使用 LIME 图进行解释对于临床环境中的个体化预测可能是有用的。由白蛋白、血清肌酐、血糖和胸腔积液组成的列线图可用于预测 SAP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ee/9226542/14a4caf5d1d3/fcimb-12-893294-g001.jpg

相似文献

Usefulness of Random Forest Algorithm in Predicting Severe Acute Pancreatitis.

Front Cell Infect Microbiol. 2022 Jun 10;12:893294. doi: 10.3389/fcimb.2022.893294. eCollection 2022.

A Comparison of XGBoost, Random Forest, and Nomograph for the Prediction of Disease Severity in Patients With COVID-19 Pneumonia: Implications of Cytokine and Immune Cell Profile.

Front Cell Infect Microbiol. 2022 Apr 12;12:819267. doi: 10.3389/fcimb.2022.819267. eCollection 2022.

[Application of machine learning model based on XGBoost algorithm in early prediction of patients with acute severe pancreatitis].

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Apr;35(4):421-426. doi: 10.3760/cma.j.cn121430-20221019-00930.

Automated Machine Learning for the Early Prediction of the Severity of Acute Pancreatitis in Hospitals.

Front Cell Infect Microbiol. 2022 Jun 10;12:886935. doi: 10.3389/fcimb.2022.886935. eCollection 2022.

Automated machine learning for early prediction of acute kidney injury in acute pancreatitis.

BMC Med Inform Decis Mak. 2024 Jan 11;24(1):16. doi: 10.1186/s12911-024-02414-5.

Development and validation of a risk prediction score for the severity of acute hypertriglyceridemic pancreatitis in Chinese patients.

World J Gastroenterol. 2022 Sep 7;28(33):4846-4860. doi: 10.3748/wjg.v28.i33.4846.

Prediction of severe acute pancreatitis using classification and regression tree analysis.

Dig Dis Sci. 2011 Dec;56(12):3664-71. doi: 10.1007/s10620-011-1849-x. Epub 2011 Aug 11.

Development and evaluation of machine learning models and nomogram for the prediction of severe acute pancreatitis.

J Gastroenterol Hepatol. 2023 Mar;38(3):468-475. doi: 10.1111/jgh.16125. Epub 2023 Jan 27.

Development and validation of a risk prediction score for severe acute pancreatitis.

J Transl Med. 2019 May 8;17(1):146. doi: 10.1186/s12967-019-1903-6.

Establishment and validation of early prediction model for hypertriglyceridemic severe acute pancreatitis.

Lipids Health Dis. 2023 Dec 8;22(1):218. doi: 10.1186/s12944-023-01984-z.

引用本文的文献

Predicting intraoperative blood loss risk in severe lumbar disc herniation patients undergoing PLIF: a multicenter cohort study using ensemble learning.

Int J Surg. 2025 Sep 1;111(9):5904-5913. doi: 10.1097/JS9.0000000000002730. Epub 2025 Jun 19.

Constructing a prediction model for acute pancreatitis severity based on liquid neural network.

Sci Rep. 2025 May 13;15(1):16655. doi: 10.1038/s41598-025-01218-5.

Investigating Protective and Risk Factors and Predictive Insights for Aboriginal Perinatal Mental Health: Explainable Artificial Intelligence Approach.

J Med Internet Res. 2025 Apr 30;27:e68030. doi: 10.2196/68030.

The Prognostic Value of Red Blood Cell Distribution Width-to-Albumin Ratio (RAR) in Predicting Mortality and Severity in Acute Pancreatitis: A Systematic Review and Meta-Analysis.

Cureus. 2025 Mar 27;17(3):e81279. doi: 10.7759/cureus.81279. eCollection 2025 Mar.

Predictive factors at emergency department admission for a complicated course of acute pancreatitis.

Ulus Travma Acil Cerrahi Derg. 2025 Apr;31(4):341-349. doi: 10.14744/tjtes.2025.05070.

Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: a single-centre retrospective cohort study in China.

BMJ Open. 2025 Apr 3;15(4):e092463. doi: 10.1136/bmjopen-2024-092463.

A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality.

PLoS Med. 2025 Feb 24;22(2):e1004432. doi: 10.1371/journal.pmed.1004432. eCollection 2025 Feb.

Enhancing clinical decision-making in closed pelvic fractures with machine learning models.

Biomol Biomed. 2025 May 8;25(7):1491-1507. doi: 10.17305/bb.2024.10802.

Prediction of acute myeloid leukemia prognosis based on autophagy features and characterization of its immune microenvironment.

Front Immunol. 2024 Nov 22;15:1489171. doi: 10.3389/fimmu.2024.1489171. eCollection 2024.

A Random Forest Algorithm for Assessing Risk Factors Associated With Chronic Kidney Disease: Observational Study.

Asian Pac Isl Nurs J. 2024 Jun 3;8:e48378. doi: 10.2196/48378.

本文引用的文献

Explainable machine learning to predict long-term mortality in critically ill ventilated patients: a retrospective study in central Taiwan.

BMC Med Inform Decis Mak. 2022 Mar 25;22(1):75. doi: 10.1186/s12911-022-01817-6.

Hypoalbuminemia affects one third of acute pancreatitis patients and is independently associated with severity and mortality.

Sci Rep. 2021 Dec 17;11(1):24158. doi: 10.1038/s41598-021-03449-8.

Machine learning predictive models for acute pancreatitis: A systematic review.

Int J Med Inform. 2022 Jan;157:104641. doi: 10.1016/j.ijmedinf.2021.104641. Epub 2021 Nov 10.

Pleural effusion volume in patients with acute pancreatitis: a retrospective study from three acute pancreatitis centers.

Ann Med. 2021 Dec;53(1):2003-2018. doi: 10.1080/07853890.2021.1998594.

Early prediction of severe acute pancreatitis using machine learning.

Pancreatology. 2022 Jan;22(1):43-50. doi: 10.1016/j.pan.2021.10.003. Epub 2021 Oct 16.

Serum Albumin in Health and Disease: Esterase, Antioxidant, Transporting and Signaling Properties.

Int J Mol Sci. 2021 Sep 25;22(19):10318. doi: 10.3390/ijms221910318.

Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology.

Can J Cardiol. 2022 Feb;38(2):204-213. doi: 10.1016/j.cjca.2021.09.004. Epub 2021 Sep 14.

Prognostic Value of Glucose-to-Lymphocyte Ratio in Critically Ill Patients with Acute Pancreatitis.

Int J Gen Med. 2021 Sep 8;14:5449-5460. doi: 10.2147/IJGM.S327123. eCollection 2021.

Critically Ill . Non-Critically Ill Patients With COVID-19 Pneumonia: Clinical Features, Laboratory Findings, and Prediction.

Front Cell Infect Microbiol. 2021 Jul 13;11:550456. doi: 10.3389/fcimb.2021.550456. eCollection 2021.

Blood glucose-related indicators are associated with in-hospital mortality in critically ill patients with acute pancreatitis.

Sci Rep. 2021 Jul 28;11(1):15351. doi: 10.1038/s41598-021-94697-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

随机森林算法在预测重症急性胰腺炎中的应用。

Usefulness of Random Forest Algorithm in Predicting Severe Acute Pancreatitis.

机构信息

Department of Gastroenterology and Hepatology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.

School of the First Clinical Medical Sciences, Wenzhou Medical University, Wenzhou, China.