Benchmarking clinical risk prediction algorithms with ensemble machine learning: An illustration of the superlearner algorithm for the non-invasive diagnosis of liver fibrosis in non-alcoholic fatty liver disease.

Authors

Charu Vivek, Liang Jane W, Mannalithara Ajitha, Kwong Allison, Tian Lu, Kim W Ray

Publication information

medRxiv. 2023 Aug 4:2023.08.02.23293569. doi: 10.1101/2023.08.02.23293569.

DOI:10.1101/2023.08.02.23293569

PMID:37577485

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10418571/

Abstract

BACKGROUND AND AIMS

Ensemble machine learning (ML) methods can combine many individual models into a single 'super' model using an optimal weighted combination. Here we demonstrate how an underutilized ensemble model, the superlearner, can be used as a benchmark for model performance in clinical risk prediction. We illustrate this by implementing a superlearner to predict liver fibrosis in patients with non-alcoholic fatty liver disease (NAFLD).
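To make the idea of an optimal weighted combination concrete, here is a minimal sketch in Python (standard library only). It is not the authors' implementation: the toy data, the three base learners (a prevalence model and two median-split stumps), and the coarse grid search over convex weights are all assumptions chosen for brevity. Practical implementations generally use many more base learners and a proper optimizer rather than a grid; the shared principle is that the weights are chosen to minimize cross-validated risk.

```python
import random

def fit_prevalence(X, y):
    """Base learner 1: always predict the training prevalence."""
    p = sum(y) / len(y)
    return lambda row: p

def make_stump(j):
    """Base learner factory: split feature j at its training median."""
    def fit(X, y):
        cut = sorted(row[j] for row in X)[len(X) // 2]
        overall = sum(y) / len(y)
        hi = [yi for row, yi in zip(X, y) if row[j] >= cut]
        lo = [yi for row, yi in zip(X, y) if row[j] < cut]
        p_hi = sum(hi) / len(hi) if hi else overall
        p_lo = sum(lo) / len(lo) if lo else overall
        return lambda row: p_hi if row[j] >= cut else p_lo
    return fit

def out_of_fold(fits, X, y, n_folds=5):
    """Return an n x K matrix of cross-validated (out-of-fold) predictions."""
    n = len(y)
    Z = [[0.0] * len(fits) for _ in range(n)]
    for fold in range(n_folds):
        train = [i for i in range(n) if i % n_folds != fold]
        test = [i for i in range(n) if i % n_folds == fold]
        X_tr = [X[i] for i in train]
        y_tr = [y[i] for i in train]
        for k, fit in enumerate(fits):
            predict = fit(X_tr, y_tr)
            for i in test:
                Z[i][k] = predict(X[i])
    return Z

def mse(weights, Z, y):
    """Mean squared error of the weighted combination of base predictions."""
    total = 0.0
    for row, yi in zip(Z, y):
        total += (sum(w * z for w, z in zip(weights, row)) - yi) ** 2
    return total / len(y)

def simplex_grid(k, steps, total=None):
    """Yield non-negative weight vectors of length k that sum to 1."""
    total = steps if total is None else total
    if k == 1:
        yield [total / steps]
        return
    for i in range(total + 1):
        for rest in simplex_grid(k - 1, steps, total - i):
            yield [i / steps] + rest

def super_learner_weights(Z, y, steps=20):
    """Pick the convex weights with the lowest cross-validated MSE (grid search)."""
    return min(simplex_grid(len(Z[0]), steps), key=lambda w: mse(w, Z, y))

# Toy data (hypothetical): two features, binary outcome.
random.seed(0)
X = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(300)]
y = [1 if 1.5 * a + 0.5 * b + random.gauss(0, 1) > 0 else 0 for a, b in X]

fits = [fit_prevalence, make_stump(0), make_stump(1)]
Z = out_of_fold(fits, X, y)
weights = super_learner_weights(Z, y)
cv_risks = [mse([1.0 if j == k else 0.0 for j in range(len(fits))], Z, y)
            for k in range(len(fits))]
ensemble_risk = mse(weights, Z, y)
print("weights:", weights)
print("base CV risks:", cv_risks, "ensemble CV risk:", ensemble_risk)
```

Because the grid includes the corner points that put all weight on a single base learner, the ensemble's cross-validated risk in this sketch can never exceed that of the best individual learner. This is the intuition behind using a superlearner as a benchmark.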

METHODS

We trained a superlearner on 23 demographic and clinical variables to predict stage 2 or higher liver fibrosis. The superlearner was trained on data from the Non-alcoholic Steatohepatitis Clinical Research Network (NASH-CRN) observational study (n=648) and validated using data from participants in a randomized trial for NASH (the 'FLINT' trial, n=270) and from examinees with NAFLD in the National Health and Nutrition Examination Survey (NHANES, n=1244). We compared the performance of the superlearner with that of existing models, including FIB-4, NFS, Forns, APRI, BARD and SAFE.
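For orientation, two of the comparator scores have simple closed forms. The sketch below encodes the commonly published FIB-4 and APRI formulas. The function and parameter names are illustrative, the example inputs are made up, and the remaining scores (NFS, Forns, BARD, SAFE) are omitted because their coefficients are not given in this abstract.

```python
import math

def fib4(age_years, ast_u_l, alt_u_l, platelets_1e9_l):
    """FIB-4 = (age x AST) / (platelets x sqrt(ALT))."""
    return (age_years * ast_u_l) / (platelets_1e9_l * math.sqrt(alt_u_l))

def apri(ast_u_l, ast_uln_u_l, platelets_1e9_l):
    """APRI = (AST / upper limit of normal AST) x 100 / platelets."""
    return (ast_u_l / ast_uln_u_l) * 100 / platelets_1e9_l

# Hypothetical patient: age 50, AST 40 U/L, ALT 25 U/L, platelets 200 x 10^9/L.
print(fib4(50, 40, 25, 200))   # 2.0
print(apri(40, 40, 200))       # 0.5 (assumes an AST upper limit of normal of 40 U/L)
```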

RESULTS

In the FLINT and NHANES validation sets, the superlearner (derived from 12 base models) discriminated well between patients with and without significant fibrosis, with AUCs of 0.79 (95% CI: 0.73-0.84) and 0.74 (95% CI: 0.68-0.79), respectively. Among the existing scores considered, the SAFE score performed similarly to the superlearner, and both outperformed the FIB-4, APRI, Forns, and BARD scores in the validation datasets. A superlearner derived from 12 base models performed as well as one derived from 90 base models.
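The AUCs above measure how often a patient with significant fibrosis receives a higher predicted risk than a patient without it. The sketch below computes an AUC from that pairwise definition and attaches a confidence interval. The abstract does not state how its intervals were obtained, so the percentile bootstrap used here is an assumption for illustration, and the scores and labels are simulated.

```python
import random

def auc(scores, labels):
    """AUC as the share of (case, non-case) pairs ranked correctly; ties count half."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def bootstrap_auc_ci(scores, labels, n_boot=300, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (resampling patients with replacement)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain only one class
            stats.append(auc([scores[i] for i in idx], ys))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Simulated validation set (hypothetical): cases tend to score higher.
rng = random.Random(1)
labels = [rng.randint(0, 1) for _ in range(150)]
scores = [0.8 * y + rng.gauss(0, 1) for y in labels]

point = auc(scores, labels)
lo, hi = bootstrap_auc_ci(scores, labels)
print(f"AUC {point:.2f} (95% CI: {lo:.2f}-{hi:.2f})")
```

The same two functions can be applied to any score, which is what makes a side-by-side comparison of a superlearner with FIB-4 or SAFE straightforward once predictions are available.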

CONCLUSIONS

The superlearner, regarded as a "best-in-class" ML prediction method, performed better than most existing models commonly used in practice in detecting fibrotic NASH. The superlearner can be used to benchmark the performance of conventional clinical risk prediction models.
