• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于健康体检队列的非酒精性脂肪性肝病机器学习生存模型

Machine learning survival models for Non-alcoholic fatty liver disease based on a health checkup cohort.

作者信息

Zhang Hongyu, Zhang Li, Li Na, Zhang Yongsheng, Zhang Xiaowen, Wang Dawei

机构信息

The First Clinical Medical College, Shandong University of Traditional Chinese Medicine, Jinan, China.

Health Management Center, Institute of Health Management, Shandong Engineering Laboratory of Health Management, the First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Jinan, China.

出版信息

BMC Gastroenterol. 2025 Jul 15;25(1):518. doi: 10.1186/s12876-025-04120-6.

DOI:10.1186/s12876-025-04120-6
PMID:40660095
Abstract

OBJECTIVES

This study aimed to develop an accurate prediction model for the risk of Non-alcoholic fatty liver disease (NAFLD) using the random survival forests (RSF), and to investigate the distribution of NAFLD risk with time.

METHODS

This retrospective cohort study included subjects who had annual health checkups from 1 January 2021 to 31 December 2024. The hold-out strategy, that all the subjects were divided into a training set and a test set, was employed to develop and evaluate our models. Important predictors were then extracted from all the candidate variables using the LASSO regression on the training set. Two prediction models were constructed using the Cox model and the RSF model. Feature importance and their 95% CIs were calculated using the VIMP with bootstrap resampling. The integrated area under the curve (iAUC), the time-dependent area under the curve (tAUC), the integrated Brier score (iBS), and the time-dependent prediction error (PE) were used to evaluate the discrimination and calibration of our models.

RESULTS

A total of 18,250 patients fulfilled the criteria, and 14 predictors were extracted through the LASSO regression for the next model development. The RSF model showed exceptional discrimination (iAUC of 0.856) and calibration (iBS of 0.116) compared to the Cox model (iAUC of 0.759 and iBS of 0.148). Based on the RSF model predictions, subjects were stratified into the high- and low-risk groups with significant differences, with a mean NAFLD-free time of 20.86 and 36.76 months (P <.0001), respectively.

CONCLUSIONS

In this study, the RSF prediction model for the risk of NAFLD was developed, which outperformed the traditional Cox model, achieved remarkable risk stratification for NAFLD, and provided novel insights into the distribution of NAFLD risk with time.

摘要

目的

本研究旨在使用随机生存森林(RSF)开发一种准确的非酒精性脂肪性肝病(NAFLD)风险预测模型,并研究NAFLD风险随时间的分布情况。

方法

这项回顾性队列研究纳入了2021年1月1日至2024年12月31日期间进行年度健康检查的受试者。采用留出法策略,即将所有受试者分为训练集和测试集,用于开发和评估我们的模型。然后在训练集上使用LASSO回归从所有候选变量中提取重要预测因子。使用Cox模型和RSF模型构建了两个预测模型。使用带有自助重采样的VIMP计算特征重要性及其95%置信区间。使用综合曲线下面积(iAUC)、时间依赖性曲线下面积(tAUC)、综合Brier评分(iBS)和时间依赖性预测误差(PE)来评估我们模型的区分度和校准度。

结果

共有18250名患者符合标准,通过LASSO回归提取了14个预测因子用于后续模型开发。与Cox模型(iAUC为0.759,iBS为0.148)相比,RSF模型表现出卓越的区分度(iAUC为0.856)和校准度(iBS为0.116)。基于RSF模型的预测,受试者被分为高风险组和低风险组,差异显著,无NAFLD的平均时间分别为20.86个月和36.76个月(P <.0001)。

结论

在本研究中,开发了NAFLD风险的RSF预测模型该模型优于传统的Cox模型,实现了显著的NAFLD风险分层,并为NAFLD风险随时间的分布提供了新的见解。

相似文献

1
Machine learning survival models for Non-alcoholic fatty liver disease based on a health checkup cohort.基于健康体检队列的非酒精性脂肪性肝病机器学习生存模型
BMC Gastroenterol. 2025 Jul 15;25(1):518. doi: 10.1186/s12876-025-04120-6.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
Development and Spatial External Validation of a Predictive Model of Survival Based on Random Survival Forest Analysis for People Living With HIV and AIDS After Highly Active Antiretroviral Therapy in China: Retrospective Cohort Study.基于随机生存森林分析的中国接受高效抗逆转录病毒治疗的艾滋病毒/艾滋病患者生存预测模型的开发与空间外部验证:回顾性队列研究
J Med Internet Res. 2025 Jun 2;27:e71257. doi: 10.2196/71257.
5
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
6
Machine learning models integrating dietary data predict all-cause mortality in U.S. NAFLD patients: an NHANES-based study.整合饮食数据的机器学习模型可预测美国非酒精性脂肪性肝病患者的全因死亡率:一项基于美国国家健康与营养检查调查的研究
Nutr J. 2025 Jul 1;24(1):100. doi: 10.1186/s12937-025-01170-0.
7
Interpretable machine learning analysis of immunoinflammatory biomarkers for predicting CHD among NAFLD patients.用于预测非酒精性脂肪性肝病患者冠心病的免疫炎症生物标志物的可解释机器学习分析
Cardiovasc Diabetol. 2025 Jul 3;24(1):263. doi: 10.1186/s12933-025-02818-1.
8
Machine Learning Did Not Outperform Conventional Competing Risk Modeling to Predict Revision Arthroplasty.在预测翻修关节成形术方面,机器学习的表现并未优于传统的竞争风险模型。
Clin Orthop Relat Res. 2024 Aug 1;482(8):1472-1482. doi: 10.1097/CORR.0000000000003018. Epub 2024 Mar 12.
9
Comparison of Random Survival Forest Based-Overall Survival With Deep Learning and Cox Proportional Hazard Models in HER-2-Positive HR-Negative Breast Cancer.基于随机生存森林的总生存期与深度学习及Cox比例风险模型在HER-2阳性HR阴性乳腺癌中的比较
Cancer Rep (Hoboken). 2025 Jul;8(7):e70262. doi: 10.1002/cnr2.70262.
10
Competing risk and random survival forest models for predicting survival in post-resection elderly stage I-III colorectal cancer patients.用于预测I-III期老年结直肠癌患者术后生存情况的竞争风险和随机生存森林模型
Sci Rep. 2025 Jul 7;15(1):24269. doi: 10.1038/s41598-025-05824-1.

本文引用的文献

1
NON-ALCOHOLIC FATTY LIVER DISEASE AND RELATIONSHIP WITH ADIPOSITY IN NIGERIAN PATIENTS WITH TYPE 2 DIABETES MELLITUS: THE IBADAN EXPERIENCE.尼日利亚2型糖尿病患者的非酒精性脂肪性肝病及其与肥胖的关系:伊巴丹经验
Ann Ib Postgrad Med. 2024 Aug 30;22(2):74-80.
2
Baseline and change in serum lipid and uric acid level over time and incident of nonalcoholic fatty liver disease (NAFLD) in Chinese adults.中国成年人血清脂质和尿酸水平随时间的变化及其与非酒精性脂肪肝(NAFLD)发病的关系。
Sci Rep. 2024 Aug 9;14(1):18547. doi: 10.1038/s41598-024-69411-6.
3
survex: an R package for explaining machine learning survival models.
survex:一个用于解释机器学习生存模型的 R 包。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad723.
4
The Future Is Beyond Bright: The Evolving Role of Quantitative US for Fatty Liver Disease.未来一片光明:定量超声在脂肪性肝病中的作用不断演变。
Radiology. 2023 Nov;309(2):e223146. doi: 10.1148/radiol.223146.
5
Multi-system diseases and death trajectory of metabolic dysfunction-associated fatty liver disease: findings from the UK Biobank.代谢相关脂肪性肝病相关多系统疾病和死亡轨迹:来自英国生物库的研究结果。
BMC Med. 2023 Oct 20;21(1):398. doi: 10.1186/s12916-023-03080-6.
6
A comparison of random survival forest and Cox regression for prediction of mortality in patients with hemorrhagic stroke.随机生存森林与 Cox 回归在预测出血性脑卒中患者死亡率中的比较。
BMC Med Inform Decis Mak. 2023 Oct 13;23(1):215. doi: 10.1186/s12911-023-02293-2.
7
The burden of nonalcoholic fatty liver disease (NAFLD) is rapidly growing in every region of the world from 1990 to 2019.从 1990 年到 2019 年,全球各地区非酒精性脂肪性肝病(NAFLD)的负担正在迅速增加。
Hepatol Commun. 2023 Oct 2;7(10). doi: 10.1097/HC9.0000000000000251. eCollection 2023 Oct 1.
8
Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study.利用机器学习模型预测非酒精性脂肪性肝病的 5 年风险:前瞻性队列研究。
J Med Internet Res. 2023 Sep 12;25:e46891. doi: 10.2196/46891.
9
Machine Learning-Based Prognostic Model for Patients After Lung Transplantation.基于机器学习的肺移植术后患者预后模型。
JAMA Netw Open. 2023 May 1;6(5):e2312022. doi: 10.1001/jamanetworkopen.2023.12022.
10
Machine learning versus regression for prediction of sporadic pancreatic cancer.机器学习与回归在预测散发性胰腺癌中的比较。
Pancreatology. 2023 Jun;23(4):396-402. doi: 10.1016/j.pan.2023.04.009. Epub 2023 Apr 27.