Development and validation of deep learning- and ensemble learning-based biological ages in the NHANES study.

Authors

Huang Yushu, Yang Xifan, Wang Qi, Abula Adila, Dong Yue, Li Wenyuan

Affiliations

Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.

Zhejiang Provincial Key Laboratory of Intelligent Preventive Medicine, Hangzhou, Zhejiang, China.

Publication information

Front Aging Neurosci. 2025 Jul 16;17:1532884. doi: 10.3389/fnagi.2025.1532884. eCollection 2025.

DOI: 10.3389/fnagi.2025.1532884
PMID: 40741049
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12307447/
Abstract

INTRODUCTION

Conventional machine learning (ML) approaches for constructing biological age (BA) have predominantly relied on blood-based markers, limiting their scope. This study aims to develop and validate novel ML-based BA models using a comprehensive set of clinical, behavioral, and socioeconomic factors and evaluate their predictive performance for mortality.

METHODS

We analyzed data from 24,985 participants in the National Health and Nutrition Examination Survey (NHANES) from 1999 to 2010, with follow-up extending to 31 December 2019, or until death or loss to follow-up. Thirty features, including blood and urine biochemistry, physical examination data, behavioral traits, and socioeconomic factors, were selected using the Least Absolute Shrinkage and Selection Operator (LASSO). These features were utilized to train deep neural networks (DNN) and ensemble learning models, specifically the Deep Biological Age (DBA) and Ensemble Biological Age (EnBA), with chronological age (CA) as the reference label. Model performance was assessed using mean absolute error (MAE), while interpretability was explored using SHapley Additive exPlanations (SHAP). Predictive accuracy of DBA and EnBA for mortality was compared with Phenotypic Age (PhenoAge) using the area under the curve (AUC) derived from Cox proportional hazards models and hazard ratios (HR), adjusted for demographics and lifestyle factors. Sensitivity analyses were performed to ensure robustness.
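As a rough illustration of two quantities named in the methods, the sketch below computes the MAE between predicted biological age and chronological age, and one common definition of age acceleration: the residual from a simple linear regression of BA on CA. The abstract does not state how acceleration was defined, so that definition, the function names, and the toy numbers are all assumptions made for illustration, not the authors' implementation.

```python
from statistics import mean


def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and actual ages."""
    return mean(abs(p - a) for p, a in zip(predicted, actual))


def age_acceleration(biological_age, chronological_age):
    """Residuals of an ordinary least squares fit of BA on CA.

    Assumption: acceleration is the part of BA not explained by CA.
    A positive value means the person looks 'older' than peers of the
    same chronological age.
    """
    ca_mean = mean(chronological_age)
    ba_mean = mean(biological_age)
    sxx = sum((c - ca_mean) ** 2 for c in chronological_age)
    sxy = sum(
        (c - ca_mean) * (b - ba_mean)
        for c, b in zip(chronological_age, biological_age)
    )
    slope = sxy / sxx
    intercept = ba_mean - slope * ca_mean
    return [
        b - (intercept + slope * c)
        for c, b in zip(chronological_age, biological_age)
    ]


# Hypothetical toy data, not taken from NHANES.
ca = [30.0, 45.0, 60.0, 75.0]
ba = [33.0, 43.0, 64.0, 72.0]

mae = mean_absolute_error(ba, ca)
accel = age_acceleration(ba, ca)
print(f"MAE = {mae:.2f} years")
print("acceleration:", [round(a, 2) for a in accel])
```

Using the regression residual rather than the raw difference BA − CA keeps acceleration uncorrelated with chronological age, which is why it is a frequent choice when acceleration is later entered into a survival model alongside age.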

RESULTS

DBA and EnBA accurately predicted actual age (MAE = 2.98 and 3.58 years, respectively) and demonstrated strong predictive capability for all-cause mortality, with AUCs of 0.896 (95% CI: 0.891-0.898) for DBA and 0.889 (95% CI: 0.884-0.894) for EnBA. Higher DBA and EnBA accelerations were significantly associated with increased mortality risk (HR = 1.059 and 1.039, respectively). SHAP analysis highlighted prescription medication usage, hepatitis B surface antibody status, and vigorous physical activity as the most influential features contributing to DBA predictions. Furthermore, BA acceleration was linked to elevated risk of death from specific chronic conditions, including cardiovascular and cerebrovascular diseases and cancer.
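To give a feel for the reported hazard ratios, the sketch below scales a per-unit HR to a larger difference in acceleration. In a Cox model with a linear term for the covariate, the hazard ratio for a k-unit difference is the per-unit HR raised to the power k. The abstract does not state the unit of the HRs; treating them as per one year of BA acceleration is an assumption here, and the function name is hypothetical.

```python
def scaled_hazard_ratio(hr_per_unit, units):
    """Hazard ratio for a `units`-sized difference in a covariate.

    Follows from the Cox model's log-linear form:
    HR(k units) = exp(k * beta) = HR(1 unit) ** k.
    """
    if hr_per_unit <= 0:
        raise ValueError("a hazard ratio must be positive")
    return hr_per_unit ** units


# Reported HRs from the abstract; the "per year" unit is assumed.
dba_5 = scaled_hazard_ratio(1.059, 5)
enba_5 = scaled_hazard_ratio(1.039, 5)
print(f"DBA, 5 units of acceleration:  HR ~ {dba_5:.3f}")
print(f"EnBA, 5 units of acceleration: HR ~ {enba_5:.3f}")
```

Under that assumption, a five-year acceleration would correspond to roughly a one-third higher hazard for DBA and about one-fifth higher for EnBA; the confidence intervals for the HRs are not given in the abstract, so no interval is propagated here.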

DISCUSSION

Our study successfully developed and validated two ML-based BA models capable of accurately predicting both all-cause and cause-specific mortality. These findings suggest that the DBA and EnBA models hold promise for early identification of high-risk individuals, potentially facilitating timely preventive interventions and improving population health outcomes.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/828f/12307447/c1de7d649d8b/fnagi-17-1532884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/828f/12307447/f6542246f55d/fnagi-17-1532884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/828f/12307447/e15d02dc3fc9/fnagi-17-1532884-g003.jpg

