• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的维生素 D 缺乏预测:NHANES 2001-2018。

Machine learning-based prediction of vitamin D deficiency: NHANES 2001-2018.

机构信息

Department of Orthopedics, Chaohu Hospital of Anhui Medical University, Hefei, China.

Department of Infection, Chaohu Hospital of Anhui Medical University, Hefei, China.

出版信息

Front Endocrinol (Lausanne). 2024 Feb 16;15:1327058. doi: 10.3389/fendo.2024.1327058. eCollection 2024.

DOI:10.3389/fendo.2024.1327058
PMID:38449846
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10916299/
Abstract

BACKGROUND

Vitamin D deficiency is strongly associated with the development of several diseases. In the current context of a global pandemic of vitamin D deficiency, it is critical to identify people at high risk of vitamin D deficiency. There are no prediction tools for predicting the risk of vitamin D deficiency in the general community population, and this study aims to use machine learning to predict the risk of vitamin D deficiency using data that can be obtained through simple interviews in the community.

METHODS

The National Health and Nutrition Examination Survey 2001-2018 dataset is used for the analysis which is randomly divided into training and validation sets in the ratio of 70:30. GBM, LR, NNet, RF, SVM, XGBoost methods are used to construct the models and their performance is evaluated. The best performed model was interpreted using the SHAP value and further development of the online web calculator.

RESULTS

There were 62,919 participants enrolled in the study, and all participants included in the study were 2 years old and above, of which 20,204 (32.1%) participants had vitamin D deficiency. The models constructed by each method were evaluated using AUC as the primary evaluation statistic and ACC, PPV, NPV, SEN, SPE, F1 score, MCC, Kappa, and Brier score as secondary evaluation statistics. Finally, the XGBoost-based model has the best and near-perfect performance. The summary plot of SHAP values shows that the top three important features for this model are race, age, and BMI. An online web calculator based on this model can easily and quickly predict the risk of vitamin D deficiency.

CONCLUSION

In this study, the XGBoost-based prediction tool performs flawlessly and is highly accurate in predicting the risk of vitamin D deficiency in community populations.

摘要

背景

维生素 D 缺乏与多种疾病的发生密切相关。在当前全球维生素 D 缺乏症大流行的背景下,确定维生素 D 缺乏高危人群至关重要。目前尚无预测一般社区人群维生素 D 缺乏风险的预测工具,本研究旨在使用机器学习,利用社区中通过简单访谈即可获得的数据来预测维生素 D 缺乏的风险。

方法

使用 2001-2018 年全国健康与营养调查(NHANES)数据集进行分析,将数据随机分为训练集和验证集,比例为 70:30。使用 GBM、LR、NNet、RF、SVM、XGBoost 方法构建模型,并评估其性能。使用 SHAP 值对表现最佳的模型进行解释,并进一步开发在线网络计算器。

结果

本研究共纳入 62919 名参与者,所有纳入研究的参与者年龄均在 2 岁及以上,其中 20204 名(32.1%)参与者存在维生素 D 缺乏。使用 AUC 作为主要评估指标,ACC、PPV、NPV、SEN、SPE、F1 评分、MCC、Kappa 和 Brier 评分作为次要评估指标,对每种方法构建的模型进行评估。最终,基于 XGBoost 的模型具有最佳且接近完美的性能。SHAP 值的汇总图显示,对于该模型,最重要的三个特征是种族、年龄和 BMI。基于该模型的在线网络计算器可以方便、快速地预测维生素 D 缺乏的风险。

结论

在本研究中,基于 XGBoost 的预测工具表现完美,在预测社区人群维生素 D 缺乏风险方面具有高度准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/935c0ba46173/fendo-15-1327058-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/ea508558d216/fendo-15-1327058-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/428e4eaa1df0/fendo-15-1327058-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/2c7440bda271/fendo-15-1327058-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/935c0ba46173/fendo-15-1327058-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/ea508558d216/fendo-15-1327058-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/428e4eaa1df0/fendo-15-1327058-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/2c7440bda271/fendo-15-1327058-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5623/10916299/935c0ba46173/fendo-15-1327058-g004.jpg

相似文献

1
Machine learning-based prediction of vitamin D deficiency: NHANES 2001-2018.基于机器学习的维生素 D 缺乏预测:NHANES 2001-2018。
Front Endocrinol (Lausanne). 2024 Feb 16;15:1327058. doi: 10.3389/fendo.2024.1327058. eCollection 2024.
2
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
3
A machine learning-based online web calculator to aid in the diagnosis of sarcopenia in the US community.一种基于机器学习的在线网络计算器,用于辅助美国社区中肌肉减少症的诊断。
Digit Health. 2024 Sep 27;10:20552076241283247. doi: 10.1177/20552076241283247. eCollection 2024 Jan-Dec.
4
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
5
Development and Validation of an Insulin Resistance Model for a Population with Chronic Kidney Disease Using a Machine Learning Approach.基于机器学习方法的慢性肾脏病人群胰岛素抵抗模型的建立与验证。
Nutrients. 2022 Jul 9;14(14):2832. doi: 10.3390/nu14142832.
6
Development and validation of machine learning models for postoperative venous thromboembolism prediction in colorectal cancer inpatients: a retrospective study.用于预测结直肠癌住院患者术后静脉血栓栓塞的机器学习模型的开发与验证:一项回顾性研究。
J Gastrointest Oncol. 2023 Feb 28;14(1):220-232. doi: 10.21037/jgo-23-18. Epub 2023 Feb 15.
7
Establishment and validation of a heart failure risk prediction model for elderly patients after coronary rotational atherectomy based on machine learning.基于机器学习的老年患者冠状动脉旋磨术后心力衰竭风险预测模型的建立与验证
PeerJ. 2024 Jan 31;12:e16867. doi: 10.7717/peerj.16867. eCollection 2024.
8
Development and validation of a web-based artificial intelligence prediction model to assess massive intraoperative blood loss for metastatic spinal disease using machine learning techniques.开发和验证一种基于网络的人工智能预测模型,使用机器学习技术评估转移性脊柱疾病的术中大量失血。
Spine J. 2024 Jan;24(1):146-160. doi: 10.1016/j.spinee.2023.09.001. Epub 2023 Sep 11.
9
Machine learning algorithms to predict risk of postoperative pneumonia in elderly with hip fracture.机器学习算法预测老年髋部骨折术后肺炎风险。
J Orthop Surg Res. 2023 Aug 5;18(1):571. doi: 10.1186/s13018-023-04049-0.
10
Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type.基于生活方式类型的数据驱动糖尿病预测的机器学习模型。
Int J Environ Res Public Health. 2022 Nov 15;19(22):15027. doi: 10.3390/ijerph192215027.

引用本文的文献

1
The association of life's essential 8 with prevalence of chronic respiratory diseases in adults: insights from NHANES 2007-2018.成人生命必需的八项指标与慢性呼吸道疾病患病率的关联:来自2007 - 2018年美国国家健康与营养检查调查的见解
BMC Pulm Med. 2025 Jul 9;25(1):331. doi: 10.1186/s12890-025-03775-x.
2
Construction of a nomogram for predicting serum vitamin D deficiency in children/adolescents with new-onset type 1 diabetes: a single-center study in China.构建预测新诊断1型糖尿病儿童/青少年血清维生素D缺乏的列线图:一项中国单中心研究
Front Pediatr. 2025 May 30;13:1554833. doi: 10.3389/fped.2025.1554833. eCollection 2025.
3

本文引用的文献

1
Machine learning for administrative health records: A systematic review of techniques and applications.基于管理健康记录的机器学习:技术与应用的系统综述。
Artif Intell Med. 2023 Oct;144:102642. doi: 10.1016/j.artmed.2023.102642. Epub 2023 Aug 27.
2
Machine Learning: A New Approach for Dose Individualization.机器学习:剂量个体化的新方法。
Clin Pharmacol Ther. 2024 Apr;115(4):727-744. doi: 10.1002/cpt.3049. Epub 2023 Sep 29.
3
The association of vitamin D deficiency, age and depression in US adults: a cross-sectional analysis.
The relationship between dietary vitamin B1 and stroke: a machine learning analysis of NHANES data.
饮食中维生素B1与中风的关系:基于美国国家健康与营养检查调查(NHANES)数据的机器学习分析
Front Nutr. 2025 May 6;12:1584654. doi: 10.3389/fnut.2025.1584654. eCollection 2025.
4
Diagnostic biomarkers and immune infiltration profiles common to COVID-19, acute myocardial infarction and acute ischaemic stroke using bioinformatics methods and machine learning.使用生物信息学方法和机器学习确定2019冠状病毒病、急性心肌梗死和急性缺血性中风共有的诊断生物标志物和免疫浸润特征
BMC Neurol. 2025 May 8;25(1):201. doi: 10.1186/s12883-025-04212-6.
5
The relationship between epigenetic biomarkers and the risk of diabetes and cancer: a machine learning modeling approach.表观遗传生物标志物与糖尿病和癌症风险之间的关系:一种机器学习建模方法。
Front Public Health. 2025 Mar 21;13:1509458. doi: 10.3389/fpubh.2025.1509458. eCollection 2025.
6
Correlation analysis of a novel artificial intelligence optical microscope-assisted semen assessment system with IVF outcomes.一种新型人工智能光学显微镜辅助精液评估系统与体外受精结局的相关性分析
J Assist Reprod Genet. 2025 Mar 31. doi: 10.1007/s10815-025-03453-1.
7
Discovering Vitamin-D-Deficiency-Associated Factors in Korean Adults Using KNHANES Data Based on an Integrated Analysis of Machine Learning and Statistical Techniques.基于机器学习和统计技术的综合分析,利用韩国国家健康与营养检查调查(KNHANES)数据发现韩国成年人维生素D缺乏相关因素。
Nutrients. 2025 Feb 8;17(4):618. doi: 10.3390/nu17040618.
8
Low-carbohydrate diet score and chronic obstructive pulmonary disease: a machine learning analysis of NHANES data.低碳水化合物饮食评分与慢性阻塞性肺疾病:对美国国家健康与营养检查调查(NHANES)数据的机器学习分析
Front Nutr. 2024 Dec 18;11:1519782. doi: 10.3389/fnut.2024.1519782. eCollection 2024.
9
Comment on Aşır et al. Investigation of Vitamin D Levels in Men with Suspected Infertility. 2024, , 273.关于阿西尔等人的评论。对疑似不育男性维生素D水平的调查。2024年,,273。 (注:原文中“2024, , 273”表述不太完整规范,可能影响准确理解其确切含义)
Life (Basel). 2024 Jul 22;14(7):913. doi: 10.3390/life14070913.
美国成年人中维生素 D 缺乏、年龄和抑郁的关联:一项横断面分析。
BMC Psychiatry. 2023 Jul 24;23(1):534. doi: 10.1186/s12888-023-04685-0.
4
Evaluation of vitamin D levels and affecting factors of vitamin D deficiency in healthy children 0-18 years old.评估 0-18 岁健康儿童维生素 D 水平及维生素 D 缺乏的影响因素。
Eur J Pediatr. 2023 Sep;182(9):4123-4131. doi: 10.1007/s00431-023-05096-9. Epub 2023 Jul 10.
5
Low Vitamin D Levels Are Associated With Long COVID Syndrome in COVID-19 Survivors.维生素 D 水平低与 COVID-19 幸存者的长新冠综合征有关。
J Clin Endocrinol Metab. 2023 Sep 18;108(10):e1106-e1116. doi: 10.1210/clinem/dgad207.
6
Prevalence of vitamin D deficiency in South America: a systematic review and meta-analysis.南美洲维生素 D 缺乏症的流行情况:系统评价和荟萃分析。
Nutr Rev. 2023 Sep 11;81(10):1290-1309. doi: 10.1093/nutrit/nuad010.
7
Mendelian randomization on the association of obesity with vitamin D: Guangzhou Biobank Cohort Study.基于孟德尔随机化的肥胖与维生素 D 关联研究:广州生物银行队列研究。
Eur J Clin Nutr. 2023 Feb;77(2):195-201. doi: 10.1038/s41430-022-01234-y. Epub 2022 Nov 8.
8
Positive Effects of Vitamin D Supplementation in Patients Hospitalized for COVID-19: A Randomized, Double-Blind, Placebo-Controlled Trial.维生素 D 补充对 COVID-19 住院患者的积极影响:一项随机、双盲、安慰剂对照试验。
Nutrients. 2022 Jul 26;14(15):3048. doi: 10.3390/nu14153048.
9
Prediction of Vitamin D Deficiency in Older Adults: The Role of Machine Learning Models.老年人维生素 D 缺乏预测:机器学习模型的作用。
J Clin Endocrinol Metab. 2022 Sep 28;107(10):2737-2747. doi: 10.1210/clinem/dgac432.
10
Shifting machine learning for healthcare from development to deployment and from models to data.将医疗保健领域的机器学习从开发转移到部署,从模型转移到数据。
Nat Biomed Eng. 2022 Dec;6(12):1330-1345. doi: 10.1038/s41551-022-00898-y. Epub 2022 Jul 4.