• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习方法在维生素 D 水平检测中的应用:一项对比研究。

Machine learning approach for the detection of vitamin D level: a comparative study.

机构信息

Department of Mathematics, Near East University, Nicosia, 99138, Turkey.

Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Tabriz, Tabriz, Iran.

出版信息

BMC Med Inform Decis Mak. 2023 Oct 16;23(1):219. doi: 10.1186/s12911-023-02323-z.

DOI:10.1186/s12911-023-02323-z
PMID:37845674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10580577/
Abstract

BACKGROUND

After the World Health Organization declared the COVID-19 pandemic, the role of Vitamin D has become even more critical for people worldwide. The most accurate way to define vitamin D level is 25-hydroxy vitamin D(25-OH-D) blood test. However, this blood test is not always feasible. Most data sets used in health science research usually contain highly correlated features, which is referred to as multicollinearity problem. This problem can lead to misleading results and overfitting problems in the ML training process. Therefore, the proposed study aims to determine a clinically acceptable ML model for the detection of the vitamin D status of the North Cyprus adult participants accurately, without the need to determine 25-OH-D level, taking into account the multicollinearity problem.

METHOD

The study was conducted with 481 observations who applied voluntarily to Internal Medicine Department at NEU Hospital. The classification performance of four conventional supervised ML models, namely, Ordinal logistic regression(OLR), Elastic-net ordinal regression(ENOR), Support Vector Machine(SVM), and Random Forest (RF) was compared. The comparative analysis is performed regarding the model's sensitivity to the participant's metabolic syndrome(MtS)'positive status, hyper-parameter tuning, sensitivities to the size of training data, and the classification performance of the models.

RESULTS

Due to the presence of multicollinearity, the findings showed that the performance of the SVM(RBF) is obviously negatively affected when the test is examined. Moreover, it can be obviously detected that RF is more robust than other models when the variations in the size of training data are examined. This experiment's result showed that the selected RF and ENOR showed better performances than the other two models when the size of training samples was reduced. Since the multicollinearity is more severe in the small samples, it can be concluded that RF and ENOR are not affected by the presence of the multicollinearity problem. The comparative analysis revealed that the RF classifier performed better and was more robust than the other proposed models in terms of accuracy (0.94), specificity (0.96), sensitivity or recall (0.94), precision (0.95), F1-score (0.95), and Cohen's kappa (0.90).

CONCLUSION

It is evident that the RF achieved better than the SVM(RBF), ENOR, and OLR. These comparison findings will be applied to develop a Vitamin D level intelligent detection system for being used in routine clinical, biochemical tests, and lifestyle characteristics of individuals to decrease the cost and time of vitamin D level detection.

摘要

背景

世界卫生组织宣布 COVID-19 大流行后,维生素 D 的作用对于全世界的人们来说变得更加重要。定义维生素 D 水平最准确的方法是 25-羟维生素 D(25-OH-D)血液检测。然而,这种血液检测并不总是可行的。健康科学研究中使用的大多数数据集通常包含高度相关的特征,这被称为多重共线性问题。这个问题可能会导致机器学习(ML)训练过程中的误导性结果和过拟合问题。因此,拟议的研究旨在确定一种临床上可接受的 ML 模型,以准确检测北塞浦路斯成年参与者的维生素 D 状态,而无需确定 25-OH-D 水平,同时考虑到多重共线性问题。

方法

该研究共纳入 481 名自愿到 NEU 医院内科就诊的观察对象。比较了四种常规监督式机器学习模型(有序逻辑回归(OLR)、弹性网络有序回归(ENOR)、支持向量机(SVM)和随机森林(RF))的分类性能。比较分析了模型对参与者代谢综合征(MtS)阳性状态的敏感性、超参数调整、对训练数据大小的敏感性以及模型的分类性能。

结果

由于存在多重共线性,研究结果表明,当对测试进行检查时,SVM(RBF)的性能明显受到负面影响。此外,当检查训练数据大小的变化时,可以明显检测到 RF 比其他模型更稳健。本实验结果表明,当训练样本量减少时,选择的 RF 和 ENOR 比其他两种模型表现更好。由于小样本中多重共线性更为严重,因此可以得出结论,RF 和 ENOR 不受多重共线性问题的影响。比较分析表明,RF 分类器在准确性(0.94)、特异性(0.96)、敏感性或召回率(0.94)、精度(0.95)、F1 分数(0.95)和 Cohen's kappa(0.90)方面的表现优于其他提出的模型。

结论

RF 的表现明显优于 SVM(RBF)、ENOR 和 OLR。这些比较结果将应用于开发一种维生素 D 水平智能检测系统,用于常规临床、生化检测和个体生活方式特征,以降低维生素 D 水平检测的成本和时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/caecec7d282d/12911_2023_2323_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/689f426f13e4/12911_2023_2323_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/2ab7d232ad77/12911_2023_2323_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/da688cf82f4e/12911_2023_2323_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/caecec7d282d/12911_2023_2323_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/689f426f13e4/12911_2023_2323_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/2ab7d232ad77/12911_2023_2323_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/da688cf82f4e/12911_2023_2323_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74e4/10580577/caecec7d282d/12911_2023_2323_Fig4_HTML.jpg

相似文献

1
Machine learning approach for the detection of vitamin D level: a comparative study.机器学习方法在维生素 D 水平检测中的应用:一项对比研究。
BMC Med Inform Decis Mak. 2023 Oct 16;23(1):219. doi: 10.1186/s12911-023-02323-z.
2
Machine learning approaches to constructing predictive models of vitamin D deficiency in a hypertensive population: a comparative study.机器学习方法构建高血压人群维生素 D 缺乏预测模型的比较研究。
Inform Health Soc Care. 2021 Dec 2;46(4):355-369. doi: 10.1080/17538157.2021.1896524. Epub 2021 Apr 1.
3
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Can machine learning predict pharmacotherapy outcomes? An application study in osteoporosis.机器学习能预测药物治疗效果吗?一项在骨质疏松症中的应用研究。
Comput Methods Programs Biomed. 2022 Oct;225:107028. doi: 10.1016/j.cmpb.2022.107028. Epub 2022 Jul 21.
6
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
7
Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach.巴西 COVID-19 检测优先级分类模型:机器学习方法。
J Med Internet Res. 2021 Apr 8;23(4):e27293. doi: 10.2196/27293.
8
Machine learning approach for automated predicting of COVID-19 severity based on clinical and paraclinical characteristics: Serum levels of zinc, calcium, and vitamin D.基于临床和检验特征的机器学习方法对 COVID-19 严重程度的自动预测:血清锌、钙和维生素 D 水平。
Clin Nutr ESPEN. 2022 Oct;51:404-411. doi: 10.1016/j.clnesp.2022.07.011. Epub 2022 Jul 31.
9
Bayesian Optimization with Support Vector Machine Model for Parkinson Disease Classification.基于支持向量机模型的贝叶斯优化在帕金森病分类中的应用。
Sensors (Basel). 2023 Feb 13;23(4):2085. doi: 10.3390/s23042085.
10
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.

引用本文的文献

1
Supervised model based polycystic ovarian syndrome detection in relation to vitamin d deficiency by exploring different feature selection techniques.通过探索不同的特征选择技术,基于监督模型检测多囊卵巢综合征与维生素D缺乏的关系。
Sci Rep. 2025 Aug 26;15(1):31481. doi: 10.1038/s41598-025-14728-z.
2
Machine learning prediction of coal workers' pneumoconiosis classification based on few-shot clinical data.基于少量临床数据的煤工尘肺分类的机器学习预测
Digit Health. 2025 Jul 13;11:20552076251359498. doi: 10.1177/20552076251359498. eCollection 2025 Jan-Dec.
3
The role of vitamin D in sleep regulation: mechanisms, clinical advances, and future directions.

本文引用的文献

1
Machine learning algorithms for predicting low birth weight in Ethiopia.用于预测埃塞俄比亚低出生体重的机器学习算法。
BMC Med Inform Decis Mak. 2022 Sep 5;22(1):232. doi: 10.1186/s12911-022-01981-9.
2
Modelling Covid-19 infections in Zambia using data mining techniques.使用数据挖掘技术对赞比亚的新冠病毒感染情况进行建模。
Results Eng. 2022 Mar;13:100363. doi: 10.1016/j.rineng.2022.100363. Epub 2022 Feb 4.
3
Machine learning approaches for the prediction of postoperative complication risk in liver resection patients.机器学习方法预测肝切除术患者术后并发症风险。
维生素D在睡眠调节中的作用:机制、临床进展及未来方向。
Front Nutr. 2025 Jun 2;12:1595813. doi: 10.3389/fnut.2025.1595813. eCollection 2025.
4
Enhancing the prediction of vitamin D deficiency levels using an integrated approach of deep learning and evolutionary computing.使用深度学习和进化计算的综合方法增强维生素D缺乏水平的预测。
PeerJ Comput Sci. 2025 Feb 21;11:e2698. doi: 10.7717/peerj-cs.2698. eCollection 2025.
5
Machine Learning-Driven Prediction of Vitamin D Deficiency Severity with Hybrid Optimization.基于混合优化的机器学习驱动的维生素D缺乏严重程度预测
Bioengineering (Basel). 2025 Feb 18;12(2):200. doi: 10.3390/bioengineering12020200.
6
A Systematic Review of the Outcomes of Utilization of Artificial Intelligence Within the Healthcare Systems of the Middle East: A Thematic Analysis of Findings.中东医疗系统中人工智能应用成果的系统评价:研究结果的主题分析
Health Sci Rep. 2024 Dec 24;7(12):e70300. doi: 10.1002/hsr2.70300. eCollection 2024 Dec.
7
CT radiomics to predict pathologic complete response after neoadjuvant immunotherapy plus chemoradiotherapy in locally advanced esophageal squamous cell carcinoma.CT影像组学预测局部晚期食管鳞状细胞癌新辅助免疫治疗联合放化疗后的病理完全缓解
Eur Radiol. 2025 Mar;35(3):1594-1604. doi: 10.1007/s00330-024-11141-4. Epub 2024 Oct 29.
8
Using Machine Learning to Identify the Relationships between Demographic, Biochemical, and Lifestyle Parameters and Plasma Vitamin D Concentration in Healthy Premenopausal Chinese Women.利用机器学习识别健康的绝经前中国女性的人口统计学、生化和生活方式参数与血浆维生素D浓度之间的关系。
Life (Basel). 2023 Nov 27;13(12):2257. doi: 10.3390/life13122257.
BMC Med Inform Decis Mak. 2021 Dec 30;21(1):371. doi: 10.1186/s12911-021-01731-3.
4
A Machine Learning Approach to Assess Differential Item Functioning in Psychometric Questionnaires Using the Elastic Net Regularized Ordinal Logistic Regression in Small Sample Size Groups.一种使用弹性网络正则化有序逻辑回归在小样本量组中评估心理计量问卷中差异项目功能的机器学习方法。
Biomed Res Int. 2021 Dec 15;2021:6854477. doi: 10.1155/2021/6854477. eCollection 2021.
5
Regularized Ordinal Regression and the ordinalNet R Package.正则化有序回归与ordinalNet R包。
J Stat Softw. 2021 Sep;99(6). doi: 10.18637/jss.v099.i06.
6
Towards the Interpretability of Machine Learning Predictions for Medical Applications Targeting Personalised Therapies: A Cancer Case Survey.面向针对个性化治疗的医学应用的机器学习预测的可解释性:一项癌症病例调查。
Int J Mol Sci. 2021 Apr 22;22(9):4394. doi: 10.3390/ijms22094394.
7
A Deep Learning Approach for Table Tennis Forehand Stroke Evaluation System Using an IMU Sensor.基于 IMU 传感器的乒乓球正手挥拍动作评估系统的深度学习方法
Comput Intell Neurosci. 2021 Apr 9;2021:5584756. doi: 10.1155/2021/5584756. eCollection 2021.
8
Machine learning approaches to constructing predictive models of vitamin D deficiency in a hypertensive population: a comparative study.机器学习方法构建高血压人群维生素 D 缺乏预测模型的比较研究。
Inform Health Soc Care. 2021 Dec 2;46(4):355-369. doi: 10.1080/17538157.2021.1896524. Epub 2021 Apr 1.
9
Low vitamin D status is associated with coronavirus disease 2019 outcomes: a systematic review and meta-analysis.维生素 D 水平低与 2019 冠状病毒病结局相关:系统评价和荟萃分析。
Int J Infect Dis. 2021 Mar;104:58-64. doi: 10.1016/j.ijid.2020.12.077. Epub 2021 Jan 2.
10
The role of vitamin D in the prevention of coronavirus disease 2019 infection and mortality.维生素 D 在预防 2019 年冠状病毒病感染和死亡中的作用。
Aging Clin Exp Res. 2020 Jul;32(7):1195-1198. doi: 10.1007/s40520-020-01570-8. Epub 2020 May 6.