• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测骨质疏松症患病率的特征提取方法和机器学习模型的比较分析

Comparative Analysis of Feature Extraction Methods and Machine Learning Models for Predicting Osteoporosis Prevalence.

作者信息

Zhang Danni, Yang Xingyu, Wang Fangying, Qiu Cifang, Chai Yanfu, Fang Danruo

机构信息

Department of Functional, Shaoxing Hospital of Traditional Chinese Medicine, Shaoxing, 312000, Zhejiang, China.

School of Mechanical and Electrical Engineering, Shaoxing University, Shaoxing, 312000, China.

出版信息

J Med Syst. 2025 May 29;49(1):72. doi: 10.1007/s10916-025-02203-1.

DOI:10.1007/s10916-025-02203-1
PMID:40439990
Abstract

This study systematically examined the impact of three feature selection techniques (Boruta, Extreme gradient boosting (XGBoost), and Lasso) for optimizing four machine learning models (Random forest (RF), XGBoost, Logistic regression (LR), and Support vector machine (SVM)) in predicting bone density prevalence. Our findings revealed that varying data partitioning ratios (training and test sets: 0.6:0.4; 0.7:0.3; 0.8:0.2; 0.9:0.1) minimally impacted the prediction accuracy across all four models, a conclusion reinforced by 10-fold cross validation. Besides, principal component analysis (PCA) led to substantial accuracy degradation (0.6-0.8 range), suggesting incompatibility with this study's requirements due to the inherent complex decision boundaries in the original high-dimensional data. Comparative analysis demonstrated that the Boruta-XGBoost combination achieved superior performance (accuracy: 0.9083 ± 0.0146), significantly outperforming the Lasso-LR combination (0.7480 ± 0.0157) across all evaluation frameworks. Regarding model evaluation metrics, the RF model exhibited enhanced discriminative capacity with Area under the receiver operating characteristic (AUROC) values of 0.85, 0.81, and 0.80 under different feature selection approaches, surpassing the SVM model (0.78, 0.76, and 0.76). This advantage likely stems from RF's native capability to capture non-linear relationships and feature interactions.

摘要

本研究系统地考察了三种特征选择技术(Boruta、极端梯度提升(XGBoost)和套索)对优化四种机器学习模型(随机森林(RF)、XGBoost、逻辑回归(LR)和支持向量机(SVM))预测骨密度患病率的影响。我们的研究结果表明,不同的数据划分比例(训练集和测试集:0.6:0.4;0.7:0.3;0.8:0.2;0.9:0.1)对所有四个模型的预测准确性影响极小,这一结论通过10折交叉验证得到了加强。此外,主成分分析(PCA)导致准确性大幅下降(0.6 - 0.8范围),表明由于原始高维数据中固有的复杂决策边界,其与本研究的要求不兼容。对比分析表明,在所有评估框架中,Boruta - XGBoost组合表现出卓越的性能(准确率:0.9083±0.0146),显著优于套索 - LR组合(0.7480±0.0157)。关于模型评估指标,RF模型在不同特征选择方法下的受试者操作特征曲线下面积(AUROC)值分别为0.85、0.81和0.80,表现出更强的判别能力,超过了SVM模型(0.78、0.76和0.76)。这一优势可能源于RF捕捉非线性关系和特征交互的固有能力。

相似文献

1
Comparative Analysis of Feature Extraction Methods and Machine Learning Models for Predicting Osteoporosis Prevalence.用于预测骨质疏松症患病率的特征提取方法和机器学习模型的比较分析
J Med Syst. 2025 May 29;49(1):72. doi: 10.1007/s10916-025-02203-1.
2
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
3
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
4
Identification of biomarkers for knee osteoarthritis through clinical data and machine learning models.通过临床数据和机器学习模型识别膝关节骨关节炎的生物标志物
Sci Rep. 2025 Jan 11;15(1):1703. doi: 10.1038/s41598-025-85945-9.
5
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
6
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
7
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
8
Application of machine learning model to predict osteoporosis based on abdominal computed tomography images of the psoas muscle: a retrospective study.基于腰部竖脊肌 CT 图像的机器学习模型预测骨质疏松症的应用:一项回顾性研究。
BMC Geriatr. 2022 Oct 13;22(1):796. doi: 10.1186/s12877-022-03502-9.
9
Clinical Timing-Sequence Warning Models for Serious Bacterial Infections in Adults Based on Machine Learning: Retrospective Study.基于机器学习的成人严重细菌感染临床时间序列预警模型:回顾性研究。
J Med Internet Res. 2023 Dec 18;25:e45515. doi: 10.2196/45515.
10
Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women.机器学习方法在绝经后妇女骨质疏松症风险预测中的应用。
Arch Osteoporos. 2020 Oct 23;15(1):169. doi: 10.1007/s11657-020-00802-8.

本文引用的文献

1
Proceedings of the 2024 Santa Fe Bone Symposium: Update on the Management of Osteoporosis and Rare Bone Diseases.2024年圣达菲骨科学研讨会会议记录:骨质疏松症和罕见骨病管理的最新进展
J Clin Densitom. 2025 Jan-Mar;28(1):101559. doi: 10.1016/j.jocd.2024.101559. Epub 2024 Dec 28.
2
On the Readiness of Scientific Data Papers for a Fair and Transparent Use in Machine Learning.论科学数据论文在机器学习中公平透明使用的准备情况
Sci Data. 2025 Jan 13;12(1):61. doi: 10.1038/s41597-025-04402-4.
3
Mechanisms of Ferroptosis in bone disease: A new target for osteoporosis treatment.
骨疾病中细胞铁死亡的机制:骨质疏松症治疗的新靶点。
Cell Signal. 2025 Mar;127:111598. doi: 10.1016/j.cellsig.2025.111598. Epub 2025 Jan 7.
4
Enabling osteoporosis treatment in post-acute care: An algorithm for providers.在急性后期护理中实现骨质疏松症治疗:供医疗服务提供者使用的一种算法。
Geriatr Nurs. 2025 Jan-Feb;61:228-230. doi: 10.1016/j.gerinurse.2024.10.068. Epub 2024 Nov 19.
5
Trends in the prevalence of osteoporosis and effects of heavy metal exposure using interpretable machine learning.骨质疏松症患病率的变化趋势及重金属暴露的影响——基于可解释机器学习的研究。
Ecotoxicol Environ Saf. 2024 Nov 1;286:117238. doi: 10.1016/j.ecoenv.2024.117238. Epub 2024 Oct 28.
6
Bone mineral density affects tumor growth by shaping microenvironmental heterogeneity.骨密度通过塑造微环境异质性来影响肿瘤生长。
Biomaterials. 2025 Apr;315:122916. doi: 10.1016/j.biomaterials.2024.122916. Epub 2024 Oct 24.
7
Machine-learning models for diagnosis of rotator cuff tears in osteoporosis patients based on anteroposterior X-rays of the shoulder joint.基于肩关节前后位X线片的机器学习模型用于诊断骨质疏松症患者的肩袖撕裂
SLAS Technol. 2024 Aug;29(4):100149. doi: 10.1016/j.slast.2024.100149. Epub 2024 May 23.
8
An interpretable DIC risk prediction model based on convolutional neural networks with time series data.基于卷积神经网络的时间序列数据可解释 DIC 风险预测模型。
BMC Bioinformatics. 2022 Nov 8;23(1):471. doi: 10.1186/s12859-022-05004-2.
9
Prospective Evaluation of a Machine-Learning Prediction Model for Missed Radiology Appointments.机器学习预测模型在错过放射科预约中的前瞻性评估。
J Digit Imaging. 2022 Dec;35(6):1690-1693. doi: 10.1007/s10278-022-00670-3. Epub 2022 Jun 29.
10
Utility based approach in individualized optimal dose selection using machine learning methods.基于效用的个体化最优剂量选择方法:机器学习方法的应用。
Stat Med. 2022 Jul 20;41(16):2957-2977. doi: 10.1002/sim.9396. Epub 2022 Mar 28.