• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于改善肌肉减少症预测任务的插补策略的比较研究。

Comparative study of imputation strategies to improve the sarcopenia prediction task.

作者信息

Karimov Shakhzod, Turimov Dilmurod, Kim Wooseong, Kim Jiyoun

机构信息

Department of Computer Engineering, Gachon University, Seongnam-si, Republic of Korea.

Department of Exercise Rehabilitation & Welfare, Gachon University, Incheon, Republic of Korea.

出版信息

Digit Health. 2025 Jan 17;11:20552076241301960. doi: 10.1177/20552076241301960. eCollection 2025 Jan-Dec.

DOI:10.1177/20552076241301960
PMID:39839962
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11748086/
Abstract

OBJECTIVE

Sarcopenia, a condition characterized by the progressive loss of skeletal muscle mass and strength, poses significant challenges in research due to missing data. Incomplete datasets undermine the accuracy and reliability of studies, necessitating effective imputation techniques. This study conducts a comparative analysis of three advanced methods-multiple imputation by chained equations (MICE), support vector regression, and K-nearest neighbors (KNN)-to address data completeness issues in sarcopenia research.

METHODS

Following imputation, we utilized machine learning models, including logistic regression, gradient boosting, support vector machine, and random forest, to classify sarcopenia. The methodology encompassed rigorous data preprocessing, normalization, and the synthetic minority oversampling technique to address class imbalance and ensure unbiased model performance.

RESULTS

The results revealed substantial variations in model accuracy based on the imputation method employed. The gradient boosting model consistently exhibited superior performance across all imputation strategies, demonstrating its robustness with imputed datasets. Additionally, KNN and MICE emerged as effective imputation techniques, preserving the original data distribution and enabling more accurate classification outcomes.

CONCLUSION

This study underscores the pivotal role of imputation methods in maintaining data integrity and enhancing predictive accuracy in sarcopenia research. The gradient boosting model's reliability across all strategies highlights its potential as a robust classifier, while the suitability of KNN and MICE for preserving data distribution supports their application in similar research contexts. These findings contribute to more reliable and valid insights in sarcopenia studies, ultimately supporting improved clinical outcomes.

摘要

目的

肌肉减少症是一种以骨骼肌质量和力量逐渐丧失为特征的病症,由于数据缺失,在研究中带来了重大挑战。不完整的数据集会破坏研究的准确性和可靠性,因此需要有效的插补技术。本研究对三种先进方法——链式方程多重插补(MICE)、支持向量回归和K近邻(KNN)——进行了比较分析,以解决肌肉减少症研究中的数据完整性问题。

方法

在插补之后,我们使用了包括逻辑回归、梯度提升、支持向量机和随机森林在内的机器学习模型来对肌肉减少症进行分类。该方法包括严格的数据预处理、归一化以及合成少数类过采样技术,以解决类别不平衡问题并确保模型性能无偏。

结果

结果显示,基于所采用的插补方法,模型准确性存在显著差异。梯度提升模型在所有插补策略中始终表现出卓越的性能,证明了其在插补数据集上的稳健性。此外,KNN和MICE成为有效的插补技术,保留了原始数据分布并实现了更准确的分类结果。

结论

本研究强调了插补方法在维持肌肉减少症研究中的数据完整性和提高预测准确性方面的关键作用。梯度提升模型在所有策略中的可靠性凸显了其作为稳健分类器的潜力,而KNN和MICE在保留数据分布方面的适用性支持了它们在类似研究背景中的应用。这些发现有助于在肌肉减少症研究中获得更可靠和有效的见解,最终支持改善临床结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/6990504715d6/10.1177_20552076241301960-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/84987696844b/10.1177_20552076241301960-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/0551397edfe0/10.1177_20552076241301960-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/5ebde4b52220/10.1177_20552076241301960-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/88a97f40d746/10.1177_20552076241301960-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/1c42c8d31b46/10.1177_20552076241301960-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/0c2e6f369355/10.1177_20552076241301960-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/6990504715d6/10.1177_20552076241301960-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/84987696844b/10.1177_20552076241301960-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/0551397edfe0/10.1177_20552076241301960-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/5ebde4b52220/10.1177_20552076241301960-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/88a97f40d746/10.1177_20552076241301960-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/1c42c8d31b46/10.1177_20552076241301960-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/0c2e6f369355/10.1177_20552076241301960-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2527/11748086/6990504715d6/10.1177_20552076241301960-fig7.jpg

相似文献

1
Comparative study of imputation strategies to improve the sarcopenia prediction task.用于改善肌肉减少症预测任务的插补策略的比较研究。
Digit Health. 2025 Jan 17;11:20552076241301960. doi: 10.1177/20552076241301960. eCollection 2025 Jan-Dec.
2
Missing data imputation, prediction, and feature selection in diagnosis of vaginal prolapse.阴道脱垂诊断中的缺失数据插补、预测和特征选择。
BMC Med Res Methodol. 2023 Nov 6;23(1):259. doi: 10.1186/s12874-023-02079-0.
3
Analyzing the effect of data preprocessing techniques using machine learning algorithms on the diagnosis of COVID-19.使用机器学习算法分析数据预处理技术对新型冠状病毒肺炎诊断的影响。
Concurr Comput. 2022 Dec 25;34(28):e7393. doi: 10.1002/cpe.7393. Epub 2022 Oct 18.
4
Prediction of adverse pregnancy outcomes using machine learning techniques: evidence from analysis of electronic medical records data in Rwanda.使用机器学习技术预测不良妊娠结局:来自卢旺达电子病历数据分析的证据。
BMC Med Inform Decis Mak. 2025 Feb 12;25(1):76. doi: 10.1186/s12911-025-02921-z.
5
Improving Surgical Site Infection Prediction Using Machine Learning: Addressing Challenges of Highly Imbalanced Data.使用机器学习改善手术部位感染预测:应对高度不平衡数据的挑战。
Diagnostics (Basel). 2025 Feb 19;15(4):501. doi: 10.3390/diagnostics15040501.
6
Advanced methods for missing values imputation based on similarity learning.基于相似性学习的缺失值插补先进方法。
PeerJ Comput Sci. 2021 Jul 21;7:e619. doi: 10.7717/peerj-cs.619. eCollection 2021.
7
A novel MissForest-based missing values imputation approach with recursive feature elimination in medical applications.一种基于 MissForest 的新的缺失值插补方法,在医学应用中采用递归特征消除。
BMC Med Res Methodol. 2024 Nov 8;24(1):269. doi: 10.1186/s12874-024-02392-2.
8
Machine learning models predict triage levels, massive transfusion protocol activation, and mortality in trauma utilizing patients hemodynamics on admission.机器学习模型利用创伤患者入院时的血流动力学来预测分诊级别、大量输血方案的激活和死亡率。
Comput Biol Med. 2024 Sep;179:108880. doi: 10.1016/j.compbiomed.2024.108880. Epub 2024 Jul 16.
9
Comparison of the effects of imputation methods for missing data in predictive modelling of cohort study datasets.缺失数据插补方法对队列研究数据集预测建模效果的比较。
BMC Med Res Methodol. 2024 Feb 16;24(1):41. doi: 10.1186/s12874-024-02173-x.
10
Addressing Missing Data Challenges in Geriatric Health Monitoring: A Study of Statistical and Machine Learning Imputation Methods.应对老年健康监测中的数据缺失挑战:统计与机器学习插补方法研究
Sensors (Basel). 2025 Jan 21;25(3):614. doi: 10.3390/s25030614.

本文引用的文献

1
Improving sarcopenia in older adults: a systematic review and meta-analysis of randomized controlled trials of whey protein supplementation with or without resistance training.改善老年人肌少症:补充乳清蛋白联合或不联合抗阻训练的随机对照试验的系统评价和荟萃分析。
J Nutr Health Aging. 2024 Apr;28(4):100184. doi: 10.1016/j.jnha.2024.100184. Epub 2024 Feb 12.
2
Machine Learning for Sarcopenia Prediction in the Elderly Using Socioeconomic, Infrastructure, and Quality-of-Life Data.使用社会经济、基础设施和生活质量数据的机器学习预测老年人肌肉减少症
Healthcare (Basel). 2023 Nov 1;11(21):2881. doi: 10.3390/healthcare11212881.
3
Performance Evaluation of Machine Learning Algorithms for Sarcopenia Diagnosis in Older Adults.
老年人肌肉减少症诊断中机器学习算法的性能评估
Healthcare (Basel). 2023 Oct 9;11(19):2699. doi: 10.3390/healthcare11192699.
4
Machine Learning Applications in Sarcopenia Detection and Management: A Comprehensive Survey.机器学习在肌肉减少症检测与管理中的应用:全面综述。
Healthcare (Basel). 2023 Sep 7;11(18):2483. doi: 10.3390/healthcare11182483.
5
A systematic approach towards missing lab data in electronic health records: A case study in non-small cell lung cancer and multiple myeloma.一种针对电子健康记录中缺失实验室数据的系统方法:非小细胞肺癌和多发性骨髓瘤的案例研究。
CPT Pharmacometrics Syst Pharmacol. 2023 Sep;12(9):1201-1212. doi: 10.1002/psp4.12998. Epub 2023 Jun 15.
6
[How to deal with missing data? Multiple imputation by chained equations: recommendations and explanations for clinical practice].[如何处理缺失数据?链式方程多重填补:临床实践的建议与解释]
Nephrol Ther. 2023 Jun 19;19(3):171-179. doi: 10.1684/ndt.2023.24.
7
Prediction of the sarcopenia in peritoneal dialysis using simple clinical information: A machine learning-based model.利用简单临床信息预测腹膜透析患者的肌肉减少症:基于机器学习的模型。
Semin Dial. 2023 Sep-Oct;36(5):390-398. doi: 10.1111/sdi.13131. Epub 2023 Mar 8.
8
Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR/MAR/MNAR classification.多变量缺失数据研究中的假设和分析计划:超越 MCAR/MAR/MNAR 分类。
Int J Epidemiol. 2023 Aug 2;52(4):1268-1275. doi: 10.1093/ije/dyad008.
9
Machine Learning Model in Predicting Sarcopenia in Crohn's Disease Based on Simple Clinical and Anthropometric Measures.基于简单临床和人体测量学指标的机器学习模型预测克罗恩病肌少症。
Int J Environ Res Public Health. 2022 Dec 30;20(1):656. doi: 10.3390/ijerph20010656.
10
Development and Verification of a Combined Diagnostic Model for Sarcopenia with Random Forest and Artificial Neural Network.基于随机森林和人工神经网络的肌少症联合诊断模型的建立与验证。
Comput Math Methods Med. 2022 Aug 23;2022:2957731. doi: 10.1155/2022/2957731. eCollection 2022.