• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用WGAN-GP数据增强和XGBoost算法提高体脂预测能力。

Enhancing body fat prediction with WGAN-GP data augmentation and XGBoost algorithm.

作者信息

Wang Xiangyu, Chang Shuai

机构信息

Department of Physical Education, Capital Normal University, Beijing, China.

出版信息

Sci Prog. 2025 Jul-Sep;108(3):368504251366850. doi: 10.1177/00368504251366850. Epub 2025 Aug 6.

DOI:10.1177/00368504251366850
PMID:40770941
Abstract

Background and ObjectiveMachine learning models offer a practical approach for estimating body fat percentage from simple anthropometric data. However, the scarcity of biomedical data frequently leads to model overfitting, compromising predictive accuracy. Generative data augmentation presents a promising strategy to address this limitation. This study develops and evaluates a generative data augmentation framework to enhance body fat prediction from limited anthropometric data.MethodsA public dataset comprising 249 male subjects was partitioned into development (80%) and test (20%) sets. The fidelity of Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP), random noise injection, and mixup was compared to select the optimal method. Subsequently, XGBoost, Support Vector Regression, and Multi-layer Perceptron models were trained and validated, comparing performance with and without the selected augmentation. Final model generalization was assessed on the independent test set using the coefficient of determination (R²), Mean Absolute Error, and Root Mean Squared Error.ResultsAmong the evaluated augmentation techniques, the WGAN-GP generated synthetic data with the highest fidelity. On the original data, the baseline XGBoost model achieved a R² of 0.67; this performance increased to 0.77 on the test set when using WGAN-GP augmentation. Feature importance analysis of the final model identified abdominal circumference as the most significant predictor of body fat percentage.ConclusionThe WGAN-GP is a highly effective method for generating realistic synthetic anthropometric data. Integrating these synthetic samples into the training pipeline substantially improves the generalization and predictive accuracy of machine learning models. This methodology offers a robust solution for developing more accurate and accessible predictive health models in data-scarce environments.

摘要

背景与目的

机器学习模型为从简单人体测量数据估计体脂百分比提供了一种实用方法。然而,生物医学数据的稀缺常常导致模型过度拟合,从而影响预测准确性。生成式数据增强是解决这一局限性的一种有前景的策略。本研究开发并评估了一种生成式数据增强框架,以提高基于有限人体测量数据的体脂预测能力。

方法

一个包含249名男性受试者的公共数据集被划分为开发集(80%)和测试集(20%)。比较了带梯度惩罚的瓦瑟斯坦生成对抗网络(WGAN-GP)、随机噪声注入和混合方法的保真度,以选择最优方法。随后,训练并验证了XGBoost、支持向量回归和多层感知器模型,并比较了有无所选增强方法时的性能。使用决定系数(R²)、平均绝对误差和均方根误差在独立测试集上评估最终模型的泛化能力。

结果

在评估的增强技术中,WGAN-GP生成的合成数据保真度最高。在原始数据上,基线XGBoost模型的R²为0.67;使用WGAN-GP增强时,该性能在测试集上提高到了0.77。最终模型的特征重要性分析确定腹围是体脂百分比最重要的预测因子。

结论

WGAN-GP是生成逼真的合成人体测量数据的高效方法。将这些合成样本整合到训练流程中可显著提高机器学习模型的泛化能力和预测准确性。该方法为在数据稀缺环境中开发更准确、更易获取的预测健康模型提供了一种可靠的解决方案。

相似文献

1
Enhancing body fat prediction with WGAN-GP data augmentation and XGBoost algorithm.利用WGAN-GP数据增强和XGBoost算法提高体脂预测能力。
Sci Prog. 2025 Jul-Sep;108(3):368504251366850. doi: 10.1177/00368504251366850. Epub 2025 Aug 6.
2
A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing.一种基于生成对抗网络的数据平衡用于透析中低血压预测的整体框架。
BMC Med Inform Decis Mak. 2025 Jul 10;25(1):257. doi: 10.1186/s12911-025-03094-5.
3
A novel ensemble Wasserstein GAN framework for effective anomaly detection in industrial internet of things environments.一种用于工业物联网环境中有效异常检测的新型集成瓦瑟斯坦生成对抗网络框架。
Sci Rep. 2025 Jul 23;15(1):26786. doi: 10.1038/s41598-025-07533-1.
4
Enhancing buckwheat maturity classification with generative adversarial networks for spectroscopy data augmentation.利用生成对抗网络增强荞麦成熟度分类以进行光谱数据增强。
Front Plant Sci. 2025 Jul 8;16:1604088. doi: 10.3389/fpls.2025.1604088. eCollection 2025.
5
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
6
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
7
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
iEnhancer-GDM: A Deep Learning Framework Based on Generative Adversarial Network and Multi-head Attention Mechanism to Identify Enhancers and Their Strength.iEnhancer-GDM:一种基于生成对抗网络和多头注意力机制的深度学习框架,用于识别增强子及其强度。
Interdiscip Sci. 2025 May 7. doi: 10.1007/s12539-025-00703-9.

本文引用的文献

1
Prediction of fat-free mass and fat mass from bioimpedance spectroscopy and anthropometry: a validation study in 7- to 9-year-old Kuwaiti children.利用生物电阻抗光谱法和人体测量学预测无脂肪体重和脂肪量:一项针对7至9岁科威特儿童的验证研究。
Public Health Nutr. 2025 Apr 21;28(1):e95. doi: 10.1017/S1368980025000503.
2
Height estimation in children and adolescents using body composition big data: Machine-learning and explainable artificial intelligence approach.利用身体成分大数据进行儿童和青少年身高估计:机器学习与可解释人工智能方法
Digit Health. 2025 Mar 28;11:20552076251331879. doi: 10.1177/20552076251331879. eCollection 2025 Jan-Dec.
3
Tabular transformer generative adversarial network for heterogeneous distribution in healthcare.
用于医疗保健中异构分布的表格变压器生成对抗网络。
Sci Rep. 2025 Mar 25;15(1):10254. doi: 10.1038/s41598-025-93077-3.
4
Optimized Drug-Drug Interaction Extraction With BioGPT and Focal Loss-Based Attention.基于BioGPT和基于焦点损失的注意力机制的优化药物-药物相互作用提取
IEEE J Biomed Health Inform. 2025 Jun;29(6):4560-4570. doi: 10.1109/JBHI.2025.3540861.
5
Enhanced Conditional GAN for High-Quality Synthetic Tabular Data Generation in Mobile-Based Cardiovascular Healthcare.用于基于移动设备的心血管医疗保健中高质量合成表格数据生成的增强条件生成对抗网络
Sensors (Basel). 2024 Nov 30;24(23):7673. doi: 10.3390/s24237673.
6
A Wasserstein perspective of Vanilla GANs.香草生成对抗网络的瓦瑟斯坦视角。
Neural Netw. 2025 Jan;181:106770. doi: 10.1016/j.neunet.2024.106770. Epub 2024 Oct 6.
7
An Interpretable Adaptive Multiscale Attention Deep Neural Network for Tabular Data.一种用于表格数据的可解释自适应多尺度注意力深度神经网络。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6995-7009. doi: 10.1109/TNNLS.2024.3392355. Epub 2025 Apr 4.
8
Accuracy of automated segmentation and volumetry of acute intracerebral hemorrhage following minimally invasive surgery using a patch-based convolutional neural network in a small dataset.基于小块卷积神经网络的微创手术后急性脑出血自动分割与体积测量在小数据集中的准确性
Neuroradiology. 2024 Apr;66(4):601-608. doi: 10.1007/s00234-024-03311-4. Epub 2024 Feb 17.
9
Biomedical Big Data Technologies, Applications, and Challenges for Precision Medicine: A Review.生物医学大数据技术、精准医学中的应用及挑战:综述
Glob Chall. 2023 Nov 20;8(1):2300163. doi: 10.1002/gch2.202300163. eCollection 2024 Jan.
10
MedGAN: optimized generative adversarial network with graph convolutional networks for novel molecule design.MedGAN:基于图卷积网络的优化生成对抗网络用于新型分子设计。
Sci Rep. 2024 Jan 12;14(1):1212. doi: 10.1038/s41598-023-50834-6.