• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于生成对抗网络的数据平衡用于透析中低血压预测的整体框架。

A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing.

作者信息

Lin Hsuan-Ming, Lyu JrJung

机构信息

Institute of Information Management, National Cheng Kung University, Tainan, Taiwan.

Internal Medicine, Nephrology Division, An Nan Hospital, China Medical University, Tainan, Taiwan.

出版信息

BMC Med Inform Decis Mak. 2025 Jul 10;25(1):257. doi: 10.1186/s12911-025-03094-5.

DOI:10.1186/s12911-025-03094-5
PMID:40635002
Abstract

BACKGROUND

Intradialytic Hypotension (IDH) is a frequent complication in hemodialysis, yet predictive modeling is challenged by class imbalance. Traditional oversampling methods often struggle with complex clinical data. This study evaluates an enhanced conditional Wasserstein Generative Adversarial Network with Gradient Penalty (CWGAN-GP) framework to improve IDH prediction by generating high-utility synthetic data for balancing.

METHODS

A CWGAN-GP was developed using multi-level hemodialysis data. Following rigorous preprocessing, including a strict temporal train-test split, the CWGAN-GP generated minority class samples exclusively on the training data. eXtreme Gradient Boosting (XGBoost) models were trained on the original imbalanced data and datasets balanced using the proposed CWGAN-GP method, benchmarked against traditional Synthetic Minority Over-sampling Technique(SMOTE) and Adaptive Synthetic Sampling Approach(ADASYN) balancing. Performance was evaluated using metrics sensitive to imbalance (e.g., Precision-Recall Area Under the Curve) and statistical comparisons, with SHapley Additive exPlanations (SHAP) analysis for interpretability.

RESULTS

The study population consisted of 40 chronic hemodialysis patients (45% male, mean age 66.30[Formula: see text] 10.68 years). An initial dataset, where intradialytic hypotension (IDH) events occurred in 14.85% of records (19,124 instances overall), was temporally split (75:25 ratio). This yielded an Original Training dataset of 95,856 samples (14.73% IDH rate) and a test set (15.21% IDH rate). From this Original Training dataset, a Generative Adversarial Network (GAN) was employed to construct a balanced dataset comprising 163,470 samples. The GAN Balanced dataset yielded the highest predictive performance, demonstrating statistically significant improvements over the Original Training dataset across metrics, including Precision-Recall Area Under the Curve (PR-AUC) (mean 0.735 vs 0.724) and Accuracy (mean 0.900 vs 0.892). In contrast, the GAN Augmented dataset (191,712 samples) showed mixed results (improved Accuracy/F1, decreased Receiver Operating Characteristic Curve Area Under Curve (ROC-AUC)/PR-AUC). In comparison, ADASYN (163,326 samples) and SMOTE (163,470 samples) balanced datasets significantly underperformed on PR-AUC. SHAP analysis identified Dialysis Date (as a proxy for temporal patterns like day-of-week) and hemodynamic indicators (e.g., Systolic Diastolic Difference, Previous Systolic Pressure) as key IDH predictors.

CONCLUSION

The proposed CWGAN-GP framework effectively balances complex hemodialysis data, leading to significantly improved and interpretable IDH prediction models compared to standard approaches. This work supports leveraging advanced generative models like GAN to overcome data imbalance in clinical prediction tasks, which is pending further validation.

摘要

背景

透析中低血压(IDH)是血液透析中常见的并发症,但预测模型受到类别不平衡的挑战。传统的过采样方法在处理复杂的临床数据时往往存在困难。本研究评估了一种带有梯度惩罚的增强条件瓦瑟斯坦生成对抗网络(CWGAN-GP)框架,通过生成高实用性的合成数据来平衡数据,以改善IDH预测。

方法

使用多级血液透析数据开发了CWGAN-GP。经过严格的预处理,包括严格的时间序列训练-测试分割,CWGAN-GP仅在训练数据上生成少数类样本。使用极端梯度提升(XGBoost)模型在原始不平衡数据和使用所提出的CWGAN-GP方法平衡后的数据集上进行训练,并与传统的合成少数过采样技术(SMOTE)和自适应合成采样方法(ADASYN)平衡方法进行基准比较。使用对不平衡敏感的指标(如精确召回率曲线下面积)和统计比较来评估性能,并使用SHapley值相加解释(SHAP)分析来进行可解释性分析。

结果

研究人群包括40例慢性血液透析患者(45%为男性,平均年龄66.30[公式:见原文]10.68岁)。初始数据集记录中14.85%发生透析中低血压(IDH)事件(共19124例),按时间序列进行分割(75:25比例)。这产生了一个包含95856个样本的原始训练数据集(IDH发生率为14.73%)和一个测试集(IDH发生率为15.21%)。从这个原始训练数据集中,使用生成对抗网络(GAN)构建了一个包含163470个样本的平衡数据集。GAN平衡数据集产生了最高的预测性能,在包括精确召回率曲线下面积(PR-AUC)(均值0.735对0.724)和准确率(均值0.900对0.892)等指标上,与原始训练数据集相比有统计学意义的显著改善。相比之下,GAN增强数据集(191712个样本)结果不一(准确率/F1提高,受试者操作特征曲线下面积(ROC-AUC)/PR-AUC降低)。相比之下,ADASYN(163326个样本)和SMOTE(163470个样本)平衡数据集在PR-AUC上显著表现不佳。SHAP分析确定透析日期(作为一周中某天等时间模式的代理)和血流动力学指标(如收缩压与舒张压差值、先前收缩压)为关键的IDH预测因素。

结论

所提出的CWGAN-GP框架有效地平衡了复杂的血液透析数据,与标准方法相比,显著改进了IDH预测模型并使其具有可解释性。这项工作支持利用像GAN这样的先进生成模型来克服临床预测任务中的数据不平衡问题,不过这有待进一步验证。

相似文献

1
A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing.一种基于生成对抗网络的数据平衡用于透析中低血压预测的整体框架。
BMC Med Inform Decis Mak. 2025 Jul 10;25(1):257. doi: 10.1186/s12911-025-03094-5.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Synthetic neurosurgical data generation with generative adversarial networks and large language models:an investigation on fidelity, utility, and privacy.使用生成对抗网络和大语言模型生成合成神经外科数据:关于保真度、实用性和隐私性的调查
Neurosurg Focus. 2025 Jul 1;59(1):E17. doi: 10.3171/2025.4.FOCUS25225.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证
BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.
6
Design of e-commerce product price prediction model based on generative adversarial network with adaptive weight adjustment.基于具有自适应权重调整的生成对抗网络的电子商务产品价格预测模型设计
Sci Rep. 2025 Jul 11;15(1):25126. doi: 10.1038/s41598-025-10767-8.
7
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
8
An explainable machine learning model for early warning of hypertensive and hypotensive anomalies in maintenance hemodialysis patients.一种用于维持性血液透析患者高血压和低血压异常预警的可解释机器学习模型。
BMC Nephrol. 2025 Jul 1;26(1):318. doi: 10.1186/s12882-025-04270-5.
9
Development of an interpretable machine learning model for frailty risk prediction in older adult care institutions: a mixed-methods, cross-sectional study in China.老年护理机构衰弱风险预测的可解释机器学习模型的开发:中国的一项混合方法横断面研究。
BMJ Open. 2025 Jul 5;15(7):e095460. doi: 10.1136/bmjopen-2024-095460.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

本文引用的文献

1
Multicenter research in dialysis centers in Brazil: recruitment and implementation of the SARC-HD study.巴西透析中心的多中心研究:SARC-HD研究的招募与实施
J Bras Nefrol. 2025 Jan-Mar;47(1):e20240009. doi: 10.1590/2175-8239-JBN-2024-0009en.
2
Acute Kidney Injury Prognosis Prediction Using Machine Learning Methods: A Systematic Review.使用机器学习方法预测急性肾损伤的预后:一项系统评价。
Kidney Med. 2024 Nov 15;7(1):100936. doi: 10.1016/j.xkme.2024.100936. eCollection 2025 Jan.
3
A hybridization of XGBoost machine learning model by Optuna hyperparameter tuning suite for cardiovascular disease classification with significant effect of outliers and heterogeneous training datasets.
一种通过Optuna超参数调整套件对XGBoost机器学习模型进行的杂交,用于心血管疾病分类,对异常值和异构训练数据集有显著影响。
Int J Cardiol. 2025 Feb 1;420:132757. doi: 10.1016/j.ijcard.2024.132757. Epub 2024 Nov 28.
4
Chronic kidney disease and the global public health agenda: an international consensus.慢性肾脏病与全球公共卫生议程:国际共识。
Nat Rev Nephrol. 2024 Jul;20(7):473-485. doi: 10.1038/s41581-024-00820-6. Epub 2024 Apr 3.
5
Optimizing HCV Disease Prediction in Egypt: The hyOPTGB Framework.优化埃及丙型肝炎病毒疾病预测:hyOPTGB框架。
Diagnostics (Basel). 2023 Nov 13;13(22):3439. doi: 10.3390/diagnostics13223439.
6
Application of Machine Learning to Predict COVID-19 Spread via an Optimized BPSO Model.通过优化的粒子群优化算法模型应用机器学习预测新冠病毒传播
Biomimetics (Basel). 2023 Sep 28;8(6):457. doi: 10.3390/biomimetics8060457.
7
Prevention of Intradialytic Hypotension in Hemodialysis Patients: Current Challenges and Future Prospects.血液透析患者透析中低血压的预防:当前挑战与未来前景
Int J Nephrol Renovasc Dis. 2023 Aug 1;16:173-181. doi: 10.2147/IJNRD.S245621. eCollection 2023.
8
Impact of frequent intradialytic hypotension on quality of life in patients undergoing hemodialysis.血液透析患者频繁透析中低血压对生活质量的影响。
BMC Nephrol. 2023 Jul 14;24(1):209. doi: 10.1186/s12882-023-03263-6.
9
Intra-dialytic blood pressure variability is a greater predictor of cardiovascular events in hemodialysis patients.透析中血压变异性是血液透析患者心血管事件的更强预测因子。
BMC Nephrol. 2023 Apr 26;24(1):113. doi: 10.1186/s12882-023-03162-w.
10
Real-time prediction of intradialytic hypotension using machine learning and cloud computing infrastructure.使用机器学习和云计算基础设施实时预测透析中低血压。
Nephrol Dial Transplant. 2023 Jun 30;38(7):1761-1769. doi: 10.1093/ndt/gfad070.