• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A Resampling Method to Improve the Prognostic Model of End-Stage Kidney Disease: A Better Strategy for Imbalanced Data.一种改进终末期肾病预后模型的重采样方法:针对不平衡数据的更好策略。
Front Med (Lausanne). 2022 Mar 7;9:730748. doi: 10.3389/fmed.2022.730748. eCollection 2022.
2
A hybrid resampling algorithms SMOTE and ENN based deep learning models for identification of Marburg virus inhibitors.一种基于混合重采样算法SMOTE和ENN的深度学习模型用于鉴定马尔堡病毒抑制剂。
Future Med Chem. 2022 May;14(10):701-715. doi: 10.4155/fmc-2021-0290. Epub 2022 Apr 8.
3
Structure-activity relationship-based chemical classification of highly imbalanced Tox21 datasets.基于结构-活性关系的高度不平衡Tox21数据集的化学分类
J Cheminform. 2020 Oct 27;12(1):66. doi: 10.1186/s13321-020-00468-x.
4
RSMOTE: improving classification performance over imbalanced medical datasets.RSMOTE:提升不平衡医学数据集的分类性能
Health Inf Sci Syst. 2020 Jun 12;8(1):22. doi: 10.1007/s13755-020-00112-w. eCollection 2020 Dec.
5
A hybrid sampling algorithm combining M-SMOTE and ENN based on Random forest for medical imbalanced data.基于随机森林的 M-SMOTE 与ENN 混合采样算法在医学不平衡数据中的应用
J Biomed Inform. 2020 Jul;107:103465. doi: 10.1016/j.jbi.2020.103465. Epub 2020 Jun 5.
6
Combining Resampling Strategies and Ensemble Machine Learning Methods to Enhance Prediction of Neonates with a Low Apgar Score After Induction of Labor in Northern Tanzania.结合重采样策略和集成机器学习方法以增强对坦桑尼亚北部引产术后低阿氏评分新生儿的预测
Risk Manag Healthc Policy. 2021 Sep 7;14:3711-3720. doi: 10.2147/RMHP.S331077. eCollection 2021.
7
A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare.一种用于医疗保健中高度不平衡数据分类的自检测自适应合成少数过采样技术算法(SASMOTE)。
BioData Min. 2023 Apr 25;16(1):15. doi: 10.1186/s13040-023-00330-4.
8
Stroke Prediction with Machine Learning Methods among Older Chinese.基于机器学习方法对中国老年人进行中风预测。
Int J Environ Res Public Health. 2020 Mar 12;17(6):1828. doi: 10.3390/ijerph17061828.
9
[Comparison of machine learning method and logistic regression model in prediction of acute kidney injury in severely burned patients].[机器学习方法与逻辑回归模型在预测重度烧伤患者急性肾损伤中的比较]
Zhonghua Shao Shang Za Zhi. 2018 Jun 20;34(6):343-348. doi: 10.3760/cma.j.issn.1009-2587.2018.06.006.
10
Comparison of Resampling Techniques for Imbalanced Datasets in Machine Learning: Application to Epileptogenic Zone Localization From Interictal Intracranial EEG Recordings in Patients With Focal Epilepsy.机器学习中不平衡数据集的重采样技术比较:在局灶性癫痫患者发作间期颅内脑电图记录的致痫区定位中的应用
Front Neuroinform. 2021 Nov 19;15:715421. doi: 10.3389/fninf.2021.715421. eCollection 2021.

引用本文的文献

1
Deep learning analysis of long COVID and vaccine impact in low- and middle-income countries (LMICs): development of a risk calculator in a multicentric study.低收入和中等收入国家(LMICs)中长新冠及疫苗影响的深度学习分析:一项多中心研究中风险计算器的开发
Front Public Health. 2025 Jun 26;13:1416273. doi: 10.3389/fpubh.2025.1416273. eCollection 2025.
2
Assessment of Soybean Lodging Using UAV Imagery and Machine Learning.利用无人机影像和机器学习评估大豆倒伏情况
Plants (Basel). 2023 Aug 8;12(16):2893. doi: 10.3390/plants12162893.
3
Multilayer perceptron-based self-care early prediction of children with disabilities.基于多层感知器的残疾儿童自我护理早期预测
Digit Health. 2023 Jun 29;9:20552076231184054. doi: 10.1177/20552076231184054. eCollection 2023 Jan-Dec.
4
Explainable ensemble machine learning model for prediction of 28-day mortality risk in patients with sepsis-associated acute kidney injury.用于预测脓毒症相关性急性肾损伤患者28天死亡风险的可解释集成机器学习模型。
Front Med (Lausanne). 2023 May 18;10:1165129. doi: 10.3389/fmed.2023.1165129. eCollection 2023.
5
Construct and Validate a Predictive Model for Surgical Site Infection after Posterior Lumbar Interbody Fusion Based on Machine Learning Algorithm.基于机器学习算法构建和验证后路腰椎椎间融合术后手术部位感染的预测模型。
Comput Math Methods Med. 2022 Aug 23;2022:2697841. doi: 10.1155/2022/2697841. eCollection 2022.

本文引用的文献

1
Prognostic models for chronic kidney disease: a systematic review and external validation.慢性肾脏病的预后模型:系统评价与外部验证
Nephrol Dial Transplant. 2021 Sep 27;36(10):1837-1850. doi: 10.1093/ndt/gfaa155.
2
Long-Term Effects of Intensive Low-Salt Diet Education on Deterioration of Glomerular Filtration Rate among Non-Diabetic Hypertensive Patients with Chronic Kidney Disease.非糖尿病肾病高血压患者强化低盐饮食教育对肾小球滤过率恶化的长期影响。
Kidney Blood Press Res. 2019;44(5):1101-1114. doi: 10.1159/000502354. Epub 2019 Sep 18.
3
Healthcare costs of patients on different renal replacement modalities - Analysis of Dutch health insurance claims data.不同肾脏替代治疗方式患者的医疗费用 - 荷兰健康保险索赔数据分析。
PLoS One. 2019 Aug 15;14(8):e0220800. doi: 10.1371/journal.pone.0220800. eCollection 2019.
4
Towards the best kidney failure prediction tool: a systematic review and selection aid.迈向最佳肾衰竭预测工具:系统评价与选择辅助。
Nephrol Dial Transplant. 2020 Sep 1;35(9):1527-1538. doi: 10.1093/ndt/gfz018.
5
Antihypertensive Medications and Change in Stages of Chronic Kidney Disease.抗高血压药物与慢性肾脏病分期的变化
Int J Chronic Dis. 2018 Feb 25;2018:1382705. doi: 10.1155/2018/1382705. eCollection 2018.
6
Effects of Intensive BP Control in CKD.慢性肾脏病中强化血压控制的效果
J Am Soc Nephrol. 2017 Sep;28(9):2812-2823. doi: 10.1681/ASN.2017020148. Epub 2017 Jun 22.
7
International differences in chronic kidney disease prevalence: a key public health and epidemiologic research issue.慢性肾脏病患病率的国际差异:一个关键的公共卫生与流行病学研究问题。
Nephrol Dial Transplant. 2017 Apr 1;32(suppl_2):ii129-ii135. doi: 10.1093/ndt/gfw420.
8
Anthropometric and Metabolic Risk Factors for ESRD Are Disease-Specific: Results from a Large Population-Based Cohort Study in Austria.终末期肾病的人体测量和代谢风险因素具有疾病特异性:奥地利一项基于大规模人群队列研究的结果
PLoS One. 2016 Aug 18;11(8):e0161376. doi: 10.1371/journal.pone.0161376. eCollection 2016.
9
Long-term Benefits of Intensive Glucose Control for Preventing End-Stage Kidney Disease: ADVANCE-ON.强化血糖控制对预防终末期肾病的长期获益: ADVANCE-ON。
Diabetes Care. 2016 May;39(5):694-700. doi: 10.2337/dc15-2322. Epub 2016 Mar 22.
10
The correlation between blood pressure and kidney function decline in older people: a registry-based cohort study.老年人血压与肾功能衰退之间的相关性:一项基于登记处的队列研究。
BMJ Open. 2015 Jun 30;5(6):e007571. doi: 10.1136/bmjopen-2015-007571.

一种改进终末期肾病预后模型的重采样方法:针对不平衡数据的更好策略。

A Resampling Method to Improve the Prognostic Model of End-Stage Kidney Disease: A Better Strategy for Imbalanced Data.

作者信息

Shi Xi, Qu Tingyu, Van Pottelbergh Gijs, van den Akker Marjan, De Moor Bart

机构信息

Department of Electrical Engineering (ESAT), Stadius Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium.

Vlerick Business School, Leuven, Belgium.

出版信息

Front Med (Lausanne). 2022 Mar 7;9:730748. doi: 10.3389/fmed.2022.730748. eCollection 2022.

DOI:10.3389/fmed.2022.730748
PMID:35321465
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8935060/
Abstract

BACKGROUND

Prognostic models can help to identify patients at risk for end-stage kidney disease (ESKD) at an earlier stage to provide preventive medical interventions. Previous studies mostly applied the Cox proportional hazards model. The aim of this study is to present a resampling method, which can deal with imbalanced data structure for the prognostic model and help to improve predictive performance.

METHODS

The electronic health records of patients with chronic kidney disease (CKD) older than 50 years during 2005-2015 collected from primary care in Belgium were used ( = 11,645). Both the Cox proportional hazards model and the logistic regression analysis were applied as reference model. Then, the resampling method, the Synthetic Minority Over-Sampling Technique-Edited Nearest Neighbor (SMOTE-ENN), was applied as a preprocessing procedure followed by the logistic regression analysis. The performance was evaluated by accuracy, the area under the curve (AUC), confusion matrix, and score.

RESULTS

The C statistics for the Cox proportional hazards model was 0.807, while the AUC for the logistic regression analysis was 0.700, both on a comparable level to previous studies. With the model trained on the resampled set, 86.3% of patients with ESKD were correctly identified, although it was at the cost of the high misclassification rate of negative cases. The score was 0.245, much higher than 0.043 for the logistic regression analysis and 0.022 for the Cox proportional hazards model.

CONCLUSION

This study pointed out the imbalanced data structure and its effects on prediction accuracy, which were not thoroughly discussed in previous studies. We were able to identify patients with high risk for ESKD better from a clinical perspective by using the resampling method. But, it has the limitation of the high misclassification of negative cases. The technique can be widely used in other clinical topics when imbalanced data structure should be considered.

摘要

背景

预后模型有助于在更早阶段识别终末期肾病(ESKD)风险患者,以便提供预防性医疗干预措施。以往研究大多应用Cox比例风险模型。本研究旨在提出一种重采样方法,该方法可处理预后模型的不平衡数据结构并有助于提高预测性能。

方法

使用从比利时初级保健机构收集的2005年至2015年期间年龄大于50岁的慢性肾脏病(CKD)患者的电子健康记录(n = 11,645)。Cox比例风险模型和逻辑回归分析均作为参考模型应用。然后,应用重采样方法,即合成少数过采样技术编辑最近邻法(SMOTE - ENN)作为预处理程序,随后进行逻辑回归分析。通过准确性、曲线下面积(AUC)、混淆矩阵和F1分数评估性能。

结果

Cox比例风险模型的C统计量为0.807,而逻辑回归分析的AUC为0.700,两者均与以往研究处于可比水平。使用在重采样集上训练的模型,86.3%的ESKD患者被正确识别,尽管这是以阴性病例的高误分类率为代价的。F1分数为0.245,远高于逻辑回归分析的0.043和Cox比例风险模型的0.022。

结论

本研究指出了不平衡数据结构及其对预测准确性的影响,而以往研究并未对此进行充分讨论。通过使用重采样方法,我们能够从临床角度更好地识别ESKD高风险患者。但是,它存在阴性病例误分类率高的局限性。当应考虑不平衡数据结构时,该技术可广泛应用于其他临床主题。