• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的预测模型中类别不平衡校正的危害:一项模拟研究

The Harms of Class Imbalance Corrections for Machine Learning Based Prediction Models: A Simulation Study.

作者信息

Carriero Alex, Luijken Kim, de Hond Anne, Moons Karel G M, van Calster Ben, van Smeden Maarten

机构信息

Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands.

KU Leuven, Leuven, Belgium.

出版信息

Stat Med. 2025 Feb 10;44(3-4):e10320. doi: 10.1002/sim.10320.

DOI:10.1002/sim.10320
PMID:39865585
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11771573/
Abstract

INTRODUCTION

Risk prediction models are increasingly used in healthcare to aid in clinical decision-making. In most clinical contexts, model calibration (i.e., assessing the reliability of risk estimates) is critical. Data available for model development are often not perfectly balanced with the modeled outcome (i.e., individuals with vs. without the event of interest are not equally prevalent in the data). It is common for researchers to correct for class imbalance, yet, the effect of such imbalance corrections on the calibration of machine learning models is largely unknown.

METHODS

We studied the effect of imbalance corrections on model calibration for a variety of machine learning algorithms. Using extensive Monte Carlo simulations we compared the out-of-sample predictive performance of models developed with an imbalance correction to those developed without a correction for class imbalance across different data-generating scenarios (varying sample size, the number of predictors, and event fraction). Our findings were illustrated in a case study using MIMIC-III data.

RESULTS

In all simulation scenarios, prediction models developed without a correction for class imbalance consistently had equal or better calibration performance than prediction models developed with a correction for class imbalance. The miscalibration introduced by correcting for class imbalance was characterized by an over-estimation of risk and was not always able to be corrected with re-calibration.

CONCLUSION

Correcting for class imbalance is not always necessary and may even be harmful to clinical prediction models which aim to produce reliable risk estimates on an individual basis.

摘要

引言

风险预测模型在医疗保健领域越来越多地用于辅助临床决策。在大多数临床情况下,模型校准(即评估风险估计的可靠性)至关重要。可用于模型开发的数据通常与建模结果不完全平衡(即,数据中具有与不具有感兴趣事件的个体并不同样普遍)。研究人员校正类别不平衡是很常见的,然而,这种不平衡校正对机器学习模型校准的影响在很大程度上尚不清楚。

方法

我们研究了不平衡校正对各种机器学习算法模型校准的影响。使用广泛的蒙特卡罗模拟,我们比较了在不同数据生成场景(不同样本量、预测变量数量和事件发生率)下,使用不平衡校正开发的模型与未校正类别不平衡开发的模型的样本外预测性能。我们的研究结果在一个使用MIMIC-III数据的案例研究中得到了说明。

结果

在所有模拟场景中,未校正类别不平衡开发的预测模型始终具有与校正类别不平衡开发的预测模型相同或更好的校准性能。通过校正类别不平衡引入的校准错误的特征是风险高估,并且并不总是能够通过重新校准来校正。

结论

校正类别不平衡并不总是必要的,甚至可能对旨在为个体生成可靠风险估计的临床预测模型有害。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/923490be3331/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/eaad179de117/SIM-44-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/92949f06c788/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/15da618cccb2/SIM-44-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/775f25cb8c5e/SIM-44-0-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/5fa5830631c0/SIM-44-0-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/fe25dac84684/SIM-44-0-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/12b6695e48b8/SIM-44-0-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/923490be3331/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/eaad179de117/SIM-44-0-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/92949f06c788/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/15da618cccb2/SIM-44-0-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/775f25cb8c5e/SIM-44-0-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/5fa5830631c0/SIM-44-0-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/fe25dac84684/SIM-44-0-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/12b6695e48b8/SIM-44-0-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac2a/11771573/923490be3331/SIM-44-0-g001.jpg

相似文献

1
The Harms of Class Imbalance Corrections for Machine Learning Based Prediction Models: A Simulation Study.基于机器学习的预测模型中类别不平衡校正的危害:一项模拟研究
Stat Med. 2025 Feb 10;44(3-4):e10320. doi: 10.1002/sim.10320.
2
Understanding random resampling techniques for class imbalance correction and their consequences on calibration and discrimination of clinical risk prediction models.理解随机重采样技术在类别不平衡校正中的应用及其对临床风险预测模型校准和区分的影响。
J Biomed Inform. 2024 Jul;155:104666. doi: 10.1016/j.jbi.2024.104666. Epub 2024 Jun 6.
3
The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression.类别不平衡校正对风险预测模型的危害:使用逻辑回归进行说明和模拟。
J Am Med Inform Assoc. 2022 Aug 16;29(9):1525-1534. doi: 10.1093/jamia/ocac093.
4
Comparison of discrimination and calibration performance of ECG-based machine learning models for prediction of new-onset atrial fibrillation.基于心电图的机器学习模型预测新发心房颤动的鉴别和校准性能比较。
BMC Med Res Methodol. 2023 Jul 22;23(1):169. doi: 10.1186/s12874-023-01989-3.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.预后模型的性能取决于缺失值插补算法的选择:一项模拟研究。
J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.
7
Stability of clinical prediction models developed using statistical or machine learning methods.基于统计或机器学习方法开发的临床预测模型的稳定性。
Biom J. 2023 Dec;65(8):e2200302. doi: 10.1002/bimj.202200302. Epub 2023 Jul 19.
8
Assessing the effects of data drift on the performance of machine learning models used in clinical sepsis prediction.评估数据漂移对临床脓毒症预测中使用的机器学习模型性能的影响。
Int J Med Inform. 2023 May;173:104930. doi: 10.1016/j.ijmedinf.2022.104930. Epub 2022 Nov 19.
9
Handling Class Imbalance in Machine Learning-based Prediction Models: A Case Study in Asthma Management.基于机器学习的预测模型中的类别不平衡处理:哮喘管理案例研究。
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-5. doi: 10.1109/EMBC40787.2023.10340751.
10
Machine Learning Did Not Outperform Conventional Competing Risk Modeling to Predict Revision Arthroplasty.在预测翻修关节成形术方面,机器学习的表现并未优于传统的竞争风险模型。
Clin Orthop Relat Res. 2024 Aug 1;482(8):1472-1482. doi: 10.1097/CORR.0000000000003018. Epub 2024 Mar 12.

引用本文的文献

1
A data-driven analysis of lumbar steroid injection satisfaction in patients with chronic low back pain.慢性下腰痛患者腰椎类固醇注射满意度的数据驱动分析
Sci Rep. 2025 Jul 29;15(1):27734. doi: 10.1038/s41598-025-10907-0.
2
Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach.基于心电图的肝脏疾病诊断:一种经过外部验证且可解释的机器学习方法。
EClinicalMedicine. 2025 May 20;84:103242. doi: 10.1016/j.eclinm.2025.103242. eCollection 2025 Jun.
3
A descriptor-free machine learning framework to improve antigen discovery for bacterial pathogens.

本文引用的文献

1
Understanding random resampling techniques for class imbalance correction and their consequences on calibration and discrimination of clinical risk prediction models.理解随机重采样技术在类别不平衡校正中的应用及其对临床风险预测模型校准和区分的影响。
J Biomed Inform. 2024 Jul;155:104666. doi: 10.1016/j.jbi.2024.104666. Epub 2024 Jun 6.
2
On the 12th Day of Christmas, a Statistician Sent to Me . .在圣诞节第十二天,一个统计学家寄给我..
BMJ. 2022 Dec 20;379:e072883. doi: 10.1136/bmj-2022-072883.
3
Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models.
一种无描述符的机器学习框架,用于改进细菌病原体的抗原发现。
PLoS One. 2025 Jun 5;20(6):e0323895. doi: 10.1371/journal.pone.0323895. eCollection 2025.
4
Primary care prediction of hip and knee replacement 1-5 years in advance using Temporal Graph-based Convolutional Neural Networks (TG-CNNs).使用基于时间图的卷积神经网络(TG-CNNs)提前1至5年对髋关节和膝关节置换进行初级保健预测。
Rheumatology (Oxford). 2025 Aug 1;64(8):4589-4598. doi: 10.1093/rheumatology/keaf185.
系统评价确定了基于机器学习的预测模型研究的设计和方法实施情况。
J Clin Epidemiol. 2023 Feb;154:8-22. doi: 10.1016/j.jclinepi.2022.11.015. Epub 2022 Nov 25.
4
The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression.类别不平衡校正对风险预测模型的危害:使用逻辑回归进行说明和模拟。
J Am Med Inform Assoc. 2022 Aug 16;29(9):1525-1534. doi: 10.1093/jamia/ocac093.
5
The class imbalance problem.类别不平衡问题。
Nat Methods. 2021 Nov;18(11):1270-1272. doi: 10.1038/s41592-021-01302-4.
6
Clinical prediction models: diagnosis versus prognosis.临床预测模型:诊断与预后。
J Clin Epidemiol. 2021 Apr;132:142-145. doi: 10.1016/j.jclinepi.2021.01.009.
7
Overview of clinical prediction models.临床预测模型概述。
Ann Transl Med. 2020 Feb;8(4):71. doi: 10.21037/atm.2019.11.121.
8
Calibration: the Achilles heel of predictive analytics.校准:预测分析的阿喀琉斯之踵。
BMC Med. 2019 Dec 16;17(1):230. doi: 10.1186/s12916-019-1466-7.
9
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.
10
GLUMIP 2.0: Software for Planning Internal Pilots.GLUMIP 2.0:内部试点规划软件。
J Stat Softw. 2008 Nov;28(7). doi: 10.18637/jss.v028.i07. Epub 2008 Nov 13.