• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

估计误分类误差:深入研究基于交叉验证的方法。

Estimating misclassification error: a closer look at cross-validation based methods.

作者信息

Ounpraseuth Songthip, Lensing Shelly Y, Spencer Horace J, Kodell Ralph L

机构信息

Department of Biostatistics, University of Arkansas for Medical Sciences, 4301 W. Markham St. Slot 781, Little Rock, AR 72205, USA.

出版信息

BMC Res Notes. 2012 Nov 28;5:656. doi: 10.1186/1756-0500-5-656.

DOI:10.1186/1756-0500-5-656
PMID:23190936
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3556102/
Abstract

BACKGROUND

To estimate a classifier's error in predicting future observations, bootstrap methods have been proposed as reduced-variation alternatives to traditional cross-validation (CV) methods based on sampling without replacement. Monte Carlo (MC) simulation studies aimed at estimating the true misclassification error conditional on the training set are commonly used to compare CV methods. We conducted an MC simulation study to compare a new method of bootstrap CV (BCV) to k-fold CV for estimating clasification error.

FINDINGS

For the low-dimensional conditions simulated, the modest positive bias of k-fold CV contrasted sharply with the substantial negative bias of the new BCV method. This behavior was corroborated using a real-world dataset of prognostic gene-expression profiles in breast cancer patients. Our simulation results demonstrate some extreme characteristics of variance and bias that can occur due to a fault in the design of CV exercises aimed at estimating the true conditional error of a classifier, and that appear not to have been fully appreciated in previous studies. Although CV is a sound practice for estimating a classifier's generalization error, using CV to estimate the fixed misclassification error of a trained classifier conditional on the training set is problematic. While MC simulation of this estimation exercise can correctly represent the average bias of a classifier, it will overstate the between-run variance of the bias.

CONCLUSIONS

We recommend k-fold CV over the new BCV method for estimating a classifier's generalization error. The extreme negative bias of BCV is too high a price to pay for its reduced variance.

摘要

背景

为了估计分类器在预测未来观测值时的误差,已提出自助法作为基于不放回抽样的传统交叉验证(CV)方法的低方差替代方法。旨在估计基于训练集的真实误分类误差的蒙特卡罗(MC)模拟研究通常用于比较CV方法。我们进行了一项MC模拟研究,以比较一种新的自助CV(BCV)方法和k折CV在估计分类误差方面的表现。

研究结果

对于模拟的低维条件,k折CV的适度正偏差与新BCV方法的显著负偏差形成鲜明对比。这种现象在乳腺癌患者预后基因表达谱的真实世界数据集中得到了证实。我们的模拟结果表明,由于旨在估计分类器真实条件误差的CV练习设计存在缺陷,可能会出现一些方差和偏差的极端特征,而这些特征在以前的研究中似乎没有得到充分认识。虽然CV是估计分类器泛化误差的一种合理方法,但使用CV来估计基于训练集的训练好的分类器的固定误分类误差是有问题的。虽然对这种估计练习进行MC模拟可以正确地表示分类器的平均偏差,但它会高估偏差的运行间方差。

结论

在估计分类器的泛化误差时,我们推荐使用k折CV而不是新的BCV方法。BCV的极端负偏差对于其降低的方差来说代价太高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/3197bc8e9863/1756-0500-5-656-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/5946f0a79b96/1756-0500-5-656-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/176768904d72/1756-0500-5-656-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/c5ef50140625/1756-0500-5-656-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/aa0de615b2ab/1756-0500-5-656-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/3197bc8e9863/1756-0500-5-656-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/5946f0a79b96/1756-0500-5-656-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/176768904d72/1756-0500-5-656-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/c5ef50140625/1756-0500-5-656-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/aa0de615b2ab/1756-0500-5-656-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20f3/3556102/3197bc8e9863/1756-0500-5-656-5.jpg

相似文献

1
Estimating misclassification error: a closer look at cross-validation based methods.估计误分类误差:深入研究基于交叉验证的方法。
BMC Res Notes. 2012 Nov 28;5:656. doi: 10.1186/1756-0500-5-656.
2
Bias in error estimation when using cross-validation for model selection.在使用交叉验证进行模型选择时误差估计中的偏差。
BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.
3
Estimating misclassification error with small samples via bootstrap cross-validation.通过自助法交叉验证用小样本估计误分类误差。
Bioinformatics. 2005 May 1;21(9):1979-86. doi: 10.1093/bioinformatics/bti294. Epub 2005 Feb 2.
4
Is cross-validation valid for small-sample microarray classification?交叉验证对小样本微阵列分类是否有效?
Bioinformatics. 2004 Feb 12;20(3):374-80. doi: 10.1093/bioinformatics/btg419.
5
Classification based upon gene expression data: bias and precision of error rates.基于基因表达数据的分类:错误率的偏差与精度
Bioinformatics. 2007 Jun 1;23(11):1363-70. doi: 10.1093/bioinformatics/btm117. Epub 2007 Mar 28.
6
A Simulation Study to Assess the Effect of Analytic Error on Neonatal Glucose Measurements Using the Canadian Pediatric Society Position Statement Action Thresholds.一项使用加拿大儿科学会立场声明行动阈值评估分析误差对新生儿血糖测量影响的模拟研究。
J Diabetes Sci Technol. 2020 May;14(3):519-525. doi: 10.1177/1932296819884923. Epub 2019 Nov 6.
7
Prediction error estimation: a comparison of resampling methods.预测误差估计:重采样方法的比较
Bioinformatics. 2005 Aug 1;21(15):3301-7. doi: 10.1093/bioinformatics/bti499. Epub 2005 May 19.
8
Comparison of Artificial Intelligence Techniques to Evaluate Performance of a Classifier for Automatic Grading of Prostate Cancer From Digitized Histopathologic Images.比较人工智能技术评估分类器在从数字化组织病理学图像自动分级前列腺癌方面的性能。
JAMA Netw Open. 2019 Mar 1;2(3):e190442. doi: 10.1001/jamanetworkopen.2019.0442.
9
Parametric and nonparametric population methods: their comparative performance in analysing a clinical dataset and two Monte Carlo simulation studies.参数和非参数总体方法:它们在分析临床数据集和两项蒙特卡罗模拟研究中的比较性能。
Clin Pharmacokinet. 2006;45(4):365-83. doi: 10.2165/00003088-200645040-00003.
10
Is cross-validation better than resubstitution for ranking genes?在对基因进行排名时,交叉验证是否比重替代法更好?
Bioinformatics. 2004 Jan 22;20(2):253-8. doi: 10.1093/bioinformatics/btg399.

引用本文的文献

1
An early sepsis prediction model utilizing machine learning and unbalanced data processing in a clinical context.一种在临床环境中利用机器学习和不平衡数据处理的早期脓毒症预测模型。
Prev Med Rep. 2024 Aug 2;45:102841. doi: 10.1016/j.pmedr.2024.102841. eCollection 2024 Sep.
2
A nutritional biomarker score of the Mediterranean diet and incident type 2 diabetes: Integrated analysis of data from the MedLey randomised controlled trial and the EPIC-InterAct case-cohort study.地中海饮食的营养生物标志物评分与 2 型糖尿病发病风险:来自 MedLey 随机对照试验和 EPIC-InterAct 病例对照研究的综合分析。
PLoS Med. 2023 Apr 27;20(4):e1004221. doi: 10.1371/journal.pmed.1004221. eCollection 2023 Apr.
3

本文引用的文献

1
An evaluation of resampling methods for assessment of survival risk prediction in high-dimensional settings.高维环境下评估生存风险预测的重采样方法评估。
Stat Med. 2011 Mar 15;30(6):642-53. doi: 10.1002/sim.4106. Epub 2010 Dec 1.
2
Ensemble methods for classification of patients for personalized medicine with high-dimensional data.用于基于高维数据的个性化医疗中患者分类的集成方法。
Artif Intell Med. 2007 Nov;41(3):197-207. doi: 10.1016/j.artmed.2007.07.003. Epub 2007 Aug 23.
3
Prediction error estimation: a comparison of resampling methods.
Single classifier vs. ensemble machine learning approaches for mental health prediction.
用于心理健康预测的单分类器与集成机器学习方法
Brain Inform. 2023 Jan 3;10(1):1. doi: 10.1186/s40708-022-00180-6.
4
Automated Prediction of Employee Attrition Using Ensemble Model Based on Machine Learning Algorithms.基于机器学习算法的集成模型对员工离职的自动预测。
Comput Intell Neurosci. 2022 Jun 26;2022:7728668. doi: 10.1155/2022/7728668. eCollection 2022.
5
Predicting hERG channel blockers with directed message passing neural networks.使用定向消息传递神经网络预测人乙醚 - 去极化激活钾离子通道(hERG)阻滞剂
RSC Adv. 2022 Jan 26;12(6):3423-3430. doi: 10.1039/d1ra07956e. eCollection 2022 Jan 24.
6
Prevalence of Non-alcoholic Fatty Liver Disease among Patients with Diabetes Mellitus Attending Primary Health Care Centers in Bahrain.巴林初级卫生保健中心糖尿病患者中非酒精性脂肪性肝病的患病率
Oman Med J. 2022 Mar 22;37(2):e350. doi: 10.5001/omj.2022.53. eCollection 2022 Mar.
7
Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation.使用全子集模型选择和重复 K 折交叉验证预测潜在可预防住院的未来地理热点。
Int J Environ Res Public Health. 2021 Sep 29;18(19):10253. doi: 10.3390/ijerph181910253.
8
Prediction of Preeclampsia and Intrauterine Growth Restriction: Development of Machine Learning Models on a Prospective Cohort.子痫前期和胎儿生长受限的预测:基于前瞻性队列的机器学习模型开发
JMIR Med Inform. 2020 May 18;8(5):e15411. doi: 10.2196/15411.
9
Analysis of TaqMan Array Cards Data by an Assumption-Free Improvement of the maxRatio Algorithm Is More Accurate than the Cycle-Threshold Method.通过对maxRatio算法进行无假设改进来分析TaqMan Array Cards数据,比循环阈值法更准确。
PLoS One. 2016 Nov 9;11(11):e0165282. doi: 10.1371/journal.pone.0165282. eCollection 2016.
预测误差估计:重采样方法的比较
Bioinformatics. 2005 Aug 1;21(15):3301-7. doi: 10.1093/bioinformatics/bti499. Epub 2005 May 19.
4
Estimating misclassification error with small samples via bootstrap cross-validation.通过自助法交叉验证用小样本估计误分类误差。
Bioinformatics. 2005 May 1;21(9):1979-86. doi: 10.1093/bioinformatics/bti294. Epub 2005 Feb 2.
5
The utility of structure-activity relationship (SAR) models for prediction and covariate selection in developmental toxicity: comparative analysis of logistic regression and decision tree models.结构-活性关系(SAR)模型在发育毒性预测和协变量选择中的效用:逻辑回归模型与决策树模型的比较分析
SAR QSAR Environ Res. 2004 Feb;15(1):1-18. doi: 10.1080/1062936032000169633.
6
A gene-expression signature as a predictor of survival in breast cancer.一种作为乳腺癌生存预测指标的基因表达特征。
N Engl J Med. 2002 Dec 19;347(25):1999-2009. doi: 10.1056/NEJMoa021967.
7
Selection bias in gene extraction on the basis of microarray gene-expression data.基于微阵列基因表达数据进行基因提取时的选择偏倚。
Proc Natl Acad Sci U S A. 2002 May 14;99(10):6562-6. doi: 10.1073/pnas.102102699. Epub 2002 Apr 30.
8
Tumor classification by partial least squares using microarray gene expression data.利用微阵列基因表达数据通过偏最小二乘法进行肿瘤分类。
Bioinformatics. 2002 Jan;18(1):39-50. doi: 10.1093/bioinformatics/18.1.39.
9
Gene expression profiling predicts clinical outcome of breast cancer.基因表达谱分析可预测乳腺癌的临床预后。
Nature. 2002 Jan 31;415(6871):530-6. doi: 10.1038/415530a.