交叉验证 t 检验比较监督分类学习算法的 3×2 块。

Blocked 3×2 cross-validated t-test for comparing supervised classification learning algorithms.

机构信息

Computer Center of Shanxi University, Taiyuan 030006, P.R.C.

出版信息

Neural Comput. 2014 Jan;26(1):208-35. doi: 10.1162/NECO_a_00532. Epub 2013 Oct 8.

Abstract

In the research of machine learning algorithms for classification tasks, the comparison of the performances of algorithms is extremely important, and a statistical test of significance for generalization error is often used to perform it in the machine learning literature. In view of the randomness of partitions in cross-validation, a new blocked 3×2 cross-validation is proposed to estimate generalization error in this letter. We then conduct an analysis of variance of the blocked 3×2 cross-validated estimator. A relatively conservative variance estimator that considers the correlation between any two two-fold cross-validations, and was previously neglected in 5×2 cross-validated t and F-tests is put forward. A corresponding test using this variance estimator is presented to compare the performances of algorithms. Simulated results show that the performance of our test is comparable with that of 5×2 cross-validated tests but with less computation complexity.

摘要

在分类任务的机器学习算法研究中，算法性能的比较非常重要，而在机器学习文献中，通常使用对泛化误差的显著性统计检验来进行比较。针对交叉验证中划分的随机性问题，本文提出了一种新的分块 3×2 交叉验证方法来估计泛化误差。然后，我们对分块 3×2 交叉验证估计量进行方差分析。提出了一种相对保守的方差估计量，它考虑了任何两个两重交叉验证之间的相关性，而在之前的 5×2 交叉验证 t 和 F 检验中被忽略了。提出了一种使用该方差估计量的相应检验方法，用于比较算法的性能。模拟结果表明，我们的检验方法的性能与 5×2 交叉验证检验方法相当，但计算复杂度较低。

相似文献

Blocked 3×2 cross-validated t-test for comparing supervised classification learning algorithms.交叉验证 t 检验比较监督分类学习算法的 3×2 块。

Neural Comput. 2014 Jan;26(1):208-35. doi: 10.1162/NECO_a_00532. Epub 2013 Oct 8.

Block-Regularized m × 2 Cross-Validated Estimator of the Generalization Error.泛化误差的块正则化m×2交叉验证估计器

Neural Comput. 2017 Feb;29(2):519-554. doi: 10.1162/NECO_a_00923. Epub 2016 Dec 28.

Benchmarking protein classification algorithms via supervised cross-validation.通过监督交叉验证对蛋白质分类算法进行基准测试。

J Biochem Biophys Methods. 2008 Apr 24;70(6):1215-23. doi: 10.1016/j.jbbm.2007.05.011. Epub 2007 May 31.

Sensitivity analysis of kappa-fold cross validation in prediction error estimation.kappa 折叠交叉验证在预测误差估计中的敏感性分析。

IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):569-75. doi: 10.1109/TPAMI.2009.187.

Supervised machine learning algorithms for protein structure classification.用于蛋白质结构分类的监督式机器学习算法。

Comput Biol Chem. 2009 Jun;33(3):216-23. doi: 10.1016/j.compbiolchem.2009.04.004. Epub 2009 May 3.

Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.用于比较监督分类学习算法的近似统计检验

Neural Comput. 1998 Sep 15;10(7):1895-1923. doi: 10.1162/089976698300017197.

Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms.基于旋转森林的分类器集成构建，以提高机器学习算法的医学诊断性能。

Comput Methods Programs Biomed. 2011 Dec;104(3):443-51. doi: 10.1016/j.cmpb.2011.03.018. Epub 2011 Apr 30.

SemiBoost: boosting for semi-supervised learning.半增强算法：用于半监督学习的增强算法

IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2000-14. doi: 10.1109/TPAMI.2008.235.

Channel selection and classification of electroencephalogram signals: an artificial neural network and genetic algorithm-based approach.脑电信号的通道选择与分类：基于人工神经网络和遗传算法的方法。

Artif Intell Med. 2012 Jun;55(2):117-26. doi: 10.1016/j.artmed.2012.02.001. Epub 2012 Apr 12.

[Application of support vector machines to classification of blood cells].[支持向量机在血细胞分类中的应用]

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2003 Sep;20(3):484-7.

引用本文的文献

An intrusion detection system based on convolution neural network.一种基于卷积神经网络的入侵检测系统。

PeerJ Comput Sci. 2024 Jun 28;10:e2152. doi: 10.7717/peerj-cs.2152. eCollection 2024.

Interpretable prediction of brain activity during conversations from multimodal behavioral signals.从多模态行为信号可解释地预测对话中的大脑活动。

PLoS One. 2024 Mar 21;19(3):e0284342. doi: 10.1371/journal.pone.0284342. eCollection 2024.

Variance estimation based on blocked 3×2 cross-validation in high-dimensional linear regression.基于高维线性回归中分组3×2交叉验证的方差估计

J Appl Stat. 2020 Jun 18;48(11):1934-1947. doi: 10.1080/02664763.2020.1780571. eCollection 2021.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

交叉验证 t 检验比较监督分类学习算法的 3×2 块。

Blocked 3×2 cross-validated t-test for comparing supervised classification learning algorithms.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献