• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高斯模型中分类误差的贝叶斯最小均方误差估计器的矩和均方根误差

Moments and Root-Mean-Square Error of the Bayesian MMSE Estimator of Classification Error in the Gaussian Model.

作者信息

Zollanvari Amin, Dougherty Edward R

机构信息

Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843 ; Department of Statistics, Texas A&M University, College Station, TX 77843.

Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843 ; Translational Genomics Research Institute (TGEN), Phoenix, AZ 85004.

出版信息

Pattern Recognit. 2014 Jun 1;47(6):2178-2192. doi: 10.1016/j.patcog.2013.11.022.

DOI:10.1016/j.patcog.2013.11.022
PMID:24729636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3979595/
Abstract

The most important aspect of any classifier is its error rate, because this quantifies its predictive capacity. Thus, the accuracy of error estimation is critical. Error estimation is problematic in small-sample classifier design because the error must be estimated using the same data from which the classifier has been designed. Use of prior knowledge, in the form of a prior distribution on an uncertainty class of feature-label distributions to which the true, but unknown, feature-distribution belongs, can facilitate accurate error estimation (in the mean-square sense) in circumstances where accurate completely model-free error estimation is impossible. This paper provides analytic asymptotically exact finite-sample approximations for various performance metrics of the resulting Bayesian Minimum Mean-Square-Error (MMSE) error estimator in the case of linear discriminant analysis (LDA) in the multivariate Gaussian model. These performance metrics include the first, second, and cross moments of the Bayesian MMSE error estimator with the true error of LDA, and therefore, the Root-Mean-Square (RMS) error of the estimator. We lay down the theoretical groundwork for Kolmogorov double-asymptotics in a Bayesian setting, which enables us to derive asymptotic expressions of the desired performance metrics. From these we produce analytic finite-sample approximations and demonstrate their accuracy via numerical examples. Various examples illustrate the behavior of these approximations and their use in determining the necessary sample size to achieve a desired RMS. The Supplementary Material contains derivations for some equations and added figures.

摘要

任何分类器最重要的方面是其错误率,因为这量化了它的预测能力。因此,错误估计的准确性至关重要。在小样本分类器设计中,错误估计存在问题,因为必须使用设计分类器所依据的相同数据来估计错误。在真实但未知的特征分布所属的特征 - 标签分布的不确定性类上,以先验分布的形式使用先验知识,在无法进行完全无模型的准确错误估计的情况下,可以促进(在均方意义上)准确的错误估计。本文针对多元高斯模型中线性判别分析(LDA)情况下所得贝叶斯最小均方误差(MMSE)错误估计器的各种性能指标,提供了分析渐近精确的有限样本近似。这些性能指标包括贝叶斯MMSE错误估计器与LDA真实错误的一阶、二阶和交叉矩,因此也包括估计器的均方根(RMS)误差。我们为贝叶斯环境下的柯尔莫哥洛夫双渐近性奠定了理论基础,这使我们能够推导出所需性能指标的渐近表达式。由此我们得出分析有限样本近似,并通过数值示例证明其准确性。各种示例说明了这些近似的行为及其在确定实现所需RMS所需样本量方面的用途。补充材料包含一些方程的推导和补充图表。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/3f150e10f0d4/nihms555121f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/7f5b821ae9d0/nihms555121f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/1f46b6448d33/nihms555121f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/3f150e10f0d4/nihms555121f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/7f5b821ae9d0/nihms555121f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/1f46b6448d33/nihms555121f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f612/3979595/3f150e10f0d4/nihms555121f3.jpg

相似文献

1
Moments and Root-Mean-Square Error of the Bayesian MMSE Estimator of Classification Error in the Gaussian Model.高斯模型中分类误差的贝叶斯最小均方误差估计器的矩和均方根误差
Pattern Recognit. 2014 Jun 1;47(6):2178-2192. doi: 10.1016/j.patcog.2013.11.022.
2
On Kolmogorov Asymptotics of Estimators of the Misclassification Error Rate in Linear Discriminant Analysis.关于线性判别分析中误分类错误率估计量的柯尔莫哥洛夫渐近性
Sankhya Ser A. 2013 Aug 1;75(2). doi: 10.1007/s13171-013-0029-9.
3
Application of the Bayesian MMSE estimator for classification error to gene expression microarray data.贝叶斯 MMSE 估计器在基因表达微阵列数据分类误差中的应用。
Bioinformatics. 2011 Jul 1;27(13):1822-31. doi: 10.1093/bioinformatics/btr272. Epub 2011 May 5.
4
Unbiased bootstrap error estimation for linear discriminant analysis.线性判别分析的无偏自助法误差估计
EURASIP J Bioinform Syst Biol. 2014 Oct 3;2014:15. doi: 10.1186/s13637-014-0015-0. eCollection 2014 Dec.
5
Robust importance sampling for error estimation in the context of optimal Bayesian transfer learning.在最优贝叶斯迁移学习背景下用于误差估计的稳健重要性抽样。
Patterns (N Y). 2022 Jan 25;3(3):100428. doi: 10.1016/j.patter.2021.100428. eCollection 2022 Mar 11.
6
Bayesian estimation of the discrete coefficient of determination.离散决定系数的贝叶斯估计。
EURASIP J Bioinform Syst Biol. 2016 Jan 15;2016(1):1. doi: 10.1186/s13637-015-0035-4. eCollection 2016 Dec.
7
The illusion of distribution-free small-sample classification in genomics.基因组学中小样本分类的无分布假象。
Curr Genomics. 2011 Aug;12(5):333-41. doi: 10.2174/138920211796429763.
8
Incorporating prior knowledge induced from stochastic differential equations in the classification of stochastic observations.将由随机微分方程推导得出的先验知识纳入随机观测的分类中。
EURASIP J Bioinform Syst Biol. 2016 Jan 20;2016(1):2. doi: 10.1186/s13637-016-0036-y. eCollection 2016 Dec.
9
Insights Into the Robustness of Minimum Error Entropy Estimation.最小误差熵估计稳健性的研究进展
IEEE Trans Neural Netw Learn Syst. 2018 Mar;29(3):731-737. doi: 10.1109/TNNLS.2016.2636160. Epub 2016 Dec 22.
10
On optimal Bayesian classification and risk estimation under multiple classes.关于多类情况下的最优贝叶斯分类与风险估计。
EURASIP J Bioinform Syst Biol. 2015 Oct 24;2015(1):8. doi: 10.1186/s13637-015-0028-3. eCollection 2015 Dec.

引用本文的文献

1
A novel machine learning prediction model for metastasis in breast cancer.一种用于乳腺癌转移的新型机器学习预测模型。
Cancer Rep (Hoboken). 2024 Mar;7(3):e2006. doi: 10.1002/cnr2.2006.
2
Modeling and Composition Design of Low-Alloy Steel's Mechanical Properties Based on Neural Networks and Genetic Algorithms.基于神经网络和遗传算法的低合金钢力学性能建模与成分设计
Materials (Basel). 2020 Nov 24;13(23):5316. doi: 10.3390/ma13235316.
3
High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries.高维统计学习:根源、依据及潜在机制
Cancer Inform. 2016 Apr 12;14(Suppl 5):109-21. doi: 10.4137/CIN.S30804. eCollection 2015.
4
Incorporating prior knowledge induced from stochastic differential equations in the classification of stochastic observations.将由随机微分方程推导得出的先验知识纳入随机观测的分类中。
EURASIP J Bioinform Syst Biol. 2016 Jan 20;2016(1):2. doi: 10.1186/s13637-016-0036-y. eCollection 2016 Dec.

本文引用的文献

1
The illusion of distribution-free small-sample classification in genomics.基因组学中小样本分类的无分布假象。
Curr Genomics. 2011 Aug;12(5):333-41. doi: 10.2174/138920211796429763.