Classifier uncertainty: evidence, potential impact, and probabilistic treatment.

Authors

Tötsch Niklas, Hoffmann Daniel

Affiliation

Faculty of Biology, University of Duisburg-Essen, Essen, Germany.

Publication information

PeerJ Comput Sci. 2021 Mar 4;7:e398. doi: 10.7717/peerj-cs.398. eCollection 2021.

DOI: 10.7717/peerj-cs.398

PMID: 33817044

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7959610/

Abstract

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of classification performance metrics, based on a probability model of the confusion matrix. Application of our approach to classifiers from the scientific literature and a classification competition shows that uncertainties can be surprisingly large and limit performance evaluation. In fact, some published classifiers may be misleading. The application of our approach is simple and requires only the confusion matrix. It is agnostic of the underlying classifier. Our method can also be used for the estimation of sample sizes that achieve a desired precision of a performance metric.
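The abstract says the approach needs only the confusion matrix and rests on a probability model of it. The following is a minimal sketch of that idea, not the authors' implementation. It assumes a flat Dirichlet prior over the four cell probabilities of a binary confusion matrix, which may differ from the model used in the paper. The function names `metric_posteriors`, `credible_interval`, and `interval_width` and the example counts are hypothetical.

```python
import numpy as np


def metric_posteriors(tp, fn, tn, fp, n_samples=20000, seed=0):
    """Draw posterior samples of common metrics from a 2x2 confusion matrix.

    Assumption (not taken from the paper): the four counts are multinomial
    with a flat Dirichlet(1, 1, 1, 1) prior on the cell probabilities.
    """
    rng = np.random.default_rng(seed)
    counts = np.array([tp, fn, tn, fp], dtype=float)
    # Conjugate update: the posterior is Dirichlet(prior + counts).
    theta = rng.dirichlet(counts + 1.0, size=n_samples)
    p_tp, p_fn, p_tn, p_fp = theta.T
    return {
        "accuracy": p_tp + p_tn,
        "sensitivity": p_tp / (p_tp + p_fn),
        "specificity": p_tn / (p_tn + p_fp),
        "precision": p_tp / (p_tp + p_fp),
    }


def credible_interval(samples, level=0.95):
    """Equal-tailed credible interval computed from posterior samples."""
    tail = (1.0 - level) / 2.0
    low, high = np.quantile(samples, [tail, 1.0 - tail])
    return float(low), float(high)


def interval_width(samples, level=0.95):
    """Width of the credible interval, a simple measure of uncertainty."""
    low, high = credible_interval(samples, level)
    return high - low


if __name__ == "__main__":
    # Hypothetical test sets with the same cell proportions at two sizes.
    small = metric_posteriors(tp=18, fn=2, tn=15, fp=5)
    large = metric_posteriors(tp=180, fn=20, tn=150, fp=50)
    for name in small:
        print(name,
              "small:", credible_interval(small[name]),
              "large:", credible_interval(large[name]))
```

Running the same function on counts scaled to different test-set sizes gives a rough way to explore how many samples are needed before the interval for a chosen metric becomes acceptably narrow, in the spirit of the sample-size use mentioned in the abstract.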

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8226/7959610/c6fde6804704/peerj-cs-07-398-g001.jpg

Similar articles

Formal definition of the MARS method for quantifying the unique target class discoveries of selected machine classifiers.

F1000Res. 2022 Apr 4;11:391. doi: 10.12688/f1000research.110567.2. eCollection 2022.

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.

Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.

Probabilistic classifiers with high-dimensional data.

Biostatistics. 2011 Jul;12(3):399-412. doi: 10.1093/biostatistics/kxq069. Epub 2010 Nov 17.

Quantifying uncertainty in machine learning classifiers for medical imaging.

Int J Comput Assist Radiol Surg. 2022 Apr;17(4):711-718. doi: 10.1007/s11548-022-02578-3. Epub 2022 Mar 12.

Evaluation of performance metrics for histopathological image classifier optimization.

Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:1933-6. doi: 10.1109/EMBC.2014.6943990.

Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric.

PLoS One. 2017 Jun 2;12(6):e0177678. doi: 10.1371/journal.pone.0177678. eCollection 2017.

Automated Brain Tumour Detection and Classification using Deep Features and Bayesian Optimised Classifiers.

Curr Med Imaging. 2023 Mar 28. doi: 10.2174/1573405620666230328092218.

Comparing within-subject classification and regularization methods in fMRI for large and small sample sizes.

Hum Brain Mapp. 2014 Sep;35(9):4499-517. doi: 10.1002/hbm.22490. Epub 2014 Mar 17.

A probabilistic classifier ensemble weighting scheme based on cross-validated accuracy estimates.

Data Min Knowl Discov. 2019;33(6):1674-1709. doi: 10.1007/s10618-019-00638-y. Epub 2019 Jun 17.

Cited by

Do Treatment Choices by Artificial Intelligence Correspond to Reality? Retrospective Comparative Research with Necrotizing Enterocolitis as a Use Case.

Med Decis Making. 2025 May;45(4):449-461. doi: 10.1177/0272989X251324530. Epub 2025 Mar 12.

Development of a diagnostic support system for distal humerus fracture using artificial intelligence.

Int Orthop. 2024 May;48(5):1303-1311. doi: 10.1007/s00264-024-06125-4. Epub 2024 Mar 19.

Machine-learning vs. logistic regression for preoperative prediction of medical morbidity after fast-track hip and knee arthroplasty-a comparative study.

BMC Anesthesiol. 2023 Nov 29;23(1):391. doi: 10.1186/s12871-023-02354-z.

Review and assessment of Boolean approaches for inference of gene regulatory networks.

Heliyon. 2022 Aug 9;8(8):e10222. doi: 10.1016/j.heliyon.2022.e10222. eCollection 2022 Aug.

Framework for Testing Robustness of Machine Learning-Based Classifiers.

J Pers Med. 2022 Aug 14;12(8):1314. doi: 10.3390/jpm12081314.

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

PeerJ Comput Sci. 2021 Jul 5;7:e623. doi: 10.7717/peerj-cs.623. eCollection 2021.

The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation.

BioData Min. 2021 Feb 4;14(1):13. doi: 10.1186/s13040-021-00244-z.

References

PyMC: a modern, and comprehensive probabilistic programming framework in Python.

PeerJ Comput Sci. 2023 Sep 1;9:e1516. doi: 10.7717/peerj-cs.1516. eCollection 2023.

Ten quick tips for machine learning in computational biology.

BioData Min. 2017 Dec 8;10:35. doi: 10.1186/s13040-017-0155-3. eCollection 2017.

Transcription initiation complex structures elucidate DNA opening.

Nature. 2016 May 19;533(7603):353-8. doi: 10.1038/nature17990. Epub 2016 May 11.

Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Neural Comput. 1998 Sep 15;10(7):1895-1923. doi: 10.1162/089976698300017197.
