Deconstructing Cross-Entropy for Probabilistic Binary Classifiers

Authors

Ramos Daniel, Franco-Pedroso Javier, Lozano-Diez Alicia, Gonzalez-Rodriguez Joaquin

Affiliation

AuDIaS-Audio, Data Intelligence and Speech, Escuela Politecnica Superior, Universidad Autonoma de Madrid, Calle Francisco Tomas y Valiente 11, 28049 Madrid, Spain.

Publication information

Entropy (Basel). 2018 Mar 20;20(3):208. doi: 10.3390/e20030208.

DOI:10.3390/e20030208

PMID:33265299

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7512723/

Abstract

In this work, we analyze the cross-entropy function, widely used in classifiers both as a performance measure and as an optimization objective. We contextualize cross-entropy in the light of Bayesian decision theory, the formal probabilistic framework for making decisions, and we thoroughly analyze its motivation, meaning and interpretation from an information-theoretical point of view. In this sense, this article presents several contributions: First, we explicitly analyze the contribution to cross-entropy of (i) prior knowledge; and (ii) the value of the features in the form of a likelihood ratio. Second, we introduce a decomposition of cross-entropy into two components: discrimination and calibration. This decomposition enables the measurement of different performance aspects of a classifier in a more precise way; and justifies previously reported strategies to obtain reliable probabilities by means of the calibration of the output of a discriminating classifier. Third, we give different information-theoretical interpretations of cross-entropy, which can be useful in different application scenarios, and which are related to the concept of reference probabilities. Fourth, we present an analysis tool, the Empirical Cross-Entropy (ECE) plot, a compact representation of cross-entropy and its aforementioned decomposition. We show the power of ECE plots, as compared to other classical performance representations, in two diverse experimental examples: a speaker verification system, and a forensic case where some glass findings are present.
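The roles of the prior and the likelihood ratio described in the abstract can be illustrated with a minimal sketch. The abstract does not state a formula, so the expression used here is an assumption: it is the prior-weighted average of the negative log posterior probability of the true class, with the posterior obtained from Bayes' rule in log-odds form (posterior log-odds = log-likelihood ratio + prior log-odds). The function names `empirical_cross_entropy` and `neutral_reference` and the toy log-likelihood ratios are hypothetical and are not taken from the paper.

```python
import math


def _softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def empirical_cross_entropy(llrs_h1, llrs_h2, prior_h1):
    """Cross-entropy in bits at a given prior P(H1).

    llrs_h1: natural-log likelihood ratios of trials where H1 is true.
    llrs_h2: natural-log likelihood ratios of trials where H2 is true.
    (Assumed form; the abstract itself gives no formula.)
    """
    if not 0.0 < prior_h1 < 1.0:
        raise ValueError("prior_h1 must lie strictly between 0 and 1")
    prior_log_odds = math.log(prior_h1 / (1.0 - prior_h1))
    # -log P(H1 | x) = softplus(-(llr + prior_log_odds)) when H1 is true
    cost_h1 = sum(_softplus(-(l + prior_log_odds)) for l in llrs_h1) / len(llrs_h1)
    # -log P(H2 | x) = softplus(llr + prior_log_odds) when H2 is true
    cost_h2 = sum(_softplus(l + prior_log_odds) for l in llrs_h2) / len(llrs_h2)
    nats = prior_h1 * cost_h1 + (1.0 - prior_h1) * cost_h2
    return nats / math.log(2.0)  # convert nats to bits


def neutral_reference(prior_h1):
    """Cross-entropy of a classifier that always outputs LR = 1 (llr = 0)."""
    return empirical_cross_entropy([0.0], [0.0], prior_h1)


if __name__ == "__main__":
    # Hypothetical log-likelihood ratios, for illustration only.
    target_llrs = [2.0, 3.5, 0.5, 4.0]
    non_target_llrs = [-3.0, -1.0, -2.5, 0.3]
    for prior in (0.1, 0.5, 0.9):
        ece = empirical_cross_entropy(target_llrs, non_target_llrs, prior)
        ref = neutral_reference(prior)
        print(f"P(H1)={prior:.1f}  ECE={ece:.3f} bits  neutral={ref:.3f} bits")
```

Evaluating the function over a range of priors gives the kind of curve an ECE plot displays. The discrimination and calibration components mentioned in the abstract would additionally require recomputing the same quantity on optimally recalibrated likelihood ratios, which this sketch leaves out.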

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50d3/7512723/3521936f7fbb/entropy-20-00208-g001.jpg
