• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分类算法评估:基于混淆矩阵的五种性能度量

Classification-algorithm evaluation: five performance measures based on confusion matrices.

作者信息

Forbes A D

机构信息

Medical Department, Hewlett-Packard Laboratories, Palo Alto, CA 94303-0867, USA.

出版信息

J Clin Monit. 1995 May;11(3):189-206. doi: 10.1007/BF01617722.

DOI:10.1007/BF01617722
PMID:7623060
Abstract

OBJECTIVE

The objective of this paper is to introduce, explain, and extend methods for comparing the performance of classification algorithms using error tallies obtained on properly sized, populated, and labeled data sets.

METHODS

Two distinct contexts of classification are defined, involving "objects-by-inspection" and "objects-by-segmentation." In the former context, the total number of objects to be classified is unambiguously and self-evidently defined. In the latter, there is troublesome ambiguity. All five of the measures of performance here considered are based on confusion matrices, tables of counts revealing the extent of an algorithm's "confusion" regarding the true classifications. A proper measure of classification-algorithm performance must meet four requirements. A proper measure should obey six additional constraints.

RESULTS

Four traditional measures of performance are critiqued in terms of the requirements and constraints. Each measure meets the requirements, but fails to obey at least one of the constraints. A nontraditional measure of algorithm performance, the normalized mutual information (NMI), is therefore introduced. Based on the NMI, methods for comparing algorithm performance using confusion matrices are devised.

CONCLUSIONS

The five performance measures lead to similar inferences when comparing a trio of QRS-detection algorithms using a large data set. The modified NMI is preferred, however, because it obeys each of the constraints and is the most conservative measure of performance.

摘要

目的

本文的目的是介绍、解释并扩展一些方法,这些方法用于使用在大小合适、数据充实且带有标签的数据集上获得的错误计数来比较分类算法的性能。

方法

定义了两种不同的分类情境,分别涉及“逐个检查对象”和“逐个分割对象”。在前一种情境中,要分类的对象总数是明确且不言而喻地定义的。而在后一种情境中,存在麻烦的模糊性。这里所考虑的所有五种性能度量都是基于混淆矩阵的,混淆矩阵是一种计数表,揭示了算法在真实分类方面的“混淆”程度。一种合适的分类算法性能度量必须满足四个要求。一种合适的度量还应遵循另外六个约束条件。

结果

从这些要求和约束条件的角度对四种传统的性能度量进行了批判。每种度量都满足了要求,但至少未能遵循其中一个约束条件。因此,引入了一种非传统的算法性能度量,即归一化互信息(NMI)。基于NMI,设计了使用混淆矩阵来比较算法性能的方法。

结论

当使用一个大数据集比较三种QRS检测算法时,这五种性能度量会得出相似的推断。然而,改进后的NMI更受青睐,因为它遵循每个约束条件,并且是最保守的性能度量。

相似文献

1
Classification-algorithm evaluation: five performance measures based on confusion matrices.分类算法评估:基于混淆矩阵的五种性能度量
J Clin Monit. 1995 May;11(3):189-206. doi: 10.1007/BF01617722.
2
QRS detection based ECG quality assessment.基于 QRS 检测的心电图质量评估。
Physiol Meas. 2012 Sep;33(9):1449-61. doi: 10.1088/0967-3334/33/9/1449. Epub 2012 Aug 17.
3
Performance of an open-source heart sound segmentation algorithm on eight independent databases.开源心音分割算法在八个独立数据库上的性能。
Physiol Meas. 2017 Aug 1;38(8):1730-1745. doi: 10.1088/1361-6579/aa6e9f.
4
[Specific features of QRS-complex identification algorithms for real-time ECG systems].[实时心电图系统中QRS波群识别算法的具体特征]
Med Tekh. 2001 Nov-Dec(6):18-23.
5
A lightweight QRS detector for single lead ECG signals using a max-min difference algorithm.一种使用极大极小差分算法的单导联 ECG 信号的轻量级 QRS 检测器。
Comput Methods Programs Biomed. 2017 Jun;144:61-75. doi: 10.1016/j.cmpb.2017.02.028. Epub 2017 Mar 18.
6
Evaluation of an algorithm based on single-condition decision rules for binary classification of 12-lead ambulatory ECG recording quality.基于单条件决策规则的算法对 12 导联动态心电图记录质量进行二分类的评估。
Physiol Meas. 2012 Sep;33(9):1435-48. doi: 10.1088/0967-3334/33/9/1435. Epub 2012 Aug 17.
7
Redefining performance evaluation tools for real-time QRS complex classification systems.重新定义用于实时QRS波群分类系统的性能评估工具。
IEEE Trans Biomed Eng. 2007 Sep;54(9):1706-10. doi: 10.1109/TBME.2007.902594.
8
An algorithm for sleep apnea detection from single-lead ECG using Hermite basis functions.一种使用埃尔米特基函数从单导联心电图检测睡眠呼吸暂停的算法。
Comput Biol Med. 2016 Oct 1;77:116-24. doi: 10.1016/j.compbiomed.2016.08.012. Epub 2016 Aug 13.
9
Evaluation of real-time QRS detection algorithms in variable contexts.可变环境下实时QRS检测算法的评估
Med Biol Eng Comput. 2005 May;43(3):379-85. doi: 10.1007/BF02345816.
10
Feature selection of fMRI data based on normalized mutual information and fisher discriminant ratio.基于归一化互信息和Fisher判别比的功能磁共振成像数据特征选择
J Xray Sci Technol. 2016 Mar 17;24(3):467-75. doi: 10.3233/XST-160565.

引用本文的文献

1
A GIS Approach to Modeling the Ecological Niche of an Ecotype of (Michx.) Torr. in Mexican Grasslands.一种利用地理信息系统(GIS)对墨西哥草原上的(米乔克斯)托尔生态型生态位进行建模的方法。
Plants (Basel). 2025 Jul 8;14(14):2090. doi: 10.3390/plants14142090.
2
A Multimorbidity Analysis of Hospitalized Patients With COVID-19 in Northwest Italy: Longitudinal Study Using Evolutionary Machine Learning and Health Administrative Data.意大利西北部 COVID-19 住院患者的多种合并症分析:使用进化机器学习和健康行政数据的纵向研究。
JMIR Public Health Surveill. 2024 Jul 18;10:e52353. doi: 10.2196/52353.
3
Emotion regulation in bipolar disorder type-I: multivariate analysis of fMRI data.

本文引用的文献

1
A decision theory approach to the approximation of discrete probability densities.一种用于离散概率密度逼近的决策理论方法。
IEEE Trans Pattern Anal Mach Intell. 1980 Jan;2(1):61-7. doi: 10.1109/tpami.1980.4766971.
I型双相情感障碍中的情绪调节:功能磁共振成像数据的多变量分析
Int J Bipolar Disord. 2023 Mar 25;11(1):12. doi: 10.1186/s40345-023-00292-w.
4
Near-infrared spectroscopy for early selection of waxy cassava clones via seed analysis.通过种子分析利用近红外光谱技术早期筛选木薯蜡质品种
Front Plant Sci. 2023 Jan 23;14:1089759. doi: 10.3389/fpls.2023.1089759. eCollection 2023.
5
PToPI: A Comprehensive Review, Analysis, and Knowledge Representation of Binary Classification Performance Measures/Metrics.PToPI:二元分类性能度量/指标的全面综述、分析与知识表示
SN Comput Sci. 2023;4(1):13. doi: 10.1007/s42979-022-01409-1. Epub 2022 Oct 16.
6
Deep_KsuccSite: A novel deep learning method for the identification of lysine succinylation sites.深度赖氨酸琥珀酰化位点:一种用于识别赖氨酸琥珀酰化位点的新型深度学习方法。
Front Genet. 2022 Sep 29;13:1007618. doi: 10.3389/fgene.2022.1007618. eCollection 2022.
7
The influence of data characteristics on detecting wetland/stream surface-water connections in the Delmarva Peninsula, Maryland and Delaware.数据特征对马里兰州和特拉华州德尔马瓦半岛湿地/溪流地表水连通性检测的影响。
Wetl Ecol Manag. 2017 Jun 8;26(1):63-86. doi: 10.1007/s11273-017-9554-y.
8
An analysis of Koreans' attitudes towards migrants by application of algorithmic approaches.运用算法方法对韩国人对移民的态度进行分析。
Heliyon. 2022 Aug 12;8(8):e10087. doi: 10.1016/j.heliyon.2022.e10087. eCollection 2022 Aug.
9
Wetlands inform how climate extremes influence surface water expansion and contraction.湿地揭示了极端气候如何影响地表水的扩张和收缩。
Hydrol Earth Syst Sci. 2018 Mar 15;22(3):1851-1873. doi: 10.5194/hess-22-1851-2018.
10
Cross-ECV consistency at global scale: LAI and FAPAR changes.全球尺度上的交叉有效叶面积指数(ECV)一致性:叶面积指数(LAI)和光合有效辐射吸收比例(FAPAR)变化
Remote Sens Environ. 2021 Sep 15;263:112561. doi: 10.1016/j.rse.2021.112561.