The receiver operating characteristic curve accurately assesses imbalanced datasets.

Authors

Richardson Eve, Trevizani Raphael, Greenbaum Jason A, Carter Hannah, Nielsen Morten, Peters Bjoern

Affiliations

Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA.

Fiocruz Ceará, Fundação Oswaldo Cruz, Rua São José s/n, Precabura, Eusébio/CE, Brazil.

Publication information

Patterns (N Y). 2024 May 31;5(6):100994. doi: 10.1016/j.patter.2024.100994. eCollection 2024 Jun 14.

DOI:10.1016/j.patter.2024.100994

PMID:39005487

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11240176/

Abstract

Many problems in biology require looking for a "needle in a haystack," corresponding to a binary classification where there are a few positives within a much larger set of negatives, which is referred to as a class imbalance. The receiver operating characteristic (ROC) curve and the associated area under the curve (AUC) have been reported as ill-suited to evaluate prediction performance on imbalanced problems where there is more interest in performance on the positive minority class, while the precision-recall (PR) curve is preferable. We show via simulation and a real case study that this is a misinterpretation of the difference between the ROC and PR spaces, showing that the ROC curve is robust to class imbalance, while the PR curve is highly sensitive to class imbalance. Furthermore, we show that class imbalance cannot be easily disentangled from classifier performance measured via PR-AUC.
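The contrast described in the abstract can be illustrated with a minimal simulation. This sketch is not the authors' simulation or case study: it assumes a hypothetical classifier whose scores follow N(1, 1) for positives and N(0, 1) for negatives, holds the number of positives fixed, and only increases the number of negatives. The helper names (`roc_auc`, `average_precision`, `simulate`) are illustrative, and average precision is used here as one common estimate of PR-AUC. Under these assumptions the theoretical ROC-AUC is Φ(1/√2) ≈ 0.76 regardless of the class ratio.

```python
import random

def roc_auc(labels, scores):
    """ROC-AUC via the Mann-Whitney U statistic:
    the probability that a random positive outscores a random negative."""
    pairs = sorted(zip(scores, labels))
    # 1-based ranks; scores are continuous, so ties are not handled specially.
    rank_sum_pos = sum(rank for rank, (_, y) in enumerate(pairs, start=1) if y == 1)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

def average_precision(labels, scores):
    """Average precision: mean of the precision measured at each positive,
    walking down the list from the highest score."""
    ranked = sorted(zip(scores, labels), reverse=True)
    hits, total = 0, 0.0
    for k, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / k
    return total / hits

def simulate(n_pos, n_neg, rng):
    """Assumed score model: positives ~ N(1, 1), negatives ~ N(0, 1)."""
    scores = [rng.gauss(1.0, 1.0) for _ in range(n_pos)]
    scores += [rng.gauss(0.0, 1.0) for _ in range(n_neg)]
    labels = [1] * n_pos + [0] * n_neg
    return labels, scores

rng = random.Random(0)
results = {}
for n_neg in (1_000, 10_000, 100_000):  # 1:1, 1:10, 1:100 positives to negatives
    labels, scores = simulate(1_000, n_neg, rng)
    results[n_neg] = (roc_auc(labels, scores), average_precision(labels, scores))
    print(f"negatives={n_neg:>7}  ROC-AUC={results[n_neg][0]:.3f}  AP={results[n_neg][1]:.3f}")
```

Because the classifier's score distributions are identical in all three runs, any change in average precision here comes from the class ratio alone, which is the sense in which imbalance is entangled with PR-based performance.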

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d7e/11240176/3909b126b797/fx1.jpg
