ROCS：针对类别倾斜的高通量数据的接收者操作特征曲面。

ROCS: receiver operating characteristic surface for class-skewed high-throughput data.

机构信息

Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, Georgia, United States of America.

出版信息

PLoS One. 2012;7(7):e40598. doi: 10.1371/journal.pone.0040598. Epub 2012 Jul 6.

DOI:10.1371/journal.pone.0040598

PMID:22792381

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3391298/

Abstract

The receiver operating characteristic (ROC) curve is an important tool to gauge the performance of classifiers. In certain situations of high-throughput data analysis, the data is heavily class-skewed, i.e. most features tested belong to the true negative class. In such cases, only a small portion of the ROC curve is relevant in practical terms, rendering the ROC curve and its area under the curve (AUC) insufficient for the purpose of judging classifier performance. Here we define an ROC surface (ROCS) using true positive rate (TPR), false positive rate (FPR), and true discovery rate (TDR). The ROC surface, together with the associated quantities, volume under the surface (VUS) and FDR-controlled area under the ROC curve (FCAUC), provide a useful approach for gauging classifier performance on class-skewed high-throughput data. The implementation as an R package is available at http://userwww.service.emory.edu/~tyu8/ROCS/.

摘要

受试者工作特征（ROC）曲线是评估分类器性能的重要工具。在高通量数据分析的某些情况下，数据严重偏向于某一类，即大多数测试的特征属于真正的阴性类。在这种情况下，ROC 曲线及其下面积（AUC）在实际应用中只有很小的一部分是相关的，这使得 ROC 曲线及其下面积不足以用于判断分类器的性能。在这里，我们使用真阳性率（TPR）、假阳性率（FPR）和真发现率（TDR）定义了一个 ROC 曲面（ROCS）。ROC 曲面及其相关量，曲面下的体积（VUS）和 FDR 控制的 ROC 曲线下面积（FCAUC），为在偏向于某一类的高通量数据上评估分类器的性能提供了一种有用的方法。该实现作为一个 R 包可在 http://userwww.service.emory.edu/~tyu8/ROCS/ 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f276/3391298/2dfa7f153d0c/pone.0040598.g001.jpg

相似文献

ROCS: receiver operating characteristic surface for class-skewed high-throughput data.

PLoS One. 2012;7(7):e40598. doi: 10.1371/journal.pone.0040598. Epub 2012 Jul 6.

Performance analysis of three-class classifiers: properties of a 3-D ROC surface and the normalized volume under the surface for the ideal observer.

IEEE Trans Med Imaging. 2008 Feb;27(2):215-27. doi: 10.1109/TMI.2007.905822.

Two-way partial AUC and its properties.

Stat Methods Med Res. 2019 Jan;28(1):184-195. doi: 10.1177/0962280217718866. Epub 2017 Jul 14.

A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval.

Bioinformatics. 2010 May 15;26(10):1348-56. doi: 10.1093/bioinformatics/btq140. Epub 2010 Apr 7.

An extension of the receiver operating characteristic curve and AUC-optimal classification.

Neural Comput. 2012 Oct;24(10):2789-824. doi: 10.1162/NECO_a_00336. Epub 2012 Jun 26.

Receiver operating characteristic analysis under tree orderings of disease classes.

Stat Med. 2016 May 20;35(11):1907-26. doi: 10.1002/sim.6843. Epub 2015 Dec 17.

Direct estimation of volume under the ROC surface with verification bias.

J Biopharm Stat. 2024 Jul 3;34(4):553-581. doi: 10.1080/10543406.2023.2236202. Epub 2023 Jul 20.

Smooth non-parametric receiver operating characteristic (ROC) curves for continuous diagnostic tests.

Stat Med. 1997 Oct 15;16(19):2143-56. doi: 10.1002/(sici)1097-0258(19971015)16:19<2143::aid-sim655>3.0.co;2-3.

Smooth ROC curve estimation via Bernstein polynomials.

PLoS One. 2021 May 25;16(5):e0251959. doi: 10.1371/journal.pone.0251959. eCollection 2021.

Constructing "proper" ROCs from ordinal response data using weighted power functions.

Med Decis Making. 2014 May;34(4):523-35. doi: 10.1177/0272989X13503046. Epub 2013 Sep 12.

引用本文的文献

Parameter Reduction and Optimisation for Point Cloud and Occupancy Mapping Algorithms.

Sensors (Basel). 2021 Oct 22;21(21):7004. doi: 10.3390/s21217004.

Contra: Contrarian statistics for controlled variable selection.

Proc Mach Learn Res. 2021 Apr;130:1900-1908.

Validity and Reliability Aspects of a Newly Developed Questionnaire for Auditory Localization.

J Int Adv Otol. 2019 Apr;15(1):182-183. doi: 10.5152/iao.2019.5959.

Host Taxon Predictor - A Tool for Predicting Taxon of the Host of a Newly Discovered Virus.

Sci Rep. 2019 Mar 5;9(1):3436. doi: 10.1038/s41598-019-39847-2.

Partner-specific prediction of RNA-binding residues in proteins: A critical assessment.

Proteins. 2019 Mar;87(3):198-211. doi: 10.1002/prot.25639. Epub 2018 Dec 30.

Mitigating the adverse impact of batch effects in sample pattern detection.

Bioinformatics. 2018 Aug 1;34(15):2634-2641. doi: 10.1093/bioinformatics/bty117.

bcROCsurface: an R package for correcting verification bias in estimation of the ROC surface and its volume for continuous diagnostic tests.

BMC Bioinformatics. 2017 Nov 18;18(1):503. doi: 10.1186/s12859-017-1914-3.

ROC Curve Analysis in the Presence of Imperfect Reference Standards.

Stat Biosci. 2017 Jun;9(1):91-104. doi: 10.1007/s12561-016-9159-7. Epub 2016 Jul 19.

MEG Connectivity and Power Detections with Minimum Norm Estimates Require Different Regularization Parameters.

Comput Intell Neurosci. 2016;2016:3979547. doi: 10.1155/2016/3979547. Epub 2016 Mar 22.

Improving peak detection in high-resolution LC/MS metabolomics data using preexisting knowledge and machine learning approach.

Bioinformatics. 2014 Oct 15;30(20):2941-8. doi: 10.1093/bioinformatics/btu430. Epub 2014 Jul 7.

本文引用的文献

Caveats and pitfalls of ROC analysis in clinical microarray research (and how to avoid them).

Brief Bioinform. 2012 Jan;13(1):83-97. doi: 10.1093/bib/bbr008. Epub 2011 Mar 21.

pROC: an open-source package for R and S+ to analyze and compare ROC curves.

BMC Bioinformatics. 2011 Mar 17;12:77. doi: 10.1186/1471-2105-12-77.

A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval.

Bioinformatics. 2010 May 15;26(10):1348-56. doi: 10.1093/bioinformatics/btq140. Epub 2010 Apr 7.

Receiver-operating characteristic curve analysis in diagnostic, prognostic and predictive biomarker research.

J Clin Pathol. 2009 Jan;62(1):1-5. doi: 10.1136/jcp.2008.061010. Epub 2008 Sep 25.

BGX: a Bioconductor package for the Bayesian integrated analysis of Affymetrix GeneChips.

BMC Bioinformatics. 2007 Nov 12;8:439. doi: 10.1186/1471-2105-8-439.

A forward-backward fragment assembling algorithm for the identification of genomic amplification and deletion breakpoints using high-density single nucleotide polymorphism (SNP) array.

BMC Bioinformatics. 2007 May 3;8:145. doi: 10.1186/1471-2105-8-145.

The partial area under the summary ROC curve.

Stat Med. 2005 Jul 15;24(13):2025-40. doi: 10.1002/sim.2103.

Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset.

Genome Biol. 2005;6(2):R16. doi: 10.1186/gb-2005-6-2-r16. Epub 2005 Jan 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

ROCS：针对类别倾斜的高通量数据的接收者操作特征曲面。

ROCS: receiver operating characteristic surface for class-skewed high-throughput data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献