StAR：一种用于ROC曲线统计比较的简单工具。

StAR: a simple tool for the statistical comparison of ROC curves.

作者信息

Vergara Ismael A, Norambuena Tomás, Ferrada Evandro, Slater Alex W, Melo Francisco

机构信息

Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile.

出版信息

BMC Bioinformatics. 2008 Jun 5;9:265. doi: 10.1186/1471-2105-9-265.

DOI:10.1186/1471-2105-9-265

PMID:18534022

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2435548/

Abstract

BACKGROUND

As in many different areas of science and technology, most important problems in bioinformatics rely on the proper development and assessment of binary classifiers. A generalized assessment of the performance of binary classifiers is typically carried out through the analysis of their receiver operating characteristic (ROC) curves. The area under the ROC curve (AUC) constitutes a popular indicator of the performance of a binary classifier. However, the assessment of the statistical significance of the difference between any two classifiers based on this measure is not a straightforward task, since not many freely available tools exist. Most existing software is either not free, difficult to use or not easy to automate when a comparative assessment of the performance of many binary classifiers is intended. This constitutes the typical scenario for the optimization of parameters when developing new classifiers and also for their performance validation through the comparison to previous art.

RESULTS

In this work we describe and release new software to assess the statistical significance of the observed difference between the AUCs of any two classifiers for a common task estimated from paired data or unpaired balanced data. The software is able to perform a pairwise comparison of many classifiers in a single run, without requiring any expert or advanced knowledge to use it. The software relies on a non-parametric test for the difference of the AUCs that accounts for the correlation of the ROC curves. The results are displayed graphically and can be easily customized by the user. A human-readable report is generated and the complete data resulting from the analysis are also available for download, which can be used for further analysis with other software. The software is released as a web server that can be used in any client platform and also as a standalone application for the Linux operating system.

CONCLUSION

A new software for the statistical comparison of ROC curves is released here as a web server and also as standalone software for the LINUX operating system.

摘要

背景

与许多不同的科学技术领域一样，生物信息学中的大多数重要问题都依赖于二元分类器的正确开发和评估。二元分类器性能的一般评估通常通过分析其接收器操作特征（ROC）曲线来进行。ROC曲线下的面积（AUC）是二元分类器性能的一个常用指标。然而，基于此度量评估任意两个分类器之间差异的统计显著性并非易事，因为可用的免费工具不多。当要对许多二元分类器的性能进行比较评估时，大多数现有软件要么不免费，要么难以使用，要么不容易自动化。这是开发新分类器时参数优化以及通过与现有技术比较进行性能验证的典型情况。

结果

在这项工作中，我们描述并发布了新软件，用于评估从配对数据或未配对平衡数据估计的常见任务中任意两个分类器的AUC之间观察到的差异的统计显著性。该软件能够在一次运行中对许多分类器进行成对比较，无需任何专业或高级知识即可使用。该软件依赖于一种用于AUC差异的非参数检验，该检验考虑了ROC曲线的相关性。结果以图形方式显示，用户可以轻松定制。生成一份可读的报告，分析产生的完整数据也可供下载，可用于与其他软件进行进一步分析。该软件作为一个网络服务器发布，可在任何客户端平台上使用，也作为Linux操作系统的独立应用程序发布。

结论

这里发布了一种用于ROC曲线统计比较的新软件，作为网络服务器以及Linux操作系统的独立软件。

相似文献

StAR: a simple tool for the statistical comparison of ROC curves.StAR：一种用于ROC曲线统计比较的简单工具。

BMC Bioinformatics. 2008 Jun 5;9:265. doi: 10.1186/1471-2105-9-265.

A global goodness-of-fit test for receiver operating characteristic curve analysis via the bootstrap method.一种通过自助法进行受试者工作特征曲线分析的全局拟合优度检验。

J Biomed Inform. 2005 Oct;38(5):395-403. doi: 10.1016/j.jbi.2005.02.004. Epub 2005 Mar 9.

pROC: an open-source package for R and S+ to analyze and compare ROC curves.pROC：一个用于 R 和 S+的开源软件包，用于分析和比较 ROC 曲线。

BMC Bioinformatics. 2011 Mar 17;12:77. doi: 10.1186/1471-2105-12-77.

Minimum-norm estimation for binormal receiver operating characteristic (ROC) curves.双正态接收器操作特征（ROC）曲线的最小范数估计

Biom J. 2009 Dec;51(6):1030-46. doi: 10.1002/bimj.200900128.

Verification of modified receiver-operating characteristic software using simulated rating data.使用模拟评级数据验证改良的接收者操作特征软件。

Radiol Phys Technol. 2018 Dec;11(4):406-414. doi: 10.1007/s12194-018-0479-9. Epub 2018 Sep 22.

A bivariate contaminated binormal model for robust fitting of proper ROC curves to a pair of correlated, possibly degenerate, ROC datasets.一种双变量污染的双正态模型，用于稳健拟合一对相关的、可能退化的 ROC 数据集的合适 ROC 曲线。

Med Phys. 2017 Jun;44(6):2207-2222. doi: 10.1002/mp.12263. Epub 2017 May 18.

A program for computing the prediction probability and the related receiver operating characteristic graph.一个用于计算预测概率和相关受试者工作特征曲线的程序。

Anesth Analg. 2010 Dec;111(6):1416-21. doi: 10.1213/ANE.0b013e3181fb919e. Epub 2010 Nov 8.

Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves.诊断分类器的多目标遗传优化及其对生成受试者工作特征曲线的意义

IEEE Trans Med Imaging. 1999 Aug;18(8):675-85. doi: 10.1109/42.796281.

Estimation of the ROC curve under verification bias.验证性偏倚下ROC曲线的估计

Biom J. 2009 Jun;51(3):475-90. doi: 10.1002/bimj.200800128.

Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.用于计算机辅助诊断的分类器设计：有限样本量对经典分类器和神经网络分类器平均性能的影响。

Med Phys. 1999 Dec;26(12):2654-68. doi: 10.1118/1.598805.

引用本文的文献

ELW-CNN: An extremely lightweight convolutional neural network for enhancing interoperability in colon and lung cancer identification using explainable AI.ELW-CNN：一种超轻量级卷积神经网络，用于借助可解释人工智能提高结肠癌和肺癌识别中的互操作性。

Healthc Technol Lett. 2025 Jan 22;12(1):e12122. doi: 10.1049/htl2.12122. eCollection 2025 Jan-Dec.

Discriminating Parkinson's disease patients from healthy controls using nasal respiratory airflow.利用鼻腔呼吸气流区分帕金森病患者与健康对照。

Commun Med (Lond). 2024 Nov 14;4(1):233. doi: 10.1038/s43856-024-00660-2.

Utility of diffusion-weighted imaging in differentiating benign and malignant breast lesions.扩散加权成像在鉴别乳腺良恶性病变中的应用价值。

SA J Radiol. 2024 Oct 9;28(1):2952. doi: 10.4102/sajr.v28i1.2952. eCollection 2024.

How Effective is the Systemic Inflammatory Immune Index in the Etiopathogenesis of Isolated Coronary Artery Ectasia?Reply.全身炎症免疫指数在孤立性冠状动脉扩张病因学中的有效性如何？回复。

Arq Bras Cardiol. 2023 Sep 4;120(7):e20230048. doi: 10.36660/abc.20230048. eCollection 2023.

Predictive value of abbreviated olfactory tests in prodromal Parkinson disease.简化嗅觉测试在前驱期帕金森病中的预测价值。

NPJ Parkinsons Dis. 2023 Jun 29;9(1):103. doi: 10.1038/s41531-023-00530-z.

Direct Gene Expression Profile Prediction for Uveal Melanoma from Digital Cytopathology Images via Deep Learning and Salient Image Region Identification.通过深度学习和显著图像区域识别从数字细胞病理学图像预测葡萄膜黑色素瘤的直接基因表达谱

Ophthalmol Sci. 2022 Oct 30;3(1):100240. doi: 10.1016/j.xops.2022.100240. eCollection 2023 Mar.

Accurate and generalizable quantitative scoring of liver steatosis from ultrasound images scalable deep learning.基于可扩展深度学习的超声图像肝脂肪变性准确且可推广的定量评分

World J Gastroenterol. 2022 Jun 14;28(22):2494-2508. doi: 10.3748/wjg.v28.i22.2494.

PEGALUS: predictivity of elderly age, arterial gas analysis, and lung ultrasound. A new prognostic score for COVID-19 patients in the emergency department-an observational prospective study.PEGALUS：老年预测、动脉血气分析和肺部超声。急诊 COVID-19 患者的新预后评分：一项观察性前瞻性研究。

Intern Emerg Med. 2022 Nov;17(8):2357-2365. doi: 10.1007/s11739-022-03047-0. Epub 2022 Jul 27.

Integrated structure-based protein interface prediction.基于结构的蛋白质界面整体预测。

BMC Bioinformatics. 2022 Jul 25;23(1):301. doi: 10.1186/s12859-022-04852-2.

An olfactory self-test effectively screens for COVID-19.嗅觉自我检测可有效筛查新冠病毒。

Commun Med (Lond). 2022 Apr 5;2:34. doi: 10.1038/s43856-022-00095-7. eCollection 2022.

本文引用的文献

A knowledge-based potential with an accurate description of local interactions improves discrimination between native and near-native protein conformations.一种能准确描述局部相互作用的基于知识的势，可提高对天然和近天然蛋白质构象之间的区分能力。

Cell Biochem Biophys. 2007;49(2):111-24. doi: 10.1007/s12013-007-0050-5.

Fold assessment for comparative protein structure modeling.用于比较蛋白质结构建模的折叠评估

Protein Sci. 2007 Nov;16(11):2412-26. doi: 10.1110/ps.072895107. Epub 2007 Sep 28.

Nonbonded terms extrapolated from nonlocal knowledge-based energy functions improve error detection in near-native protein structure models.从基于非局部知识的能量函数外推得到的非键合项可改善近天然蛋白质结构模型中的错误检测。

Protein Sci. 2007 Jul;16(7):1410-21. doi: 10.1110/ps.062735907.

Monte Carlo validation of the Dorfman-Berbaum-Metz method using normalized pseudovalues and less data-based model simplification.使用归一化伪值和基于较少数据的模型简化对多夫曼-贝鲍姆-梅茨方法进行蒙特卡洛验证。

Acad Radiol. 2005 Dec;12(12):1534-41. doi: 10.1016/j.acra.2005.07.012.

A comparison of the Dorfman-Berbaum-Metz and Obuchowski-Rockette methods for receiver operating characteristic (ROC) data.多夫曼-贝鲍姆-梅茨法与奥布霍夫斯基-罗凯特法用于接受者操作特征（ROC）数据的比较。

Stat Med. 2005 May 30;24(10):1579-607. doi: 10.1002/sim.2024.

Global protein function prediction from protein-protein interaction networks.基于蛋白质-蛋白质相互作用网络的全局蛋白质功能预测

Nat Biotechnol. 2003 Jun;21(6):697-700. doi: 10.1038/nbt825. Epub 2003 May 12.

Comparison of eight computer programs for receiver-operating characteristic analysis.八种用于接受者操作特征分析的计算机程序的比较

Clin Chem. 2003 Mar;49(3):433-9. doi: 10.1373/49.3.433.

Better decisions through science.借助科学，做出更优决策。

Sci Am. 2000 Oct;283(4):82-7. doi: 10.1038/scientificamerican1000-82.

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.基因本体论：生物学统一工具。基因本体论联合会。

Nat Genet. 2000 May;25(1):25-9. doi: 10.1038/75556.

Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoring.通过基因组DNA与蛋白质序列的剪接比对进行基因结构预测：通过差异剪接位点评分提高准确性。

J Mol Biol. 2000 Apr 14;297(5):1075-85. doi: 10.1006/jmbi.2000.3641.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验