准备好进行ROC分析了吗？关于基于模拟的功效分析的教程，用于ROC曲线分析的零假设显著性检验、最小效应检验和等效性检验。

Ready to ROC? A tutorial on simulation-based power analyses for null hypothesis significance, minimum-effect, and equivalence testing for ROC curve analyses.

作者信息

Riesthuis Paul, Otgaar Henry, Bücken Charlotte

机构信息

Faculty of Law and Criminology, KU Leuven, Leuven, Belgium.

Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands.

出版信息

Behav Res Methods. 2025 Mar 18;57(4):120. doi: 10.3758/s13428-025-02646-x.

DOI:10.3758/s13428-025-02646-x

PMID:40102332

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11920309/

Abstract

The receiver operating characteristic (ROC) curve and its corresponding (partial) area under the curve (AUC) are frequently used statistical tools in psychological research to assess the discriminability of a test, method, intervention, or procedure. In this paper, we provide a tutorial on conducting simulation-based power analyses for ROC curve and (p)AUC analyses in R. We also created a Shiny app and the R package "ROCpower" to perform such power analyses. In our tutorial, we highlight the importance of setting the smallest effect size of interest (SESOI) for which researchers want to conduct their power analysis. The SESOI is the smallest effect that is practically or theoretically relevant for a specific field of research or study. We provide how such a SESOI can be established and how it changes hypotheses from simply establishing whether there is a statistically significant effect (i.e., null-hypothesis significance testing) to whether the effects are practically or theoretically important (i.e., minimum-effect testing) or whether the effect is too small to care about (i.e., equivalence testing). We show how power analyses for these different hypothesis tests can be conducted via a confidence interval-focused approach. This confidence interval-focused, simulation-based power analysis can be adapted to different research designs and questions and improves the reproducibility of power analyses.

摘要

接受者操作特征（ROC）曲线及其相应的曲线下（部分）面积（AUC）是心理学研究中常用的统计工具，用于评估测试、方法、干预或程序的辨别力。在本文中，我们提供了一个关于在R中对ROC曲线和（p）AUC分析进行基于模拟的功效分析的教程。我们还创建了一个Shiny应用程序和R包“ROCpower”来执行此类功效分析。在我们的教程中，我们强调了设定研究者想要进行功效分析的最小效应量感兴趣值（SESOI）的重要性。SESOI是对特定研究领域在实际或理论上相关的最小效应。我们介绍了如何建立这样一个SESOI，以及它如何将假设从简单地确定是否存在统计学上显著的效应（即零假设显著性检验）转变为效应在实际或理论上是否重要（即最小效应检验），或者效应是否小到可以忽略不计（即等效性检验）。我们展示了如何通过一种以置信区间为重点的方法对这些不同的假设检验进行功效分析。这种以置信区间为重点、基于模拟的功效分析可以适应不同的研究设计和问题，并提高功效分析的可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/fcd2f908f06a/13428_2025_2646_Fig1_HTML.jpg

相似文献

Ready to ROC? A tutorial on simulation-based power analyses for null hypothesis significance, minimum-effect, and equivalence testing for ROC curve analyses.准备好进行ROC分析了吗？关于基于模拟的功效分析的教程，用于ROC曲线分析的零假设显著性检验、最小效应检验和等效性检验。

Behav Res Methods. 2025 Mar 18;57(4):120. doi: 10.3758/s13428-025-02646-x.

Hypothesis testing in noninferiority and equivalence MRMC ROC studies.非劣效性和等效性 MRMC ROC 研究中的假设检验。

Acad Radiol. 2012 Sep;19(9):1158-65. doi: 10.1016/j.acra.2012.04.011. Epub 2012 Jun 19.

On use of partial area under the ROC curve for evaluation of diagnostic performance.ROC 曲线下面积的使用评估诊断性能。

Stat Med. 2013 Sep 10;32(20):3449-58. doi: 10.1002/sim.5777. Epub 2013 Mar 18.

Jackknife variance of the partial area under the empirical receiver operating characteristic curve.经验性接收者操作特征曲线下部分面积的刀切法方差

Stat Methods Med Res. 2017 Apr;26(2):528-541. doi: 10.1177/0962280214551190. Epub 2014 Sep 16.

Joint hypothesis testing of the area under the receiver operating characteristic curve and the Youden index.联合假设检验受试者工作特征曲线下面积和约登指数。

Pharm Stat. 2021 May;20(3):657-674. doi: 10.1002/pst.2099. Epub 2021 Jan 29.

Statistical Inference for Box-Cox based Receiver Operating Characteristic Curves.基于Box-Cox变换的接收者操作特征曲线的统计推断

Stat Med. 2024 Dec 30;43(30):6099-6122. doi: 10.1002/sim.10252. Epub 2024 Nov 17.

Sample size calculation for comparing two ROC curves.比较两条 ROC 曲线的样本量计算。

Pharm Stat. 2024 Jul-Aug;23(4):557-569. doi: 10.1002/pst.2371. Epub 2024 Feb 28.

Comparison of Paired ROC Curves through a Two-Stage Test.通过两阶段检验比较配对ROC曲线

J Biopharm Stat. 2015;25(5):881-902. doi: 10.1080/10543406.2014.920874. Epub 2014 Jun 6.

Statistical power in observer-performance studies: comparison of the receiver operating characteristic and free-response methods in tasks involving localization.观察者表现研究中的统计功效：涉及定位任务的接受者操作特征法与自由反应法的比较

Acad Radiol. 2002 Feb;9(2):147-56. doi: 10.1016/s1076-6332(03)80164-2.

Misuse of DeLong test to compare AUCs for nested models.误用 Delong 检验比较嵌套模型的 AUC。

Stat Med. 2012 Oct 15;31(23):2577-87. doi: 10.1002/sim.5328. Epub 2012 Mar 13.

本文引用的文献

Null regions: a unified conceptual framework for statistical inference.零区域：统计推断的统一概念框架

R Soc Open Sci. 2023 Nov 22;10(11):221328. doi: 10.1098/rsos.221328. eCollection 2023 Nov.

Evidence of questionable research practices in clinical prediction models.临床预测模型中存在可疑研究行为的证据。

BMC Med. 2023 Sep 4;21(1):339. doi: 10.1186/s12916-023-03048-6.

The receiver operating characteristic area under the curve (or mean ridit) as an effect size.作为效应量的曲线下接受者操作特征面积（或平均里地值）。

Psychol Methods. 2025 Jun;30(3):678-686. doi: 10.1037/met0000601. Epub 2023 Jul 13.

Measuring memory is harder than you think: How to avoid problematic measurement practices in memory research.衡量记忆比你想象的要难：如何避免记忆研究中的有问题的测量实践。

Psychon Bull Rev. 2023 Apr;30(2):421-449. doi: 10.3758/s13423-022-02179-w. Epub 2022 Oct 19.

Correspondence: Reward, but do not yet require, interval hypothesis tests.通信：奖励，但尚不要求进行区间假设检验。

J Physiother. 2022 Jul;68(3):213-214. doi: 10.1016/j.jphys.2022.06.004. Epub 2022 Jun 24.

Myths and methodologies: The use of equivalence and non-inferiority tests for interventional studies in exercise physiology and sport science.误区与方法：等效性检验和非劣效性检验在运动生理学与运动科学干预性研究中的应用

Exp Physiol. 2022 Mar;107(3):201-212. doi: 10.1113/EP090171. Epub 2022 Jan 26.

Denouncing the use of field-specific effect size distributions to inform magnitude.谴责使用特定领域的效应量分布来告知大小。

PeerJ. 2021 Jun 14;9:e11383. doi: 10.7717/peerj.11383. eCollection 2021.

Avoid Cohen's 'Small', 'Medium', and 'Large' for Power Analysis.避免 Cohen 的“小”、“中”和“大”进行功效分析。

Trends Cogn Sci. 2020 Mar;24(3):200-207. doi: 10.1016/j.tics.2019.12.009. Epub 2020 Jan 15.

Exploring perceptions of meaningfulness in visual representations of bivariate relationships.探索二元关系视觉表征中意义感的认知。

PeerJ. 2019 May 14;7:e6853. doi: 10.7717/peerj.6853. eCollection 2019.

Establishing the Minimal Clinically Important Difference for the Hospital Anxiety and Depression Scale in Patients With Cardiovascular Disease.确立心血管疾病患者医院焦虑抑郁量表的最小临床重要差异。

J Cardiopulm Rehabil Prev. 2019 Nov;39(6):E6-E11. doi: 10.1097/HCR.0000000000000379.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

准备好进行ROC分析了吗？关于基于模拟的功效分析的教程，用于ROC曲线分析的零假设显著性检验、最小效应检验和等效性检验。

Ready to ROC? A tutorial on simulation-based power analyses for null hypothesis significance, minimum-effect, and equivalence testing for ROC curve analyses.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献