文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Ready to ROC? A tutorial on simulation-based power analyses for null hypothesis significance, minimum-effect, and equivalence testing for ROC curve analyses.

作者信息

Riesthuis Paul, Otgaar Henry, Bücken Charlotte

机构信息

Faculty of Law and Criminology, KU Leuven, Leuven, Belgium.

Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands.

出版信息

Behav Res Methods. 2025 Mar 18;57(4):120. doi: 10.3758/s13428-025-02646-x.


DOI:10.3758/s13428-025-02646-x
PMID:40102332
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11920309/
Abstract

The receiver operating characteristic (ROC) curve and its corresponding (partial) area under the curve (AUC) are frequently used statistical tools in psychological research to assess the discriminability of a test, method, intervention, or procedure. In this paper, we provide a tutorial on conducting simulation-based power analyses for ROC curve and (p)AUC analyses in R. We also created a Shiny app and the R package "ROCpower" to perform such power analyses. In our tutorial, we highlight the importance of setting the smallest effect size of interest (SESOI) for which researchers want to conduct their power analysis. The SESOI is the smallest effect that is practically or theoretically relevant for a specific field of research or study. We provide how such a SESOI can be established and how it changes hypotheses from simply establishing whether there is a statistically significant effect (i.e., null-hypothesis significance testing) to whether the effects are practically or theoretically important (i.e., minimum-effect testing) or whether the effect is too small to care about (i.e., equivalence testing). We show how power analyses for these different hypothesis tests can be conducted via a confidence interval-focused approach. This confidence interval-focused, simulation-based power analysis can be adapted to different research designs and questions and improves the reproducibility of power analyses.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/8f6f1979afed/13428_2025_2646_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/fcd2f908f06a/13428_2025_2646_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/e8ec1ac425b6/13428_2025_2646_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/d15fffcc1844/13428_2025_2646_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/171d4925998b/13428_2025_2646_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/f130237a4d94/13428_2025_2646_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/437f2a422efd/13428_2025_2646_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/34bfd712bc73/13428_2025_2646_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/e9ea741dffd0/13428_2025_2646_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/a4cffebdb54d/13428_2025_2646_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/d3a60fb287f1/13428_2025_2646_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/939051bfaef3/13428_2025_2646_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/3c8a79502824/13428_2025_2646_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/eec4f462615c/13428_2025_2646_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/8f6f1979afed/13428_2025_2646_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/fcd2f908f06a/13428_2025_2646_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/e8ec1ac425b6/13428_2025_2646_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/d15fffcc1844/13428_2025_2646_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/171d4925998b/13428_2025_2646_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/f130237a4d94/13428_2025_2646_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/437f2a422efd/13428_2025_2646_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/34bfd712bc73/13428_2025_2646_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/e9ea741dffd0/13428_2025_2646_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/a4cffebdb54d/13428_2025_2646_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/d3a60fb287f1/13428_2025_2646_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/939051bfaef3/13428_2025_2646_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/3c8a79502824/13428_2025_2646_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/eec4f462615c/13428_2025_2646_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2a5/11920309/8f6f1979afed/13428_2025_2646_Fig14_HTML.jpg

相似文献

[1]
Ready to ROC? A tutorial on simulation-based power analyses for null hypothesis significance, minimum-effect, and equivalence testing for ROC curve analyses.

Behav Res Methods. 2025-3-18

[2]
Hypothesis testing in noninferiority and equivalence MRMC ROC studies.

Acad Radiol. 2012-6-19

[3]
On use of partial area under the ROC curve for evaluation of diagnostic performance.

Stat Med. 2013-3-18

[4]
Jackknife variance of the partial area under the empirical receiver operating characteristic curve.

Stat Methods Med Res. 2017-4

[5]
Joint hypothesis testing of the area under the receiver operating characteristic curve and the Youden index.

Pharm Stat. 2021-5

[6]
Statistical Inference for Box-Cox based Receiver Operating Characteristic Curves.

Stat Med. 2024-12-30

[7]
Sample size calculation for comparing two ROC curves.

Pharm Stat. 2024

[8]
Comparison of Paired ROC Curves through a Two-Stage Test.

J Biopharm Stat. 2015

[9]
Statistical power in observer-performance studies: comparison of the receiver operating characteristic and free-response methods in tasks involving localization.

Acad Radiol. 2002-2

[10]
Misuse of DeLong test to compare AUCs for nested models.

Stat Med. 2012-3-13

本文引用的文献

[1]
Null regions: a unified conceptual framework for statistical inference.

R Soc Open Sci. 2023-11-22

[2]
Evidence of questionable research practices in clinical prediction models.

BMC Med. 2023-9-4

[3]
The receiver operating characteristic area under the curve (or mean ridit) as an effect size.

Psychol Methods. 2025-6

[4]
Measuring memory is harder than you think: How to avoid problematic measurement practices in memory research.

Psychon Bull Rev. 2023-4

[5]
Correspondence: Reward, but do not yet require, interval hypothesis tests.

J Physiother. 2022-7

[6]
Myths and methodologies: The use of equivalence and non-inferiority tests for interventional studies in exercise physiology and sport science.

Exp Physiol. 2022-3

[7]
Denouncing the use of field-specific effect size distributions to inform magnitude.

PeerJ. 2021-6-14

[8]
Avoid Cohen's 'Small', 'Medium', and 'Large' for Power Analysis.

Trends Cogn Sci. 2020-3

[9]
Exploring perceptions of meaningfulness in visual representations of bivariate relationships.

PeerJ. 2019-5-14

[10]
Establishing the Minimal Clinically Important Difference for the Hospital Anxiety and Depression Scale in Patients With Cardiovascular Disease.

J Cardiopulm Rehabil Prev. 2019-11

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索