拒绝概率与拒绝比率：关于假设检验中统计实践的一项提议。

Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses.

作者信息

Bayarri M J, Benjamin Daniel J, Berger James O, Sellke Thomas M

机构信息

Universitat de València, Spain.

University of Southern California, United States.

出版信息

J Math Psychol. 2016 Jun;72:90-103. doi: 10.1016/j.jmp.2015.12.007. Epub 2016 Feb 5.

DOI:10.1016/j.jmp.2015.12.007

PMID:30713353

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6358203/

Abstract

Much of science is (rightly or wrongly) driven by hypothesis testing. Even in situations where the hypothesis testing paradigm is correct, the common practice of basing inferences solely on -values has been under intense criticism for over 50 years. We propose, as an alternative, the use of the odds of a correct rejection of the null hypothesis to incorrect rejection. Both pre-experimental versions (involving the power and Type I error) and post-experimental versions (depending on the actual data) are considered. Implementations are provided that range from depending only on the -value to consideration of full Bayesian analysis. A surprise is that all implementations - even the full Bayesian analysis - have complete frequentist justification. Versions of our proposal can be implemented that require only minor modifications to existing practices yet overcome some of their most severe shortcomings.

摘要

许多科学研究（无论正确与否）都是由假设检验驱动的。即使在假设检验范式正确的情况下，仅基于P值进行推断的常见做法在过去50多年里一直受到强烈批评。作为一种替代方法，我们建议使用正确拒绝原假设与错误拒绝原假设的概率。我们考虑了实验前版本（涉及检验功效和I类错误）和实验后版本（取决于实际数据）。我们提供了从仅依赖P值到考虑完全贝叶斯分析的各种实现方法。令人惊讶的是，所有实现方法——甚至是完全贝叶斯分析——都有完整的频率主义依据。我们的提议的版本可以通过对现有做法进行微小修改来实现，同时克服它们一些最严重的缺点。

相似文献

Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses.拒绝概率与拒绝比率：关于假设检验中统计实践的一项提议。

J Math Psychol. 2016 Jun;72:90-103. doi: 10.1016/j.jmp.2015.12.007. Epub 2016 Feb 5.

Bayesian evaluation of informative hypotheses in cluster-randomized trials.贝叶斯评价在整群随机试验中信息性假设。

Behav Res Methods. 2019 Feb;51(1):126-137. doi: 10.3758/s13428-018-1149-x.

Using epistemic ratios to evaluate hypotheses: an imprecision penalty for imprecise hypotheses.使用认知比率评估假设：对不精确假设的不精确惩罚。

Genet Soc Gen Psychol Monogr. 2006 Nov;132(4):431-62. doi: 10.3200/mono.132.4.431-462.

Deciding on Null Hypotheses using P-values or Bayesian alternatives: A simulation study.使用 P 值或贝叶斯替代方案确定零假设：一项模拟研究。

Psicothema. 2018 Feb;30(1):110-115. doi: 10.7334/psicothema2017.308.

Improving Inferences About Null Effects With Bayes Factors and Equivalence Tests.贝叶斯因子和等效检验提高关于零效应的推断。

J Gerontol B Psychol Sci Soc Sci. 2020 Jan 1;75(1):45-57. doi: 10.1093/geronb/gby065.

A Review of Bayesian Hypothesis Testing and Its Practical Implementations.贝叶斯假设检验及其实际应用综述

Entropy (Basel). 2022 Jan 21;24(2):161. doi: 10.3390/e24020161.

Bayesian -tests for correlations and partial correlations.贝叶斯相关性和偏相关性检验。

J Appl Stat. 2019 Nov 21;47(10):1820-1832. doi: 10.1080/02664763.2019.1695760. eCollection 2020.

Bayesian hypothesis testing in two-arm trials with dichotomous outcomes.双臂二分结果试验中的贝叶斯假设检验。

Biometrics. 2013 Mar;69(1):157-63. doi: 10.1111/j.1541-0420.2012.01806.x. Epub 2012 Sep 24.

Statistical hypothesis testing and common misinterpretations: Should we abandon p-value in forensic science applications?统计假设检验及常见误解：在法医学应用中我们应该摒弃p值吗？

Forensic Sci Int. 2016 Feb;259:e32-6. doi: 10.1016/j.forsciint.2015.11.013. Epub 2015 Dec 12.

Bayes factor and posterior probability: Complementary statistical evidence to p-value.贝叶斯因子与后验概率：作为p值补充的统计证据

Contemp Clin Trials. 2015 Sep;44:33-35. doi: 10.1016/j.cct.2015.07.001. Epub 2015 Jul 26.

引用本文的文献

The neural signatures of the psychological construct "flow": A replication study.心理建构“心流”的神经特征：一项重复研究。

Neuroimage Rep. 2022 Oct 10;2(4):100139. doi: 10.1016/j.ynirp.2022.100139. eCollection 2022 Dec.

Tidal modulation of the seismic activity related to the 2021 La Palma volcanic eruption.潮汐对 2021 年拉帕尔马火山喷发相关地震活动的调制。

Sci Rep. 2023 Apr 20;13(1):6485. doi: 10.1038/s41598-023-33691-1.

Academic, Activist, or Advocate? Angry, Entangled, and Emerging: A Critical Reflection on Autism Knowledge Production.学者、活动家还是倡导者？愤怒、纠缠与兴起：对自闭症知识生成的批判性反思

Front Psychol. 2021 Sep 28;12:727542. doi: 10.3389/fpsyg.2021.727542. eCollection 2021.

The Practical Alternative to the Value Is the Correctly Used Value.实用的替代价值是正确使用的价值。

Perspect Psychol Sci. 2021 May;16(3):639-648. doi: 10.1177/1745691620958012. Epub 2021 Feb 9.

Détente: A Practical Understanding of P values and Bayesian Posterior Probabilities.放松：对 P 值和贝叶斯后验概率的实际理解。

Clin Pharmacol Ther. 2021 Jun;109(6):1489-1498. doi: 10.1002/cpt.2004. Epub 2020 Sep 26.

A decision-theoretic approach to Bayesian clinical trial design and evaluation of robustness to prior-data conflict.一种决策理论方法在贝叶斯临床试验设计中的应用及对先验数据冲突稳健性的评估。

Biostatistics. 2022 Jan 13;23(1):328-344. doi: 10.1093/biostatistics/kxaa027.

Accumulation Bias in meta-analysis: the need to consider in error control.Meta分析中的累积偏倚：误差控制中需要考虑的因素。

F1000Res. 2019 Jun 25;8:962. doi: 10.12688/f1000research.19375.1. eCollection 2019.

Noninferiority and equivalence tests in sequential, multiple assignment, randomized trials (SMARTs).序贯、多次分配、随机试验（SMARTs）中的非劣效性和等效性检验。

Psychol Methods. 2020 Apr;25(2):182-205. doi: 10.1037/met0000232. Epub 2019 Sep 9.

Recognizing that Evidence is Made, not Born.认识到证据是被制造出来的，而不是天生的。

Clin Pharmacol Ther. 2019 Apr;105(4):844-856. doi: 10.1002/cpt.1317. Epub 2019 Jan 4.

A tutorial on bridge sampling.桥抽样教程。

J Math Psychol. 2017 Dec;81:80-97. doi: 10.1016/j.jmp.2017.09.005.

本文引用的文献

Using prediction markets to estimate the reproducibility of scientific research.利用预测市场评估科研的可重复性。

Proc Natl Acad Sci U S A. 2015 Dec 15;112(50):15343-7. doi: 10.1073/pnas.1516179112. Epub 2015 Nov 9.

Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors.超越功效计算：评估 S 型（信号）和 M 型（幅度）误差。

Perspect Psychol Sci. 2014 Nov;9(6):641-51. doi: 10.1177/1745691614551642.

Bayesian Assessment of Null Values Via Parameter Estimation and Model Comparison.贝叶斯方法通过参数估计和模型比较来评估缺失值。

Perspect Psychol Sci. 2011 May;6(3):299-312. doi: 10.1177/1745691611406925.

Genetic studies of body mass index yield new insights for obesity biology.遗传研究体重指数为肥胖生物学提供了新的见解。

Nature. 2015 Feb 12;518(7538):197-206. doi: 10.1038/nature14177.

Replicability and robustness of genome-wide-association studies for behavioral traits.行为特征全基因组关联研究的可重复性和稳健性

Psychol Sci. 2014 Nov;25(11):1975-86. doi: 10.1177/0956797614545132. Epub 2014 Oct 6.

Defining the role of common variation in the genomic and biological architecture of adult human height.确定常见变异在成年人类身高的基因组和生物学结构中的作用。

Nat Genet. 2014 Nov;46(11):1173-86. doi: 10.1038/ng.3097. Epub 2014 Oct 5.

Biological insights from 108 schizophrenia-associated genetic loci.108 个精神分裂症相关遗传位点的生物学见解。

Nature. 2014 Jul 24;511(7510):421-7. doi: 10.1038/nature13595. Epub 2014 Jul 22.

On the persistence of low power in psychological science.论心理学中低效能的持续存在。

Q J Exp Psychol (Hove). 2014 May;67(5):1037-40. doi: 10.1080/17470218.2014.885986. Epub 2014 Mar 3.

Revised standards for statistical evidence.修订后的统计证据标准。

Proc Natl Acad Sci U S A. 2013 Nov 26;110(48):19313-7. doi: 10.1073/pnas.1313476110. Epub 2013 Nov 11.

GWAS of 126,559 individuals identifies genetic variants associated with educational attainment.对 126559 人的全基因组关联研究发现了与受教育程度相关的遗传变异。

Science. 2013 Jun 21;340(6139):1467-71. doi: 10.1126/science.1235488. Epub 2013 May 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

拒绝概率与拒绝比率：关于假设检验中统计实践的一项提议。

Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献