Muradchanian Jasmine, Hoekstra Rink, Kiers Henk, van Ravenzwaaij Don
Behavioural and Social Sciences, University of Groningen, The Netherlands.
R Soc Open Sci. 2021 May 19;8(5):201697. doi: 10.1098/rsos.201697.
To overcome the frequently debated crisis of confidence, replicating studies is becoming increasingly more common. Multiple frequentist and Bayesian measures have been proposed to evaluate whether a replication is successful, but little is known about which method best captures replication success. This study is one of the first attempts to compare a number of quantitative measures of replication success with respect to their ability to draw the correct inference when the underlying truth is known, while taking publication bias into account. Our results show that Bayesian metrics seem to slightly outperform frequentist metrics across the board. Generally, meta-analytic approaches seem to slightly outperform metrics that evaluate single studies, except in the scenario of extreme publication bias, where this pattern reverses.
为克服经常被讨论的信任危机,重复研究正变得越来越普遍。已经提出了多种频率主义和贝叶斯方法来评估重复研究是否成功,但对于哪种方法最能体现重复研究的成功却知之甚少。本研究首次尝试比较多种重复研究成功的定量指标,考察在已知潜在真相的情况下,这些指标在得出正确推断方面的能力,同时考虑发表偏倚。我们的结果表明,贝叶斯指标总体上似乎略优于频率主义指标。一般来说,元分析方法似乎略优于评估单个研究的指标,但在极端发表偏倚的情况下,这种模式会反转。