Pawel Samuel, Consonni Guido, Held Leonhard
Department of Biostatistics, Center for Reproducible Science, University of Zurich.
Dipartimento di Scienze Statistiche, Universita Cattolica del Sacro Cuore.
Psychol Methods. 2023 Aug 10. doi: 10.1037/met0000604.
Replication studies are essential for assessing the credibility of claims from original studies. A critical aspect of designing replication studies is determining their sample size; a too-small sample size may lead to inconclusive studies whereas a too-large sample size may waste resources that could be allocated better in other studies. Here, we show how Bayesian approaches can be used for tackling this problem. The Bayesian framework allows researchers to combine the original data and external knowledge in a design prior distribution for the underlying parameters. Based on a design prior, predictions about the replication data can be made, and the replication sample size can be chosen to ensure a sufficiently high probability of replication success. Replication success may be defined by Bayesian or non-Bayesian criteria and different criteria may also be combined to meet distinct stakeholders and enable conclusive inferences based on multiple analysis approaches. We investigate sample size determination in the normal-normal hierarchical model where analytical results are available and traditional sample size determination is a special case where the uncertainty on parameter values is not accounted for. We use data from a multisite replication project of social-behavioral experiments to illustrate how Bayesian approaches can help design informative and cost-effective replication studies. Our methods can be used through the R package BayesRepDesign. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
重复研究对于评估原始研究结果的可信度至关重要。设计重复研究的一个关键方面是确定其样本量;样本量过小可能导致研究结果不明确,而样本量过大则可能浪费资源,这些资源本可在其他研究中得到更好的分配。在此,我们展示了如何使用贝叶斯方法来解决这个问题。贝叶斯框架允许研究人员在基础参数的设计先验分布中结合原始数据和外部知识。基于设计先验,可以对重复数据进行预测,并选择重复样本量以确保重复成功的概率足够高。重复成功可以通过贝叶斯或非贝叶斯标准来定义,不同的标准也可以结合起来,以满足不同的利益相关者,并基于多种分析方法进行确定性推断。我们在正态-正态层次模型中研究样本量的确定,在该模型中可以获得分析结果,传统的样本量确定是不考虑参数值不确定性的特殊情况。我们使用来自社会行为实验多站点重复项目的数据来说明贝叶斯方法如何有助于设计信息丰富且具有成本效益的重复研究。我们的方法可以通过R包BayesRepDesign使用。(PsycInfo数据库记录(c)2023美国心理学会,保留所有权利)