Dreber Anna, Pfeiffer Thomas, Almenberg Johan, Isaksson Siri, Wilson Brad, Chen Yiling, Nosek Brian A, Johannesson Magnus
Department of Economics, Stockholm School of Economics, SE-113 83 Stockholm, Sweden;
New Zealand Institute for Advanced Study, Massey University, Auckland 0745, New Zealand; Wissenschaftskolleg zu Berlin-Institute for Advanced Study, D-14193 Berlin, Germany;
Proc Natl Acad Sci U S A. 2015 Dec 15;112(50):15343-7. doi: 10.1073/pnas.1516179112. Epub 2015 Nov 9.
Concerns about a lack of reproducibility of statistically significant results have recently been raised in many fields, and it has been argued that this lack comes at substantial economic costs. We here report the results from prediction markets set up to quantify the reproducibility of 44 studies published in prominent psychology journals and replicated in the Reproducibility Project: Psychology. The prediction markets predict the outcomes of the replications well and outperform a survey of market participants' individual forecasts. This shows that prediction markets are a promising tool for assessing the reproducibility of published scientific results. The prediction markets also allow us to estimate probabilities for the hypotheses being true at different testing stages, which provides valuable information regarding the temporal dynamics of scientific discovery. We find that the hypotheses being tested in psychology typically have low prior probabilities of being true (median, 9%) and that a "statistically significant" finding needs to be confirmed in a well-powered replication to have a high probability of being true. We argue that prediction markets could be used to obtain speedy information about reproducibility at low cost and could potentially even be used to determine which studies to replicate to optimally allocate limited resources into replications.
最近,许多领域都对统计显著结果缺乏可重复性表示担忧,有人认为这种缺乏可重复性会带来巨大的经济成本。我们在此报告为量化发表在著名心理学期刊上并在“心理学可重复性项目”中进行重复研究的44项研究的可重复性而设立的预测市场的结果。预测市场能很好地预测重复研究的结果,并且优于对市场参与者个人预测的调查。这表明预测市场是评估已发表科学结果可重复性的一种很有前景的工具。预测市场还使我们能够估计在不同测试阶段假设为真的概率,这提供了有关科学发现时间动态的有价值信息。我们发现,心理学中正在测试的假设通常为真的先验概率较低(中位数为9%),并且一个“统计显著”的发现需要在有充分效力的重复研究中得到证实,才更有可能为真。我们认为,预测市场可用于以低成本快速获取有关可重复性的信息,甚至有可能用于确定要重复哪些研究,以便将有限资源最优地分配到重复研究中。