Holzmeister Felix, Johannesson Magnus, Camerer Colin F, Chen Yiling, Ho Teck-Hua, Hoogeveen Suzanne, Huber Juergen, Imai Noriko, Imai Taisuke, Jin Lawrence, Kirchler Michael, Ly Alexander, Mandl Benjamin, Manfredi Dylan, Nave Gideon, Nosek Brian A, Pfeiffer Thomas, Sarafoglou Alexandra, Schwaiger Rene, Wagenmakers Eric-Jan, Waldén Viking, Dreber Anna
Department of Economics, University of Innsbruck, Innsbruck, Austria.
Department of Economics, Stockholm School of Economics, Stockholm, Sweden.
Nat Hum Behav. 2025 Feb;9(2):316-330. doi: 10.1038/s41562-024-02062-9. Epub 2024 Nov 19.
Here we test the feasibility of using decision markets to select studies for replication and provide evidence about the replicability of online experiments. Social scientists (n = 162) traded on the outcome of close replications of 41 systematically selected MTurk social science experiments published in PNAS 2015-2018, knowing that the 12 studies with the lowest and the 12 with the highest final market prices would be selected for replication, along with 2 randomly selected studies. The replication rate, based on the statistical significance indicator, was 83% for the top-12 and 33% for the bottom-12 group. Overall, 54% of the studies were successfully replicated, with replication effect size estimates averaging 45% of the original effect size estimates. The replication rate varied between 54% and 62% for alternative replication indicators. The observed replicability of MTurk experiments is comparable to that of previous systematic replication projects involving laboratory experiments.
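The selection rule described in the abstract (the 12 lowest-priced and 12 highest-priced studies, plus 2 chosen at random) can be illustrated with a minimal sketch. This is not the authors' code. The function name, the price data, the tie-breaking by study label, and the choice to draw the random studies from those not already in the top or bottom group are all assumptions made for illustration.

```python
import random


def select_for_replication(final_prices, k=12, n_random=2, seed=0):
    """Pick studies for replication from final market prices.

    Hypothetical helper: returns the k lowest-priced studies, the k
    highest-priced studies, and n_random studies drawn at random from
    the rest (drawing from the rest is an assumption, not stated in
    the abstract).
    """
    if k < 1 or len(final_prices) < 2 * k + n_random:
        raise ValueError("not enough studies for the requested selection")

    # Rank by final price; ties are broken by study label (an assumption).
    ranked = sorted(final_prices, key=lambda s: (final_prices[s], s))

    bottom = ranked[:k]    # lowest final prices
    top = ranked[-k:]      # highest final prices
    middle = ranked[k:-k]  # everything not already selected

    rng = random.Random(seed)  # seeded so the draw is reproducible
    randomly_selected = rng.sample(middle, n_random)

    return {"bottom": bottom, "top": top, "random": randomly_selected}


# Made-up prices for 41 studies, for illustration only.
prices = {f"study_{i:02d}": i / 100 for i in range(1, 42)}
selection = select_for_replication(prices)
```

With 41 studies this yields 26 selected studies in total (12 + 12 + 2), which matches the counts given in the abstract.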