Schäfer Thomas, Schwarz Marcus A
Department of Psychology, Chemnitz University of Technology, Chemnitz, Germany.
Front Psychol. 2019 Apr 11;10:813. doi: 10.3389/fpsyg.2019.00813. eCollection 2019.
Effect sizes are the currency of psychological research. They quantify the results of a study to answer the research question and are used to calculate statistical power. The interpretation of effect sizes-when is an effect small, medium, or large?-has been guided by the recommendations Jacob Cohen gave in his pioneering writings starting in 1962: Either compare an effect with the effects found in past research or use certain conventional benchmarks. The present analysis shows that neither of these recommendations is currently applicable. From past publications without pre-registration, 900 effects were randomly drawn and compared with 93 effects from publications with pre-registration, revealing a large difference: Effects from the former (median = 0.36) were much larger than effects from the latter (median = 0.16). That is, certain biases, such as publication bias or questionable research practices, have caused a dramatic inflation in published effects, making it difficult to compare an actual effect with the real population effects (as these are unknown). In addition, there were very large differences in the mean effects between psychological sub-disciplines and between different study designs, making it impossible to apply any global benchmarks. Many more pre-registered studies are needed in the future to derive a reliable picture of real population effects.
效应量是心理学研究的关键指标。它们量化研究结果以回答研究问题,并用于计算统计功效。效应量的解释——何时为小效应、中等效应或大效应——一直遵循雅各布·科恩自1962年起在其开创性著作中给出的建议:要么将一个效应与过去研究中发现的效应进行比较,要么使用某些传统基准。目前的分析表明,这些建议目前都不适用。从过去未预先注册的出版物中随机抽取900个效应,并与预先注册的出版物中的93个效应进行比较,结果显示出巨大差异:前者的效应(中位数 = 0.36)远大于后者(中位数 = 0.16)。也就是说,某些偏差,如发表偏倚或有问题的研究行为,导致已发表的效应大幅膨胀,使得难以将实际效应与真实总体效应进行比较(因为真实总体效应未知)。此外,心理学子学科之间以及不同研究设计之间的平均效应存在非常大的差异,这使得无法应用任何通用基准。未来需要更多预先注册的研究,以得出关于真实总体效应的可靠情况。