Department of Psychology, The University of Texas at Austin, Austin, TX78712-1043,
Behav Brain Sci. 2020 Dec 21;45:e1. doi: 10.1017/S0140525X20001685.
Most theories and hypotheses in psychology are verbal in nature, yet their evaluation overwhelmingly relies on inferential statistical procedures. The validity of the move from qualitative to quantitative analysis depends on the verbal and statistical expressions of a hypothesis being closely aligned - that is, that the two must refer to roughly the same set of hypothetical observations. Here, I argue that many applications of statistical inference in psychology fail to meet this basic condition. Focusing on the most widely used class of model in psychology - the linear mixed model - I explore the consequences of failing to statistically operationalize verbal hypotheses in a way that respects researchers' actual generalization intentions. I demonstrate that although the "random effect" formalism is used pervasively in psychology to model intersubject variability, few researchers accord the same treatment to other variables they clearly intend to generalize over (e.g., stimuli, tasks, or research sites). The under-specification of random effects imposes far stronger constraints on the generalizability of results than most researchers appreciate. Ignoring these constraints can dramatically inflate false-positive rates, and often leads researchers to draw sweeping verbal generalizations that lack a meaningful connection to the statistical quantities they are putatively based on. I argue that failure to take the alignment between verbal and statistical expressions seriously lies at the heart of many of psychology's ongoing problems (e.g., the replication crisis), and conclude with a discussion of several potential avenues for improvement.
心理学中的大多数理论和假设都是口头的,但它们的评估主要依赖于推理统计程序。从定性分析到定量分析的转变的有效性取决于假设的口头和统计表达是否紧密一致,也就是说,两者必须大致指的是相同的假设观测集。在这里,我认为心理学中统计推断的许多应用都未能满足这一基本条件。我专注于心理学中最广泛使用的模型类别——线性混合模型,探讨了未能以尊重研究人员实际推广意图的方式对口头假设进行统计操作化的后果。我证明了尽管“随机效应”形式在心理学中被广泛用于对个体间变异性进行建模,但很少有研究人员对他们明确打算推广的其他变量(例如,刺激、任务或研究地点)给予相同的处理。随机效应的不充分指定对结果的可推广性施加了比大多数研究人员意识到的更严格的限制。忽略这些限制会极大地增加假阳性率,并且常常导致研究人员得出缺乏与他们所依据的统计数量有意义联系的全面口头概括。我认为,未能认真对待口头和统计表达之间的一致性是心理学许多持续存在的问题(例如,复制危机)的核心所在,并最后讨论了几个潜在的改进途径。