Godwin Hayward J, Lee Charlotte E, Drieghe Denis
School of Psychology, University of Southampton, Highfield, Southampton, Hampshire, SO17 1BJ, UK.
Behav Res Methods. 2025 May 7;57(6):164. doi: 10.3758/s13428-025-02689-0.
Eye movement experiments on reading require careful cleaning of raw data into a processed format that can then be analyzed. Researchers make many decisions while cleaning and analyzing these datasets, so a wide range of analytic approaches is possible. At present, little is known about the consequences of these decisions; in a worst-case scenario, specific approaches to cleaning and analysis could "create" effects that would otherwise not be present in the data. Here, we addressed these issues by conducting a multiverse analysis of a range of reasonable and defensible analyses that researchers in this field might conduct. We examined a total of 1,890 different data cleaning and analytic pipelines to explore how these decisions influence perhaps the most well-known effect in eye movements and reading research: the word frequency effect. More specifically, we explored their impact on the size of the word frequency effect during sentence reading (Lee et al. Journal of Experimental Psychology: Learning, Memory, and Cognition, 2025). The frequency effect was extremely robust and present in almost all cases, but its magnitude varied substantially, with 36% of the size of the effect being due to specific choices made during data cleaning and analysis. Based on our findings, we set out recommendations for further work and greater transparency in the field.
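As a rough illustration of what a multiverse analysis involves, the sketch below crosses a few data cleaning decisions and re-estimates a frequency effect under every combination. It is not the authors' pipeline: the decision points (`min_fixation_ms`, `max_fixation_ms`, `sd_trim`), their values, and the simulated fixation durations are all assumptions made for the example, and the abstract does not list the options actually examined.

```python
import itertools
import random
import statistics

# Hypothetical decision points (assumed for illustration only).
DECISIONS = {
    "min_fixation_ms": [0, 80, 100],      # drop fixations shorter than this
    "max_fixation_ms": [None, 800, 1200], # drop fixations longer than this
    "sd_trim": [None, 2.5, 3.0],          # per-condition SD trimming
}


def simulate_fixations(seed=0, n=500):
    """Simulated (condition, duration in ms) pairs; not real data."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        data.append(("high", max(1.0, rng.gauss(220.0, 60.0))))
        data.append(("low", max(1.0, rng.gauss(250.0, 60.0))))
    return data


def clean(durations, min_fixation_ms, max_fixation_ms, sd_trim):
    """Apply one combination of cleaning decisions to a list of durations."""
    kept = [d for d in durations if d >= min_fixation_ms]
    if max_fixation_ms is not None:
        kept = [d for d in kept if d <= max_fixation_ms]
    if sd_trim is not None and len(kept) > 1:
        mean, sd = statistics.mean(kept), statistics.stdev(kept)
        kept = [d for d in kept if abs(d - mean) <= sd_trim * sd]
    return kept


def run_multiverse(data):
    """Estimate the low-minus-high frequency effect under every pipeline."""
    names = list(DECISIONS)
    results = []
    for values in itertools.product(*(DECISIONS[n] for n in names)):
        pipeline = dict(zip(names, values))
        high = clean([d for c, d in data if c == "high"], **pipeline)
        low = clean([d for c, d in data if c == "low"], **pipeline)
        effect = statistics.mean(low) - statistics.mean(high)
        results.append({**pipeline, "effect_ms": effect})
    return results


results = run_multiverse(simulate_fixations())
effects = [r["effect_ms"] for r in results]
print(f"{len(results)} pipelines, effect range: "
      f"{min(effects):.1f} to {max(effects):.1f} ms")
```

The spread of `effect_ms` across pipelines is the quantity of interest: it shows how much an estimate can move because of analytic choices alone, which is the kind of variation the abstract reports for the word frequency effect.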