Department of Psychology, San Francisco State University, San Francisco, CA, USA.
Q J Exp Psychol (Hove). 2020 Aug;73(8):1290-1299. doi: 10.1177/1747021819900098. Epub 2020 Feb 16.
Five recent meta-analyses of the bilingual advantage in executive functioning hypothesis have converged on the outcome that the mean effect size is very small and that the incidence of statistically significant bilingual advantages is very low (about 15% of all comparisons). Those analyses that used the PET-PEESE method to correct for publication bias show mean effect sizes that are not different from zero and sometimes negative. In contrast, van den Noort and colleagues provide a sixth review of 46 studies published before October 31, 2018, that appears to produce a very different outcome, namely that more than half the studies yield clear support for the bilingual advantage hypothesis. We show that the deviance is due in part to search terms that yielded far fewer relevant studies, but more importantly to a subjective method of evaluating the results of each study that enables confirmation biases on the part of study authors and meta-analysts to substantially distort the objective pattern of results. A seventh meta-analysis, by Armstrong and colleagues, reports significant bilingual advantages of = 0.48 for 32 samples using Simon and Stroop colour-word interference tasks that tested older adults. However, all effects were entered into the funnel plots as positive even though many were negative (bilingual disadvantages). This and other striking anomalies are consistent with the view that confirmation bias can suspend critical judgement and promulgate errors. Meta-analyses that use preregistration and a many-labs collaboration can better control for both publication and experimenter biases.
五项最近的双语优势执行功能假说的荟萃分析结果表明,平均效应量非常小,并且具有统计学意义的双语优势的发生率非常低(约为所有比较的 15%)。那些使用 PET-PEESE 方法纠正发表偏倚的分析表明,平均效应量与零没有差异,有时甚至为负。相比之下,van den Noort 和同事提供了对 2018 年 10 月 31 日之前发表的 46 项研究的第六次综述,该综述似乎产生了非常不同的结果,即超过一半的研究清楚地支持双语优势假说。我们表明,这种差异部分是由于搜索词导致相关研究较少,但更重要的是,对每项研究结果进行主观评估的方法使得研究作者和荟萃分析人员的确认偏差能够严重扭曲结果的客观模式。Armstrong 及其同事的第七项荟萃分析报告了使用 Simon 和 Stroop 颜色-单词干扰任务测试老年人的 32 个样本中双语优势的显著效应量 = 0.48。然而,即使许多效应量是负的(双语劣势),所有效应量都被纳入了漏斗图中作为正的。这一和其他引人注目的异常现象与确认偏差可以暂停批判性判断并传播错误的观点一致。使用预注册和多实验室合作的荟萃分析可以更好地控制发表偏倚和实验者偏倚。