Breuer Sonja, Scherndl Thomas, Ortner Tuulia M
Division of Psychological Assessment, Department of Psychology, Paris Lodron University, Salzburg, Austria.
R Soc Open Sci. 2023 May 3;10(5):220456. doi: 10.1098/rsos.220456. eCollection 2023 May.
Psychological achievement and aptitude tests are fundamental elements of the everyday school, academic and professional lives of students, instructors, job applicants, researchers and policymakers. In line with growing demands for fair psychological assessment tools, we aimed to identify psychometric features of tests, test situations and test-taker characteristics that may contribute to the emergence of test bias. Multi-level random effects meta-analyses were conducted to estimate mean effect sizes for differences and relations between scores from achievement or aptitude measures with open-ended (OE) versus closed-ended (CE) response formats. Results from 102 primary studies with 392 effect sizes revealed positive relations between CE and OE assessments (mean = 0.67, 95% CI [0.57; 0.76]), with negative pooled effect sizes for the difference between the two response formats (mean = -0.65; 95% CI [-0.78; -0.53]). Significantly higher scores were obtained on CE exams. Stem-equivalency of items, low-stakes test situations, written short answer OE question types, studies conducted outside the United States and before the year 2000, and test-takers' achievement motivation and sex were at least partially associated with smaller differences and/or larger relations between scores from OE and CE formats. Limitations and the results' implications for practitioners in achievement and aptitude testing are discussed.
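The pooled means and confidence intervals reported above come from random-effects models. As a rough illustration of how such pooling works, the following is a minimal sketch of the simpler single-level DerSimonian-Laird estimator. It is not the multi-level model used in the study, which also accounts for several effect sizes nested within one primary study. The function name `random_effects_pool` and the toy inputs are hypothetical and are not taken from the study's data.

```python
import math


def random_effects_pool(effects, variances):
    """Single-level DerSimonian-Laird random-effects pooling (illustrative only).

    effects   -- per-study effect sizes (e.g. standardized mean differences)
    variances -- per-study sampling variances of those effect sizes
    Returns (pooled estimate, (ci_low, ci_high), tau2).
    """
    k = len(effects)

    # Fixed-effect (inverse-variance) weights and estimate.
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw

    # Cochran's Q and the method-of-moments between-study variance tau^2.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each sampling variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    sw_star = sum(w_star)
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sw_star

    # Normal-approximation 95% confidence interval.
    se = math.sqrt(1.0 / sw_star)
    return pooled, (pooled - 1.96 * se, pooled + 1.96 * se), tau2


# Hypothetical toy values, NOT data from the meta-analysis.
toy_effects = [-0.2, -0.6, -1.0]
toy_variances = [0.04, 0.04, 0.04]
pooled, ci, tau2 = random_effects_pool(toy_effects, toy_variances)
print(pooled, ci, tau2)
```

A negative pooled value in this sketch would correspond to lower OE than CE scores only under the sign convention assumed here (OE minus CE). A multi-level analysis would additionally split the heterogeneity into within-study and between-study components.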