Effects of response format on achievement and aptitude assessment results: multi-level random effects meta-analyses.

Author information

Breuer Sonja, Scherndl Thomas, Ortner Tuulia M

Affiliation

Division of Psychological Assessment, Department of Psychology, Paris Lodron University, Salzburg, Austria.

Publication information

R Soc Open Sci. 2023 May 3;10(5):220456. doi: 10.1098/rsos.220456. eCollection 2023 May.

DOI:10.1098/rsos.220456

PMID:37153364

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10154931/

Abstract

Psychological achievement and aptitude tests are fundamental elements of the everyday school, academic and professional lives of students, instructors, job applicants, researchers and policymakers. In line with growing demands for fair psychological assessment tools, we aimed to identify psychometric features of tests, test situations and test-taker characteristics that may contribute to the emergence of test bias. Multi-level random effects meta-analyses were conducted to estimate mean effect sizes for differences and relations between scores from achievement or aptitude measures with open-ended (OE) versus closed-ended (CE) response formats. Results from 102 primary studies with 392 effect sizes revealed positive relations between CE and OE assessments (mean = 0.67, 95% CI [0.57; 0.76]), with negative pooled effect sizes for the difference between the two response formats (mean = -0.65; 95% CI [-0.78; -0.53]). Significantly higher scores were obtained on CE exams. Stem-equivalency of items, low-stakes test situations, written short answer OE question types, studies conducted outside the United States and before the year 2000, and test-takers' achievement motivation and sex were at least partially associated with smaller differences and/or larger relations between scores from OE and CE formats. Limitations and the results' implications for practitioners in achievement and aptitude testing are discussed.
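For readers unfamiliar with how a pooled mean effect size and its confidence interval are obtained, the following is a minimal sketch of random-effects pooling. It is a simplification and not the authors' procedure: the study fitted multi-level models that account for several effect sizes nested within the same primary study, whereas this sketch uses the simpler two-level DerSimonian-Laird estimator and treats every effect size as independent. The function name `pool_random_effects` and the input values are hypothetical and are not taken from the paper.

```python
import math


def pool_random_effects(effects, variances):
    """Pool effect sizes with the DerSimonian-Laird random-effects estimator.

    effects   -- list of effect sizes (e.g. standardized mean differences)
    variances -- list of their sampling variances (same length)
    Returns (pooled_mean, (ci_low, ci_high), tau2).
    """
    k = len(effects)

    # Fixed-effect (inverse-variance) weights and mean.
    w = [1.0 / v for v in variances]
    sum_w = sum(w)
    fixed_mean = sum(wi * yi for wi, yi in zip(w, effects)) / sum_w

    # Cochran's Q measures the observed dispersion around the fixed-effect mean.
    q = sum(wi * (yi - fixed_mean) ** 2 for wi, yi in zip(w, effects))

    # Method-of-moments estimate of the between-study variance tau^2,
    # truncated at zero.
    c = sum_w - sum(wi ** 2 for wi in w) / sum_w
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each sampling variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    sum_w_star = sum(w_star)
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum_w_star

    # Normal-approximation 95% confidence interval.
    se = math.sqrt(1.0 / sum_w_star)
    return pooled, (pooled - 1.96 * se, pooled + 1.96 * se), tau2


if __name__ == "__main__":
    # Hypothetical illustrative inputs, NOT data from the meta-analysis.
    demo_effects = [-0.40, -0.85, -0.60, -0.72]
    demo_variances = [0.02, 0.05, 0.03, 0.04]
    mean, (low, high), tau2 = pool_random_effects(demo_effects, demo_variances)
    print(f"pooled mean = {mean:.2f}, 95% CI [{low:.2f}; {high:.2f}], tau^2 = {tau2:.3f}")
```

A multi-level model of the kind named in the abstract would add a further variance component for effect sizes that share a primary study. Ignoring that dependence, as this sketch does, generally makes the confidence interval too narrow.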
