Traynor Anne, Li Cheng-Hsien, Zhou Shuqi
Purdue University, West Lafayette, IN, USA.
National Sun Yat-sen University, Kaohsiung, Taiwan.
Educ Psychol Meas. 2025 Jan 30:00131644241313212. doi: 10.1177/00131644241313212.
Inferences about student learning from large-scale achievement test scores are fundamental in education. For achievement test scores to provide useful information about student learning progress, differences in the content of instruction (i.e., the implemented curriculum) should affect test-takers' item responses. Existing research has begun to identify patterns in the content of instructionally sensitive multiple-choice achievement test items. To inform future test design decisions, this study identified instructionally (in)sensitive constructed-response achievement items, then characterized features of those items and their corresponding scoring rubrics. First, we used simulation to evaluate an item step difficulty difference index for constructed-response test items, derived from the generalized partial credit model. The statistical performance of the index was adequate, so we then applied it to data from 32 constructed-response eighth-grade science test items. We found that the instructional sensitivity (IS) index values varied appreciably across the category boundaries within an item as well as across items. Content analysis by master science teachers allowed us to identify general features of item score categories that show high, or negligible, IS.
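The abstract's index compares item step difficulties, estimated under the generalized partial credit model (GPCM), between groups of test-takers who did and did not receive the relevant instruction. The exact form of the authors' index is not given here, so the sketch below is only a minimal illustration under assumed conventions: `gpcm_category_probs` implements the standard GPCM category response function, and `step_difficulty_difference` is a hypothetical per-step contrast of group-specific step difficulty estimates, where a large positive value at a step would suggest that step is easier to clear for instructed students.

```python
import numpy as np

def gpcm_category_probs(theta, a, b_steps):
    """Category response probabilities under the generalized partial
    credit model, for ability theta, discrimination a, and a list of
    step difficulties b_steps (one per category boundary)."""
    # z_k = sum_{j<=k} a * (theta - b_j); category 0 contributes z_0 = 0
    z = np.concatenate([[0.0], np.cumsum(a * (theta - np.asarray(b_steps)))])
    ez = np.exp(z - z.max())          # stabilize before normalizing
    return ez / ez.sum()

def step_difficulty_difference(b_instructed, b_uninstructed):
    """Hypothetical per-step index: difference in estimated GPCM step
    difficulties between uninstructed and instructed groups. Positive
    values indicate a step that is harder without instruction."""
    return np.asarray(b_uninstructed) - np.asarray(b_instructed)

# Example with assumed parameter values: one item, two score steps
probs = gpcm_category_probs(theta=0.0, a=1.0, b_steps=[-1.0, 0.5])
diff = step_difficulty_difference([-1.0, 0.0], [-0.5, 0.8])
```

Varying `diff` across steps within an item mirrors the paper's finding that sensitivity can differ across category boundaries, not just across items.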