Examining the Instructional Sensitivity of Constructed-Response Achievement Test Item Scores.

Authors

Traynor Anne, Li Cheng-Hsien, Zhou Shuqi

Affiliations

Purdue University, West Lafayette, IN, USA.

National Sun Yat-sen University, Kaohsiung, Taiwan.

Publication information

Educ Psychol Meas. 2025 Jan 30:00131644241313212. doi: 10.1177/00131644241313212.

DOI: 10.1177/00131644241313212
PMID: 39896146
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11783420/

Abstract

Inferences about student learning from large-scale achievement test scores are fundamental in education. For achievement test scores to provide useful information about student learning progress, differences in the content of instruction (i.e., the implemented curriculum) should affect test-takers' item responses. Existing research has begun to identify patterns in the content of instructionally sensitive multiple-choice achievement test items. To inform future test design decisions, this study identified instructionally (in)sensitive constructed-response achievement items, then characterized features of those items and their corresponding scoring rubrics. First, we used simulation to evaluate an item step difficulty difference index for constructed-response test items, derived from the generalized partial credit model. The statistical performance of the index was adequate, so we then applied it to data from 32 constructed-response eighth-grade science test items. We found that the instructional sensitivity (IS) index values varied appreciably across the category boundaries within an item as well as across items. Content analysis by master science teachers allowed us to identify general features of item score categories that show high, or negligible, IS.
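
The abstract names the generalized partial credit model (GPCM) and an item step difficulty difference index but does not give formulas. The sketch below is only an illustration under stated assumptions: it uses the standard GPCM category probabilities and assumes, hypothetically, that the index contrasts each step difficulty estimated for students who did not receive instruction on the tested content with the one estimated for students who did. The function names, the grouping, and the parameter values are invented for illustration and are not taken from the study.

```python
import math


def gpcm_probs(theta, a, steps):
    """Category probabilities P(X = 0..m) under the generalized partial credit model.

    theta: test-taker ability; a: item discrimination;
    steps: step difficulties b_1..b_m, one per category boundary.
    """
    # Cumulative logits: 0 for category 0, then running sums of a * (theta - b_v).
    logits = [0.0]
    for b in steps:
        logits.append(logits[-1] + a * (theta - b))
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - peak) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]


def step_difficulty_differences(steps_not_taught, steps_taught):
    """Hypothetical index: per-boundary step difficulty gap between two groups.

    A positive value means the step is easier for the group that received
    instruction on the content (an assumption of this sketch).
    """
    if len(steps_not_taught) != len(steps_taught):
        raise ValueError("both groups need the same number of category boundaries")
    return [b0 - b1 for b0, b1 in zip(steps_not_taught, steps_taught)]


# Made-up parameters for one item scored 0, 1, or 2.
a = 1.0
steps_not_taught = [0.5, 1.5]
steps_taught = [-0.25, 1.5]

diffs = step_difficulty_differences(steps_not_taught, steps_taught)
probs_not_taught = gpcm_probs(0.0, a, steps_not_taught)
probs_taught = gpcm_probs(0.0, a, steps_taught)

print("step difficulty differences:", diffs)
print("P(score) without instruction:", [round(p, 3) for p in probs_not_taught])
print("P(score) with instruction:   ", [round(p, 3) for p in probs_taught])
```

In this toy item only the first category boundary differs between the groups, which mirrors the abstract's observation that index values can vary across the category boundaries within a single item.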
