Suppr超能文献

违反标准试题编写原则对考试及学生的影响:医学教育中使用有缺陷的试题对成绩考试的后果。

The effects of violating standard item writing principles on tests and students: the consequences of using flawed test items on achievement examinations in medical education.

作者信息

Downing Steven M

机构信息

Department of Medical Education (MC 591), College of Medicine, University of Illinois at Chicago, 60612-7309, USA.

出版信息

Adv Health Sci Educ Theory Pract. 2005;10(2):133-43. doi: 10.1007/s10459-004-4019-5.

Abstract

The purpose of this research was to study the effects of violations of standard multiple-choice item writing principles on test characteristics, student scores, and pass-fail outcomes. Four basic science examinations, administered to year-one and year-two medical students, were randomly selected for study. Test items were classified as either standard or flawed by three independent raters, blinded to all item performance data. Flawed test questions violated one or more standard principles of effective item writing. Thirty-six to sixty-five percent of the items on the four tests were flawed. Flawed items were 0-15 percentage points more difficult than standard items measuring the same construct. Over all four examinations, 646 (53%) students passed the standard items while 575 (47%) passed the flawed items. The median passing rate difference between flawed and standard items was 3.5 percentage points, but ranged from -1 to 35 percentage points. Item flaws had little effect on test score reliability or other psychometric quality indices. Results showed that flawed multiple-choice test items, which violate well established and evidence-based principles of effective item writing, disadvantage some medical students. Item flaws introduce the systematic error of construct-irrelevant variance to assessments, thereby reducing the validity evidence for examinations and penalizing some examinees.

摘要

本研究的目的是探讨违反标准多项选择题编写原则对考试特性、学生成绩及及格与否结果的影响。随机选取了针对一年级和二年级医学生进行的四项基础科学考试进行研究。由三位独立评分者将试题分类为标准试题或有缺陷试题,评分者对所有试题的表现数据均不知情。有缺陷的试题违反了一项或多项有效试题编写的标准原则。四项考试中36%至65%的试题存在缺陷。与测量相同内容的标准试题相比,有缺陷的试题难度高出0至15个百分点。在所有四项考试中,646名(53%)学生通过了标准试题,而575名(47%)学生通过了有缺陷的试题。有缺陷试题和标准试题的及格率中位数差异为3.5个百分点,但范围在-1至35个百分点之间。试题缺陷对考试分数的可靠性或其他心理测量质量指标影响不大。结果表明,违反成熟且有证据支持的有效试题编写原则的有缺陷多项选择题,对一些医学生不利。试题缺陷会给评估引入与内容无关的系统性误差,从而降低考试的效度证据,并对一些考生造成不利影响。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验