Nnodim J O
Department of Anatomy, College of Medical Sciences, University of Benin, Nigeria.
Med Educ. 1992 Jul;26(4):301-9. doi: 10.1111/j.1365-2923.1992.tb00173.x.
An analysis of 596 multiple-choice questions (MCQs) on human anatomy given at three First Professional Examinations for medical students is reported. The MCQ paper at each examination was 200 items long and consisted of three item-types: A, K and T/F. Each A-type item comprised a stem and five options, only one of the latter being the correct or best answer. Items of the K-type consisted of a stem and four responses, any number of which may be correct. The T/F items were of the three-response kind, the available options being 'true', 'false' and 'don't know'. Test reliability was computed by internal analysis, using the Kuder-Richardson 20 formula. Measures of concurrent validity were obtained by correlating the scores in the MCQ papers with the overall outcome of the First Professional Examination. Indices of item facility, discrimination and abstention were calculated. The effects of item-type and the availability of the 'don't know' option on examinee performance were also determined. Reliability (alpha) and concurrent validity (Pearson r) coefficients in the ranges of 0.71-0.85 and 0.80-0.93 (P < 0.05) respectively were recorded. Regression analysis revealed the MCQ papers to be less sensitive predictors of the aggregate performance than the essay papers. The proportion of highly discriminatory and excessively difficult items was highest for the K-type. When the same K-type questions were re-exhibited in the indeterminate format, the examinees performed significantly better. Higher scores were also recorded when candidates were required to respond to all the questions than when they were offered the 'don't know' option and the percentage gain was higher for the low-scoring examinees. The appropriateness of multiple-choice testing as a tool for assessing student achievement in human anatomy is discussed.
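The abstract names the Kuder-Richardson 20 formula and indices of item facility and discrimination without showing how they are computed. The following is a minimal sketch of the standard textbook definitions, not the author's actual procedure. It assumes each item is scored dichotomously (1 = correct, 0 = incorrect or abstained), uses the population variance of total scores, and computes discrimination as an upper-lower group difference with a 27% cut-off; the abstract specifies none of these choices. The function names and the sample data are hypothetical.

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson 20 reliability for a matrix of 0/1 item scores.

    responses: list of examinee rows, each a list of 0/1 scores per item.
    Assumption: population variance of total scores is used.
    """
    n_examinees = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    var_total = pvariance(totals)
    if n_items < 2 or var_total == 0:
        raise ValueError("need at least two items and non-zero score variance")
    # Sum of p*q over items, where p is the proportion answering correctly.
    pq_sum = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_examinees
        pq_sum += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - pq_sum / var_total)


def item_facility(responses, j):
    """Proportion of examinees who answered item j correctly."""
    return sum(row[j] for row in responses) / len(responses)


def discrimination_index(responses, j, fraction=0.27):
    """Upper-lower discrimination index for item j.

    Assumption: groups are the top and bottom `fraction` of examinees
    ranked by total score; the abstract does not state the method used.
    """
    ranked = sorted(responses, key=sum, reverse=True)
    k = max(1, round(len(ranked) * fraction))
    upper, lower = ranked[:k], ranked[-k:]
    return (sum(r[j] for r in upper) - sum(r[j] for r in lower)) / k


# Hypothetical data: five examinees, four items.
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(scores), 3))
print(item_facility(scores, 0))
print(discrimination_index(scores, 0))
```

For dichotomously scored items, KR-20 coincides with Cronbach's alpha, which is consistent with the abstract reporting the reliability coefficients as alpha.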