Caldwell David J, Sampognaro Laurel, Pate Adam N
University of Louisiana at Monroe School of Pharmacy.
Am J Pharm Educ. 2015 Aug 25;79(6):87. doi: 10.5688/ajpe79687.
To improve examination item quality by educating and involving course instructors in evidence-based item review and encouraging use of this process in future courses.
A peer-review process was implemented in a 2-course sequence (intervention) that involved training and review sessions before each examination and was compared to the previous year's courses (control). Instructors completed a presurvey and postsurvey on training, experience, self-confidence, and self-rated success in multiple-choice item writing. Statistics were calculated for all items in the control and intervention sequences and compared using independent t tests. Items also were classified into levels based on difficulty and discrimination, and distribution into these levels was compared between sequences with independent t tests.
No significant difference was found between control and intervention sequence items with regard to mean difficulty (86.3% and 84.4%) or discrimination (0.23- and 0.25), respectively, although item classification distribution did appear to change between the control and intervention sequences' subjective feelings of confidence, and success in item writing increased between presurvey and postsurvey. Confidence in ability to peer-review test items and to implement a formal item evaluation process also increased.
Item statistics did not change significantly, but reviewed and edited items distributed more favorably into item statistic-based categories. This method of review positively affected instructors' perceptions of their item-writing confidence and success and improved self-rated opinions of their ability to edit items and train others to do so.
通过教育课程教师并让其参与基于证据的试题审查,并鼓励在未来课程中使用该流程,以提高考试试题质量。
在一个两课程序列(干预组)中实施同行评审流程,该流程包括在每次考试前进行培训和评审会议,并与上一年的课程(对照组)进行比较。教师完成了关于培训、经验、自信心以及在多项选择题编写方面的自评成功率的预调查和后调查。计算了对照组和干预组序列中所有试题的统计数据,并使用独立t检验进行比较。试题还根据难度和区分度进行了分类,并用独立t检验比较了两组序列中这些类别的分布情况。
对照组和干预组序列的试题在平均难度(分别为86.3%和84.4%)或区分度(分别为0.23和0.25)方面未发现显著差异,尽管对照组和干预组序列之间的试题分类分布似乎有所变化,教师在编写试题方面的自信心和成功率在预调查和后调查之间有所提高。对同行评审试题和实施正式试题评估流程的能力的信心也有所增强。
试题统计数据没有显著变化,但经过评审和编辑的试题在基于试题统计的类别中的分布更有利。这种评审方法对教师对其试题编写信心和成功率的看法产生了积极影响,并提高了他们对编辑试题以及培训他人编辑试题能力的自评。