Tekian Ara, Norcini John
University of Illinois, College of Medicine at Chicago, Chicago, USA.
Foundation for Advancement of International Medical Education and Research , Philadelphia, USA.
GMS Z Med Ausbild. 2015 Oct 15;32(4):Doc43. doi: 10.3205/zma000985. eCollection 2015.
It is not unusual for institutions around the world to have fixed standards (e.g., 60%) for all of their examinations. This creates problems in the creation of examinations, since all of the content has to be chosen with an eye toward this fixed standard. As a result, the validity of the decisions based on these examinations can be adversely influenced, making them less useful for their intended purposes. Over the past several decades, many institutions have addressed this problem by using standard setting methods which are defensible, acceptable, and credible [1], [2]. Many methods are available and the major reasons to use them is to ensure that test content is appropriately selected and to be as fair to the students and other test users as possible [2], [3]. One barrier to the wider use of these methods is that some institutions object to the fact that the fixed standard (e.g., 60%) has not been applied. However, it is possible to rescale the passing score so that it is equal to the fixed standard, and then apply that same rescaling calculation to all of the test scores. This ensures that the institutional guidelines are not violated and allows the application of accepted methods of standard-setting. In turn, the application of these methods allow the content of the test to be selected without regard to a fixed standard, increases the validity of the decisions being made, and ensures a fairer and more accurate test of students.
世界各地的机构对其所有考试都设定固定标准(例如60%)的情况并不罕见。这在考试命题过程中会产生问题,因为所有考试内容都必须依据这个固定标准来选择。结果,基于这些考试所做决策的有效性可能会受到不利影响,使其对预期目的的作用降低。在过去几十年里,许多机构通过使用合理、可接受且可信的标准设定方法来解决这个问题[1][2]。有多种方法可供使用,使用这些方法的主要原因是确保考试内容得到恰当选择,并尽可能公平地对待学生和其他考试使用者[2][3]。这些方法未能更广泛应用的一个障碍是,一些机构反对未采用固定标准(例如60%)这一事实。然而,可以重新调整及格分数,使其等于固定标准,然后将相同的重新调整计算应用于所有考试成绩。这确保了不违反机构准则,并允许应用公认的标准设定方法。反过来,应用这些方法可以在不考虑固定标准的情况下选择考试内容,提高所做决策的有效性,并确保对学生进行更公平、更准确的测试。