Campos-Outcalt D, Witzke D B, Fulginiti J V
Department of Family and Community Medicine, University of Arizona College of Medicine, Tucson.
Fam Med. 1994 Feb;26(2):85-8.
The family medicine clerkship at the University of Arizona uses three methods to evaluate students. The purpose of this study is to 1) look for possible sources of bias in each method, 2) validate the evaluation methods by correlating them with other measures of academic achievement, and 3) measure the degree of correlation among the three evaluation methods.
The three clerkship evaluation methods are a final exam, clinical evaluations, and problem-solving scores. The clerkship evaluations received by 482 students were correlated with undergraduate grade point averages, MCAT scores, National Board of Medical Examiners (NBME) parts I and II subtest and total scores, gender, age, race, college major, and timing of the clerkship.
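As a rough illustration of what correlating one evaluation score with another measure involves, the following minimal sketch computes a Pearson correlation coefficient. The abstract does not state which correlation statistic was used, so Pearson's r is an assumption here, and the function name `pearson_r` and the score lists are hypothetical, made-up values rather than study data.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations from the means
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for five students (illustration only, not study data)
final_exam = [70, 75, 80, 85, 90]
nbme_part2 = [400, 420, 440, 460, 480]

r = pearson_r(final_exam, nbme_part2)
print(f"r = {r:.2f}")
```

In a study of this kind, the same calculation would be repeated for each pairing of a clerkship evaluation method with an academic measure, each accompanied by a significance test.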
Women and older students scored higher on problem solving, and minority students scored lower on the final exam. Students who took the clerkship in the second half of the year scored higher on the final exam. The final exam score correlated significantly with every standard measure of academic performance except the MCAT quantitative subtest. Problem-solving scores correlated only with NBME part II. The clinical evaluations correlated with five NBME part I subtest scores, three NBME part II subtest scores, and the total scores for both parts I and II. All three evaluation methods correlated significantly with each other.
These analyses highlight the need to examine every evaluation method in use, both to improve its validity and reliability and to find potential biases. In addition, more work is needed to document the predictive validity of clerkship evaluations by correlating them with future clinical performance.