Pusic Martin V, Chiaramonte Robert, Gladding Sophia, Andrews John S, Pecaric Martin R, Boutis Kathy
Division of Education Quality and Analytics, School of Medicine, New York University, New York, NY, USA.
Downstate College of Medicine, State University of New York, New York, NY, USA.
Med Educ. 2015 Aug;49(8):838-46. doi: 10.1111/medu.12774.
Despite calls for the improvement of self-assessment as a basis for self-directed learning, instructional designs that include reflection in practice are uncommon. Using data from a screen-based simulation for learning radiograph interpretation, we present validity evidence for a simple self-monitoring measure and examine how it can complement skill assessment.
Medical students learning ankle radiograph interpretation were given an online learning set of 50 cases which they were asked to classify as 'abnormal' (fractured) or 'normal' and to indicate the degree to which they felt certain about their response (Definitely or Probably). They received immediate feedback on each case. All students subsequently completed two 20-case post-tests: an immediate post-test (IPT), and a delayed post-test (DPT) administered 2 weeks later. We determined the degree to which certainty (Definitely versus Probably) correlated with accuracy of interpretation and how this relationship changed between the tests.
Of 988 students approached, 115 completed both tests. Mean ± SD accuracy scores decreased from 59 ± 17% at the IPT to 53 ± 16% at the DPT (95% confidence interval [CI] for the difference: -2% to -10%). Mean self-assessed certainty did not decrease (rates of Definitely: IPT, 17.6%; DPT, 19.5%; 95% CI for difference: +7.2% to -3.4%). Regression modelling showed that accuracy was positively associated with choosing Definitely over Probably (odds ratio [OR] 1.63, 95% CI 1.27-2.09) and indicated a statistically significant interaction between test timing and certainty (OR 0.72, 95% CI 0.52-0.99); thus, the accuracy of self-monitoring decayed over the retention interval, leaving students relatively overconfident in their abilities.
This study shows that, in medical students learning radiograph interpretation, the development of self-monitoring skills can be measured and should not be assumed to necessarily vary in the same way as the underlying clinical skill.
尽管人们呼吁改进自我评估,将其作为自主学习的基础,但包含实践反思的教学设计并不常见。利用基于屏幕模拟学习X光片解读的数据,我们提供了一种简单自我监测措施的效度证据,并研究它如何补充技能评估。
学习脚踝X光片解读的医学生获得一套包含50个病例的在线学习资料,要求他们将病例分类为“异常”(骨折)或“正常”,并表明他们对自己回答的确定程度(肯定或可能)。他们会收到每个病例的即时反馈。所有学生随后完成两次各含20个病例的后测:即时后测(IPT)和两周后进行的延迟后测(DPT)。我们确定确定程度(肯定与可能)与解读准确性的相关程度,以及这种关系在两次测试之间如何变化。
在邀请参与的988名学生中,115名完成了两次测试。平均±标准差准确性得分从即时后测时的59±17%降至延迟后测时的53±16%(差异的95%置信区间[CI]:-2%至-10%)。平均自我评估的确定程度没有下降(肯定的比例:即时后测,17.6%;延迟后测,19.5%;差异的95%置信区间:+7.2%至-3.4%)。回归模型显示,准确性与选择“肯定”而非“可能”呈正相关(优势比[OR]1.63,95%CI 1.27 - 2.09),并表明测试时间与确定程度之间存在统计学上的显著交互作用(OR 0.72,95%CI 0.52 - 0.99);因此,自我监测的准确性在保留期内下降,使学生对自己的能力相对过度自信。
本研究表明,在学习X光片解读的医学生中,自我监测技能的发展可以被测量,且不应假定其必然与潜在临床技能以相同方式变化。