Department of Pediatrics, Section of Critical Care Medicine, Baylor College of Medicine, Houston, TX, USA.
Department of Pediatrics, Center for Research, Innovation and Scholarship in Medical Education, Texas Children's Hospital, Houston, TX, USA.
Med Teach. 2021 Feb;43(2):168-173. doi: 10.1080/0142159X.2020.1830960. Epub 2020 Oct 17.
Assessing learners' competence in diagnostic reasoning is challenging and unstandardized in medical education. We developed a theory-informed, behaviorally anchored rubric, the Assessment of Reasoning Tool (ART), with content and response process validity. This study gathered evidence to support the internal structure and the interpretation of measurements derived from this tool.
We derived a reconstructed version of ART (ART-R) as a 15-item, 5-point Likert scale using the ART domains and descriptors. A psychometric evaluation was performed. We created 18 video variations of learner oral presentations, portraying different performance levels of the ART-R.
152 faculty viewed two videos and rated the learner globally and then using the ART-R. The confirmatory factor analysis showed a favorable comparative fit index = 0.99, root mean square error of approximation = 0.097, and standardized root mean square residual = 0.026. The five domains, hypothesis-directed information gathering, problem representation, prioritized differential diagnosis, diagnostic evaluation, and awareness of cognitive tendencies/emotional factors, had high internal consistency. The total score for each domain had a positive association with the global assessment of diagnostic reasoning.
Our findings provide validity evidence for the ART-R as an assessment tool with five theoretical domains, internal consistency, and association with global assessment.
在医学教育中,评估学习者的诊断推理能力具有挑战性且尚未标准化。我们开发了一种基于理论、具有行为锚定的评分工具,即推理评估工具(ART),具有内容和反应过程的有效性。本研究收集了证据,以支持该工具的内部结构和测量结果的解释。
我们使用 ART 的领域和描述符,推导出了一个 15 项、5 点李克特量表的重构版本 ART-R。进行了心理计量学评估。我们创建了 18 个学习者口头陈述的视频变体,描绘了 ART-R 的不同表现水平。
152 名教师观看了两个视频,并使用 ART-R 对学习者进行了整体评估和分项评估。验证性因子分析显示,良好的比较拟合指数 = 0.99,均方根误差近似值 = 0.097,标准化均方根残差 = 0.026。五个领域,即有针对性的信息收集、问题表示、优先考虑的鉴别诊断、诊断评估和认知倾向/情感因素的意识,具有较高的内部一致性。每个域的总分与诊断推理的整体评估呈正相关。
我们的研究结果为 ART-R 作为一种评估工具提供了有效性证据,该工具具有五个理论领域、内部一致性以及与整体评估的关联。