• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

脚本一致性测试分数的使用和解释中的有效性威胁。

Threats to validity in the use and interpretation of script concordance test scores.

机构信息

Department of Medical Education, University of Illinois at Chicago, Chicago, Illinois, USA.

出版信息

Med Educ. 2013 Dec;47(12):1175-83. doi: 10.1111/medu.12283.

DOI:10.1111/medu.12283
PMID:24206151
Abstract

CONTEXT

Recent reviews have claimed that the script concordance test (SCT) methodology generally produces reliable and valid assessments of clinical reasoning and that the SCT may soon be suitable for high-stakes testing.

OBJECTIVES

This study is intended to describe three major threats to the validity of the SCT not yet considered in prior research and to illustrate the severity of these threats.

METHODS

We conducted a review of SCT reports available through the Web of Science database. Additionally, we reanalysed scores from a previously published SCT administration to explore issues related to standard SCT scoring practice.

RESULTS

Firstly, the predominant method for aggregate and partial credit scoring of SCTs introduces logical inconsistencies in the scoring key. Secondly, our literature review shows that SCT reliability studies have generally ignored inter-panel, inter-panellist and test-retest measurement error. Instead, studies have focused on observed levels of coefficient alpha, which is neither an informative index of internal structure nor a comprehensive index of reliability for SCT scores. As such, claims that SCT scores show acceptable reliability are premature. Finally, SCT criteria for item inclusion, in concert with a statistical artefact of the SCT format, cause anchors at the extremes of the scale to have less expected credit than anchors near or at the midpoint. Consequently, SCT scores are likely to reflect construct-irrelevant differences in examinees' response styles. This makes the test susceptible to bias against candidates who endorse extreme scale anchors more readily; it also makes two construct-irrelevant test taking strategies extremely effective. In our reanalysis, we found that examinees could drastically increase their scores by never endorsing extreme scale points. Furthermore, examinees who simply endorsed the scale midpoint for every item would still have outperformed most examinees who used the scale as it is intended.

CONCLUSIONS

Given the severity of these threats, we conclude that aggregate scoring of SCTs cannot be recommended. Recommendations for revisions of SCT methodology are discussed.

摘要

背景

最近的评论声称,脚本一致性测试(SCT)方法通常可以对临床推理进行可靠和有效的评估,并且 SCT 可能很快就适用于高风险测试。

目的

本研究旨在描述三个尚未在先前研究中考虑到的 SCT 有效性的主要威胁,并说明这些威胁的严重性。

方法

我们对 Web of Science 数据库中可用的 SCT 报告进行了审查。此外,我们重新分析了先前发表的 SCT 管理分数,以探讨与标准 SCT 评分实践相关的问题。

结果

首先,SCT 的总分数和部分分数的主要方法在评分键中引入了逻辑不一致。其次,我们的文献综述表明,SCT 可靠性研究通常忽略了面板间、面板内和测试重测测量误差。相反,研究集中在观察到的系数 alpha 水平上,这既不是内部结构的信息指标,也不是 SCT 分数可靠性的综合指标。因此,声称 SCT 分数具有可接受的可靠性还为时过早。最后,SCT 的项目纳入标准与 SCT 格式的统计伪影相结合,导致量表极值的锚点比接近或处于中点的锚点获得的预期信用较少。因此,SCT 分数可能反映了考生在反应风格上与测试无关的差异。这使得测试容易受到对更愿意支持极端量表锚点的考生的偏见;它还使两种与测试无关的测试策略非常有效。在我们的重新分析中,我们发现考生可以通过从不支持极端量表点来大幅提高他们的分数。此外,对于每个项目仅支持量表中点的考生,仍将比大多数按预期使用量表的考生表现更好。

结论

鉴于这些威胁的严重性,我们得出结论,不能推荐对 SCT 进行总分数评估。讨论了对 SCT 方法学的修订建议。

相似文献

1
Threats to validity in the use and interpretation of script concordance test scores.脚本一致性测试分数的使用和解释中的有效性威胁。
Med Educ. 2013 Dec;47(12):1175-83. doi: 10.1111/medu.12283.
2
Assessment in the context of uncertainty using the script concordance test: more meaning for scores.使用脚本一致性测试在不确定情况下进行评估:分数更有意义。
Teach Learn Med. 2010 Jul;22(3):180-6. doi: 10.1080/10401334.2010.488197.
3
Use of the script concordance approach to evaluate clinical reasoning in food-ruminant practitioners.使用脚本一致性方法评估反刍动物从业者的临床推理能力。
J Vet Med Educ. 2012 Fall;39(3):267-75. doi: 10.3138/jvme.0112-13R.
4
The script concordance test for clinical reasoning: re-examining its utility and potential weakness.临床推理的脚本一致性测试:重新审视其效用和潜在弱点。
Med Educ. 2014 Nov;48(11):1069-77. doi: 10.1111/medu.12514.
5
Analyzing script concordance test scoring methods and items by difficulty and type.分析脚本一致性测试的评分方法和按难度及类型划分的项目。
Teach Learn Med. 2014;26(2):135-45. doi: 10.1080/10401334.2014.884464.
6
The psychometric properties of five scoring methods applied to the script concordance test.应用于脚本一致性测试的五种评分方法的心理测量特性。
Acad Med. 2005 Apr;80(4):395-9. doi: 10.1097/00001888-200504000-00019.
7
Optimization of answer keys for script concordance testing: should we exclude deviant panelists, deviant responses, or neither?脚本一致性测试答案键的优化:我们应该排除异常测试者、异常答案还是两者都不排除?
Adv Health Sci Educ Theory Pract. 2011 Dec;16(5):601-8. doi: 10.1007/s10459-011-9279-2. Epub 2011 Feb 1.
8
Can script concordance testing be used in nursing education to accurately assess clinical reasoning skills?脚本一致性测试能否用于护理教育以准确评估临床推理能力?
J Nurs Educ. 2014 May 1;53(5):281-6. doi: 10.3928/01484834-20140321-03. Epub 2014 Mar 21.
9
The script concordance test as a measure of clinical reasoning: a national validation study.脚本一致性测试作为临床推理的衡量标准:一项全国性验证研究。
Am J Surg. 2012 Apr;203(4):530-4. doi: 10.1016/j.amjsurg.2011.11.006.
10
Validation of undergraduate medical student script concordance test (SCT) scores on the clinical assessment of the acute abdomen.本科医学生急性腹痛临床评估脚本一致性测试(SCT)分数的验证
BMC Surg. 2016 Aug 17;16(1):57. doi: 10.1186/s12893-016-0173-y.

引用本文的文献

1
Development of script concordance test for assessment of clinical reasoning in nursing: Lessons learned regarding construct validity.用于评估护理临床推理的脚本一致性测试的开发:关于结构效度的经验教训。
Heliyon. 2024 Jul 24;10(15):e35151. doi: 10.1016/j.heliyon.2024.e35151. eCollection 2024 Aug 15.
2
Using ChatGPT in Psychiatry to Design Script Concordance Tests in Undergraduate Medical Education: Mixed Methods Study.在精神病学中使用 ChatGPT 设计本科医学教育中的脚本一致性测试:混合方法研究。
JMIR Med Educ. 2024 Apr 4;10:e54067. doi: 10.2196/54067.
3
Reliability of a script agreement test for undergraduate speech-language therapy students.
本科言语治疗学生脚本协议测试的可靠性。
Codas. 2023 Nov 13;35(5):e20220098. doi: 10.1590/2317-1782/20232022098es. eCollection 2023.
4
Developing Script Concordance Test Items in Otolaryngology to Improve Clinical Reasoning Skills: Validation using Consensus Analysis and Psychometrics.开发耳鼻喉科脚本一致性测试项目以提高临床推理能力:使用共识分析和心理测量学进行验证
Int J Appl Basic Med Res. 2023 Apr-Jun;13(2):64-69. doi: 10.4103/ijabmr.ijabmr_604_22. Epub 2023 Jul 17.
5
Modeling Diagnostic Expertise in Cases of Irreducible Uncertainty: The Decision-Aligned Response Model.建模诊断专长在不可还原不确定性的案例中:决策一致的反应模型。
Acad Med. 2023 Jan 1;98(1):88-97. doi: 10.1097/ACM.0000000000004918. Epub 2022 Aug 9.
6
The Use of Formative Assessment in Postgraduate Urology Training: A Systematic Review.研究生泌尿外科培训中形成性评估的应用:一项系统综述
Cureus. 2022 Jul 22;14(7):e27162. doi: 10.7759/cureus.27162. eCollection 2022 Jul.
7
Script concordance test acceptability and utility for assessing medical students' clinical reasoning: a user's survey and an institutional prospective evaluation of students' scores.脚本一致性测试在评估医学生临床推理能力方面的可接受性和实用性:一项用户调查和对学生成绩的机构前瞻性评估。
BMC Med Educ. 2022 Apr 13;22(1):277. doi: 10.1186/s12909-022-03339-1.
8
Assessment of Emergency Medicine Residents' Clinical Reasoning: Validation of a Script Concordance Test.评估急诊住院医师的临床推理能力:脚本一致性测试的验证。
West J Emerg Med. 2020 Jun 24;21(4):978-984. doi: 10.5811/westjem.2020.3.46035.
9
Assessment methods in medical specialist assessments in the DACH region - overview, critical examination and recommendations for further development.DACH地区医学专家评估中的评估方法——概述、批判性审视及进一步发展建议。
GMS J Med Educ. 2019 Nov 15;36(6):Doc78. doi: 10.3205/zma001286. eCollection 2019.
10
Assessment in the context of problem-based learning.基于问题的学习中的评估。
Adv Health Sci Educ Theory Pract. 2019 Dec;24(5):903-914. doi: 10.1007/s10459-019-09909-1. Epub 2019 Oct 2.