• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在临床能力考核中,标准化病人病例评分及标准设定简单方法的有效性。

Validity of simple approach to scoring and standard setting for standardized-patient cases in an examination of clinical competence.

作者信息

Travis T A, Colliver J A, Robbs R S, Barnhart A J, Barrows H S, Giannone L, Henkle J Q, Kelly D P, Nichols-Johnson V, Rabinovitch S, Ramsey D E, Riseman J, Rockey P H, Ross D S, Schrage J P, Steward D E

机构信息

SIU School of Medicine, Springfield, IL 62794-9230, USA.

出版信息

Acad Med. 1996 Jan;71(1 Suppl):S84-6. doi: 10.1097/00001888-199601000-00051.

DOI:10.1097/00001888-199601000-00051
PMID:8546793
Abstract

The results are disappointing, providing little support for the validity of the case-passing decisions based on this simple approach to scoring and standard setting. The case-passing decisions predicted what the case author intended for about only 73% or 74% of the students on average and, with agreement expected by chance removed, predicted what the case author intended for about only 25% of the students. Even with the use of the optimal pass/fail cutoffs and the dropping of students with ambiguous borderline global ratings, the case-passing decisions failed to agree with the case authors' global ratings for 15% to 30% of the students. The findings might be dismissed as simply due to low reliabilities of passing decisions and global ratings based on a single case. Although this concern would apply to intercase reliabilities, which would be subject to case specificity, the appropriate reliabilities here would seem to be intracase (i.e., intrarater), which should be fairly high (if they could be computed). Nevertheless, it seems reasonable to expect much better agreement between results of case scoring and of standard setting developed by the case author and the case author's global ratings of performance on that case, given that the case author might recall the checklist, assign a weight to each item, and so forth. Also, case-passing decisions would possibly agree more with global ratings of live or videotaped performances than with ratings of written summaries of performance; however, that question remains a challenge for further research. In conclusion, the study provides only weak evidence, at best, for the validity of the scoring and standard setting commonly used with SP assessment. The results do not undermine claims about the realism of the SP approach, however, nor do they call into question the standardization afforded by this method of assessing clinical competence. The results do raise serious concerns about this simple approach to scoring and standard setting for SP-based assessments and suggest that we should focus more on the observation and evaluation of actual student performance on SP cases in the development of valid scoring and standard setting.

摘要

结果令人失望,几乎无法支持基于这种简单评分和标准设定方法做出的病例通过决策的有效性。病例通过决策平均仅能预测约73%或74%的学生的病例撰写者意图,去除偶然一致性后,仅能预测约25%的学生的病例撰写者意图。即使使用最优的通过/不通过临界值并剔除整体评分模糊的学生,病例通过决策仍有15%至30%的学生与病例撰写者的整体评分不一致。这些发现可能会被简单地归结为基于单个病例的通过决策和整体评分的可靠性较低。尽管这种担忧适用于病例间的可靠性,因为其会受到病例特异性的影响,但这里合适的可靠性似乎是病例内(即评分者内)的,本应相当高(如果可以计算的话)。然而,鉴于病例撰写者可能会回忆起检查表、为每个项目赋予权重等等,病例评分结果与病例撰写者制定的标准设定以及病例撰写者对该病例表现的整体评分之间应该有更好的一致性,这似乎是合理的。此外,病例通过决策可能与现场或录像表现的整体评分比与书面表现总结的评分更一致;然而,这个问题仍是进一步研究的挑战。总之,该研究充其量仅为SP评估中常用的评分和标准设定的有效性提供了微弱证据。然而,这些结果并未削弱关于SP方法现实性的主张,也没有质疑这种临床能力评估方法所提供的标准化。这些结果确实引发了对基于SP评估的这种简单评分和标准设定方法的严重担忧,并表明在制定有效的评分和标准设定时,我们应更多地关注对学生在SP病例上实际表现的观察和评估。

相似文献

1
Validity of simple approach to scoring and standard setting for standardized-patient cases in an examination of clinical competence.在临床能力考核中,标准化病人病例评分及标准设定简单方法的有效性。
Acad Med. 1996 Jan;71(1 Suppl):S84-6. doi: 10.1097/00001888-199601000-00051.
2
Validating the standardized-patient assessment administered to medical students in the New York City Consortium.验证纽约市联盟对医学生进行的标准化患者评估。
Acad Med. 1997 Jul;72(7):619-26. doi: 10.1097/00001888-199707000-00014.
3
Using a receiver operating characteristic (ROC) analysis to set passing standards for a standardized-patient examination of clinical competence.使用受试者工作特征(ROC)分析来设定临床能力标准化患者检查的及格标准。
Acad Med. 1994 Oct;69(10 Suppl):S37-9. doi: 10.1097/00001888-199410000-00035.
4
Consistency of pass-fail decisions made with clinical clerkship ratings and standardized-patient examination scores.
Acad Med. 1994 Oct;69(10 Suppl):S40-1. doi: 10.1097/00001888-199410000-00036.
5
Psychometric properties of a standardized-patient checklist and rating-scale form used to assess interpersonal and communication skills.用于评估人际和沟通技能的标准化患者检查表及评分量表形式的心理测量特性。
Acad Med. 1996 Jan;71(1 Suppl):S87-9. doi: 10.1097/00001888-199601000-00052.
6
Six years of comprehensive, clinical, performance-based assessment using standardized patients at the Southern Illinois University School of Medicine.南伊利诺伊大学医学院利用标准化病人进行了为期六年的全面、基于临床和表现的评估。
Acad Med. 1992 Jan;67(1):42-50. doi: 10.1097/00001888-199201000-00009.
7
Referral for competency committee review for poor performance on the internal medicine clerkship is associated with poor performance in internship.因在内科实习中表现不佳而被转介至能力评估委员会审查,这与在实习期间的不良表现相关。
Mil Med. 2015 Apr;180(4 Suppl):71-6. doi: 10.7205/MILMED-D-14-00575.
8
Diagnosing technical competence in six bedside procedures: comparing checklists and a global rating scale in the assessment of resident performance.六种床边操作技术能力的诊断:在住院医师操作评估中比较清单法和整体评分量表
Acad Med. 2015 Aug;90(8):1100-8. doi: 10.1097/ACM.0000000000000704.
9
Setting standards and defining quality of performance in the validation of a standardized-patient examination format.在标准化患者检查形式的验证中设定标准并定义性能质量。
Acad Med. 1997 Nov;72(11):1012-4. doi: 10.1097/00001888-199711000-00022.
10
The reliability of the pass/fail decision for assessments comprised of multiple components.由多个部分组成的评估中通过/失败判定的可靠性。
GMS Z Med Ausbild. 2015 Oct 15;32(4):Doc42. doi: 10.3205/zma000984. eCollection 2015.