The Inter-Rater Reliability of Technical Skills Assessment and Retention of Rater Training.

Affiliations

Division of General Surgery, Department of Surgery, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada; The Ottawa Hospital, Ottawa, Ontario, Canada; Department of Innovation in Medical Education (DIME), University of Ottawa, Ottawa, Ontario, Canada.

Division of General Surgery, Department of Surgery, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada; The Ottawa Hospital, Ottawa, Ontario, Canada.

Publication information

J Surg Educ. 2019 Jul-Aug;76(4):1088-1093. doi: 10.1016/j.jsurg.2019.01.001. Epub 2019 Jan 29.

DOI:10.1016/j.jsurg.2019.01.001

PMID:30709756

Abstract

BACKGROUND

The inter-rater reliability (IRR) of laparoscopic skills assessment is usually determined in the context of motivated raters from a single subspecialty practice group with significant experience using similar tools. The purpose of this study was to determine the IRR among attending surgeons of different experience and practices, the extent of rater training that is necessary to achieve good IRR, and whether rater training is retained over periods of nonuse.

METHODS

In Part 1, 5 surgeons of different practice backgrounds assessed 3 laparoscopic cholecystectomy videos using the Global Operative Assessment of Laparoscopic Skills instrument. In Part 2, 2 of the surgeons assessed a total of 33 videos over 5 scoring sessions distributed across 6 months. They participated in 2 different training sessions, and retention was tested in the other 3 sessions. IRR was calculated for Parts 1 and 2 with an intraclass correlation (ICC) in a 2-way random-effects model.
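
As an illustration of the statistic named above, the following is a minimal sketch of one common 2-way random-effects ICC, the single-rater, absolute-agreement form often written ICC(2,1). The abstract does not state whether single or average measures were used, nor whether agreement or consistency was assessed, so that choice is an assumption here. The function name `icc_2_1` and the example scores are hypothetical and are not the study's data.

```python
def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    scores: list of rows, one row per rated video, one column per rater.
    Assumes a complete table (every rater scored every video).
    """
    n = len(scores)       # number of videos (targets)
    k = len(scores[0])    # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: videos (rows), raters (columns), residual
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


# Hypothetical total scores from 2 raters on 5 videos (illustrative only)
example_scores = [
    [18, 16],
    [12, 15],
    [21, 20],
    [9, 13],
    [15, 14],
]
print(round(icc_2_1(example_scores), 2))
```

The estimate is not bounded below by zero: when the residual mean square exceeds the between-video mean square, the numerator is negative, which is how a value such as the ICC of -0.17 reported for scoring #2 can arise.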

RESULTS

The ICC for Part 1 was poor (ICC = 0.26). In Part 2, the ICC was highest after each training session (scoring #1 ICC = 0.76, scoring #3 ICC = 0.74). The ICC was not retained 1.5 months after the brief video-based training session (scoring #2 ICC = -0.17). The ICC was retained 2.5 months after the in-depth discussion training session (scoring #4 ICC = 0.70), but not 4.5 months later (scoring #5 ICC = 0.04).

CONCLUSIONS

Good IRR is not implicit among surgeons with varying backgrounds and experience. Good IRR can be achieved with different types of rater training, but the impact of rater training is lost in periods of nonuse. This suggests the need for further study of the IRR of technical skills assessment when performed by the wide variety of surgeon raters as is commonly encountered in the environment of postgraduate resident assessment.

Similar articles

Training less-experienced faculty improves reliability of skills assessment in cardiac surgery.

J Thorac Cardiovasc Surg. 2014 Dec;148(6):2491-6.e1-2. doi: 10.1016/j.jtcvs.2014.09.017. Epub 2014 Sep 16.

Integration of Hands-On Team Training into Existing Curriculum Improves Both Technical and Nontechnical Skills in Laparoscopic Cholecystectomy.

J Surg Educ. 2017 Nov-Dec;74(6):915-920. doi: 10.1016/j.jsurg.2017.05.007. Epub 2017 May 26.

Expanded Access to Video-Based Laparoscopic Skills Assessments: Ease, Reliability, and Accuracy.

J Surg Educ. 2024 Jun;81(6):850-857. doi: 10.1016/j.jsurg.2024.03.010. Epub 2024 Apr 24.

Validity and reliability of global operative assessment of laparoscopic skills (GOALS) in novice trainees performing a laparoscopic cholecystectomy.

J Surg Educ. 2015 Mar-Apr;72(2):351-8. doi: 10.1016/j.jsurg.2014.08.006. Epub 2014 Oct 16.

Does training novices to criteria and does rapid acquisition of skills on laparoscopic simulators have predictive validity or are we just playing video games?

J Surg Educ. 2008 Nov-Dec;65(6):431-5. doi: 10.1016/j.jsurg.2008.05.008.

Boot cAMP: educational outcomes after 4 successive years of preparatory simulation-based training at onset of internship.

J Surg Educ. 2012 Mar-Apr;69(2):242-8. doi: 10.1016/j.jsurg.2011.08.007.

Surgical quality assessment of critical view of safety in 283 laparoscopic cholecystectomy videos by surgical residents and surgeons.

Surg Endosc. 2024 Jul;38(7):3609-3614. doi: 10.1007/s00464-024-10873-0. Epub 2024 May 20.

Developing an Objective Structured Assessment of Technical Skills for Laparoscopic Suturing and Intracorporeal Knot Tying.

J Surg Educ. 2016 Mar-Apr;73(2):258-63. doi: 10.1016/j.jsurg.2015.10.006. Epub 2015 Nov 16.

Assessment of the Non-Technical Skills for Surgeons (NOTSS) framework in the USA.

Br J Surg. 2020 Aug;107(9):1137-1144. doi: 10.1002/bjs.11607. Epub 2020 Apr 23.

Cited by

Multimodal Assessment in Clinical Simulations: A Guide for Moving Towards Precision Education.

Med Sci Educ. 2024 Nov 19;35(2):1025-1034. doi: 10.1007/s40670-024-02221-7. eCollection 2025 Apr.

Development and Evaluation of a Proficiency-based and Simulation-based Surgical Skills Training for Technical Medicine Students.

MedEdPublish (2016). 2020 Dec 17;9:284. doi: 10.15694/mep.2020.000284.1. eCollection 2020.

Surgical experience and identification of errors in laparoscopic cholecystectomy.

Br J Surg. 2023 Oct 10;110(11):1535-1542. doi: 10.1093/bjs/znad256.

Assessment of laparoscopic skills: comparing the reliability of global rating and entrustability tools.

Can Med Educ J. 2022 Nov 15;13(6):36-45. doi: 10.36834/cmej.72369. eCollection 2022 Nov.

Effect of moderation on rubric criteria for inter-rater reliability in an objective structured clinical examination with real patients.

Fujita Med J. 2022 Aug;8(3):83-87. doi: 10.20407/fmj.2021-010. Epub 2021 Nov 25.

Development of a median sternotomy simulation model for cardiac surgery training.

JTCVS Tech. 2020 Apr 5;2:109-116. doi: 10.1016/j.xjtc.2020.03.007. eCollection 2020 Jun.
