针对高风险评估，比较技术技能的客观结构化评估与检查表量表的可靠性。

Objective structured assessment of technical skills and checklist scales reliability compared for high stakes assessments.

作者信息

Gallagher Anthony G, O'Sullivan Gerald C, Leonard Gerald, Bunting Brendan P, McGlade Kieran J

机构信息

School of Medicine, University College Cork, Cork, Ireland; National Surgical Training Centre, Royal College of Surgeons in Ireland, Dublin, Ireland.

出版信息

ANZ J Surg. 2014 Jul-Aug;84(7-8):568-73. doi: 10.1111/j.1445-2197.2012.06236.x. Epub 2012 Sep 3.

DOI:10.1111/j.1445-2197.2012.06236.x

PMID:22943748

Abstract

BACKGROUND

The establishment of assessment reliability at the level of the individual trainee is an important attribute of assessment methodologies, particularly for doctors who have been failed. This issue is of particular importance for the process of competence assessment in the USA, UK, Australia and New Zealand.

METHODS

We use data from 19 applicants for higher surgical training in 2008 at the Royal College of Surgeons in Ireland to compare: (i) the objective structured assessment of technical skills (OSATS) method; and (ii) a procedure-specific checklist to assess surgical technical skills in the excision of a sebaceous cyst task by two experienced senior surgeons.

RESULTS

The overall interrater reliability (IRR) of the OSATS assessment as determined by a correlation coefficient was 0.507 (P < 0.03) and 0.67 with coefficient alpha, considerably below the accepted 0.8 level of IRR. The checklist's overall IRR was 0.89. Individually, only five (26%) of the OSATS assessments reached the 0.8 level of IRR in contrast to 18 (95%) of the checklist assessments.

DISCUSSION

We propose binary procedure-based assessment checklists as more reliable assessment instruments with more robust reproducibility.

摘要

背景

在个体培训学员层面建立评估可靠性是评估方法的一个重要属性，对于考核未通过的医生而言尤为如此。在美国、英国、澳大利亚和新西兰，这个问题对于能力评估过程尤为重要。

方法

我们使用了2008年爱尔兰皇家外科医学院19名申请高级外科培训的学员的数据，以比较：（i）客观结构化技术技能评估（OSATS）方法；以及（ii）由两名经验丰富的资深外科医生使用的特定程序检查表，用于评估皮脂腺囊肿切除任务中的手术技术技能。

结果

通过相关系数确定的OSATS评估的总体评分者间信度（IRR）为0.507（P < 0.03），使用α系数时为0.67，远低于公认的0.8的IRR水平。检查表的总体IRR为0.89。单独来看，OSATS评估中只有5项（26%）达到了0.8的IRR水平，而检查表评估中有18项（95%）达到了该水平。

讨论

我们建议基于二元程序的评估检查表作为更可靠的评估工具，具有更强的可重复性。

相似文献

Objective structured assessment of technical skills and checklist scales reliability compared for high stakes assessments.

ANZ J Surg. 2014 Jul-Aug;84(7-8):568-73. doi: 10.1111/j.1445-2197.2012.06236.x. Epub 2012 Sep 3.

Reliability and Validity of 3 Methods of Assessing Orthopedic Resident Skill in Shoulder Surgery.

J Surg Educ. 2016 Nov-Dec;73(6):1020-1025. doi: 10.1016/j.jsurg.2016.04.023. Epub 2016 Jun 3.

Assessing the surgical skills of trainees in the operating theatre: a prospective observational study of the methodology.

Health Technol Assess. 2011 Jan;15(1):i-xxi, 1-162. doi: 10.3310/hta15010.

High-Fidelity Emergency Department Thoracotomy Simulator With Beating-Heart Technology and OSATS Tool Improves Trainee Confidence and Distinguishes Level of Skill.

J Surg Educ. 2018 Sep-Oct;75(5):1357-1366. doi: 10.1016/j.jsurg.2018.02.001. Epub 2018 Feb 26.

Using objective structured assessment of technical skills to evaluate a basic skills simulation curriculum for first-year surgical residents.

J Am Coll Surg. 2009 Sep;209(3):364-370.e2. doi: 10.1016/j.jamcollsurg.2009.05.005. Epub 2009 Jul 9.

Development and preliminary validation of an Objective Structured Assessment of Technical Skills (OSATS) for a partial pulpotomy procedure.

Int Endod J. 2023 Aug;56(8):1011-1021. doi: 10.1111/iej.13938. Epub 2023 Jun 6.

Developing an Objective Structured Assessment of Technical Skills for Laparoscopic Suturing and Intracorporeal Knot Tying.

J Surg Educ. 2016 Mar-Apr;73(2):258-63. doi: 10.1016/j.jsurg.2015.10.006. Epub 2015 Nov 16.

Reliability of results produced through objectively structured assessment of technical skills (OSATS) for endotracheal intubation (ETI).

J Coll Physicians Surg Pak. 2013 Jan;23(1):51-5.

Assessment of resident surgical skills: is testing feasible?

Am J Obstet Gynecol. 2005 Apr;192(4):1331-8; discussion 1338-40. doi: 10.1016/j.ajog.2004.12.068.

Novel method for assessment and selection of trainees for higher surgical training in general surgery.

ANZ J Surg. 2008 Apr;78(4):282-90. doi: 10.1111/j.1445-2197.2008.04439.x.

引用本文的文献

International expert consensus on a structured approach to robotic multiport right hemicolectomy with complete mesocolic excision and intracorporeal anastomosis.

Colorectal Dis. 2025 Aug;27(8):e70197. doi: 10.1111/codi.70197.

Artificial Intelligence and Plastic Surgery Resident Education.

Plast Reconstr Surg Glob Open. 2025 Jul 17;13(7):e6924. doi: 10.1097/GOX.0000000000006924. eCollection 2025 Jul.

European expert consensus on a structured approach to circular stapling anastomosis in minimally invasive left-sided colorectal resection.

Colorectal Dis. 2025 Feb;27(2):e70037. doi: 10.1111/codi.70037.

Development and validation of metrics for a new RAPN training model.

J Robot Surg. 2024 Apr 2;18(1):153. doi: 10.1007/s11701-024-01911-z.

Development and preliminary validation of a new task-based objective procedure-specific assessment of inguinal hernia repair procedural safety.

Surg Endosc. 2024 Mar;38(3):1583-1591. doi: 10.1007/s00464-024-10677-2. Epub 2024 Feb 8.

Discrimination, Reliability, Sensitivity, and Specificity of Robotic Surgical Proficiency Assessment With Global Evaluative Assessment of Robotic Skills and Binary Scoring Metrics: Results From a Randomized Controlled Trial.

Ann Surg Open. 2023 Aug 16;4(3):e307. doi: 10.1097/AS9.0000000000000307. eCollection 2023 Sep.

Learning surgical knot tying and suturing technique - effects of different forms of training in a controlled randomized trial with dental students.

GMS J Med Educ. 2023 Jun 15;40(4):Doc48. doi: 10.3205/zma001630. eCollection 2023.

Global assessment of surgical skills (GASS): validation of a new instrument to measure global technical safety in surgical procedures.

Surg Endosc. 2023 Oct;37(10):7964-7969. doi: 10.1007/s00464-023-10116-8. Epub 2023 Jul 13.

Evaluation of single-stage vision models for pose estimation of surgical instruments.

Int J Comput Assist Radiol Surg. 2023 Dec;18(12):2125-2142. doi: 10.1007/s11548-023-02890-6. Epub 2023 Apr 30.

Validation of Task-Specific Rating Scale for Open Balloon Catheter Arterial Embolectomy: An Assessor-Blinded Quasi-Experimental Pilot Study.

Ann Vasc Dis. 2022 Dec 25;15(4):289-294. doi: 10.3400/avd.oa.22-00047.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

针对高风险评估，比较技术技能的客观结构化评估与检查表量表的可靠性。

Objective structured assessment of technical skills and checklist scales reliability compared for high stakes assessments.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

DISCUSSION

背景

方法

结果

讨论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献