Crowdsourced Assessment of Surgical Skill Proficiency in Cataract Surgery.

Author information

Department of Ophthalmology and Visual Sciences, Washington University School of Medicine, Saint Louis, Missouri.

Graduate Medical Education, University of Minnesota, Minneapolis, Minnesota.

Publication information

J Surg Educ. 2021 Jul-Aug;78(4):1077-1088. doi: 10.1016/j.jsurg.2021.02.004. Epub 2021 Feb 25.

DOI:10.1016/j.jsurg.2021.02.004

PMID:33640326

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8217126/

Abstract

OBJECTIVE

To test whether crowdsourced lay raters can accurately assess cataract surgical skills.

DESIGN

Two-armed study: independent cross-sectional and longitudinal cohorts.

SETTING

Washington University Department of Ophthalmology.

PARTICIPANTS AND METHODS

Sixteen cataract surgeons with varying experience levels submitted cataract surgery videos to be graded by 5 experts and 300+ crowdworkers masked to surgeon experience. Cross-sectional study: 50 videos from surgeons ranging from first-year resident to attending physician, pooled by years of training. Longitudinal study: 28 videos obtained at regular intervals as residents progressed through 180 cases. Surgical skill was graded using the modified Objective Structured Assessment of Technical Skill (mOSATS). Main outcome measures were overall technical performance, reliability indices, and correlation between expert and crowd mean scores.

RESULTS

Experts demonstrated high interrater reliability and accurately predicted training level, establishing construct validity for the modified OSATS. Crowd scores correlated with expert scores (r = 0.865, p < 0.0001) but were consistently higher than expert scores for first-, second-, and third-year residents (p < 0.0001, paired t-test). Surgery duration correlated negatively with training level (r = -0.855, p < 0.0001) and with expert score (r = -0.927, p < 0.0001). The longitudinal dataset reproduced the cross-sectional findings for crowd and expert comparisons. A regression equation transforming crowd score plus video length into expert score was derived from the cross-sectional dataset (r = 0.92) and predicted expert scores well when applied to the independent longitudinal dataset (r = 0.80). A group of student raters who had edited the cataract videos also graded them, and their scores approximated expert scores more closely than the crowd's scores did.
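The regression step described above (fit on one cohort, check on an independent one) can be illustrated with a minimal sketch. The abstract does not report the equation's coefficients or the raw scores, so everything below is an assumption made for illustration: the data are synthetic, the names `fit_ols`, `predict`, `pearson_r`, and `make_synthetic` are hypothetical, and an ordinary least-squares fit with an intercept is only one plausible form for such a model.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    n = 3
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ols(crowd, minutes, expert):
    """Least-squares fit of expert ~ b0 + b1*crowd + b2*minutes (normal equations)."""
    rows = [(1.0, c, t) for c, t in zip(crowd, minutes)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, expert)) for i in range(3)]
    return solve3(xtx, xty)


def predict(coef, crowd, minutes):
    """Apply the fitted coefficients to new crowd scores and video lengths."""
    b0, b1, b2 = coef
    return [b0 + b1 * c + b2 * t for c, t in zip(crowd, minutes)]


def make_synthetic(n, seed):
    """Synthetic stand-in data (NOT study data): a latent skill drives all three values."""
    rng = random.Random(seed)
    crowd, minutes, expert = [], [], []
    for _ in range(n):
        skill = rng.random()  # 0 = novice, 1 = expert (assumed scale)
        expert.append(1.0 + 4.0 * skill + rng.gauss(0, 0.3))
        crowd.append(3.0 + 2.0 * skill + rng.gauss(0, 0.3))  # inflated for novices
        minutes.append(40.0 - 25.0 * skill + rng.gauss(0, 3.0))  # slower when novice
    return crowd, minutes, expert


if __name__ == "__main__":
    # Fit on a 50-video "derivation" set, then check on an independent 28-video set.
    coef = fit_ols(*make_synthetic(50, seed=1))
    test_crowd, test_minutes, test_expert = make_synthetic(28, seed=2)
    r = pearson_r(predict(coef, test_crowd, test_minutes), test_expert)
    print("coefficients (b0, b1, b2):", [round(b, 3) for b in coef])
    print("held-out Pearson r:", round(r, 3))
```

The set sizes mirror the 50 cross-sectional and 28 longitudinal videos only to make the derive-then-validate design concrete; the printed correlation reflects the synthetic generator and says nothing about the study's actual r = 0.92 and r = 0.80.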

CONCLUSIONS

Crowdsourced rankings correlated with expert scores, but were not equivalent; crowd scores overestimated technical competency, especially for novice surgeons. A novel approach of adjusting crowd scores with surgery duration generated a more accurate predictive model for surgical skill. More studies are needed before crowdsourcing can be reliably used for assessing surgical proficiency.

Similar articles

A study of crowdsourced segment-level surgical skill assessment using pairwise rankings.

Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1435-47. doi: 10.1007/s11548-015-1238-6. Epub 2015 Jun 30.

Vessel Ligation Fundamentals: A Comparison of Technical Evaluations by Crowdsourced Nonclinical Personnel and Surgical Faculty.

J Surg Educ. 2018 May-Jun;75(3):664-670. doi: 10.1016/j.jsurg.2017.09.030. Epub 2017 Dec 15.

Crowdsourced Assessment of Ureteroscopy with Laser Lithotripsy Video Feed Does Not Correlate with Trainee Experience.

J Endourol. 2019 Jan;33(1):42-49. doi: 10.1089/end.2018.0534. Epub 2018 Dec 22.

Crowd-sourced assessment of technical skills: an adjunct to urology resident surgical simulation training.

J Endourol. 2015 May;29(5):604-9. doi: 10.1089/end.2014.0616. Epub 2015 Jan 7.

Crowd-Sourced and Attending Assessment of General Surgery Resident Operative Performance Using Global Ratings Scales.

J Surg Educ. 2020 Nov-Dec;77(6):e214-e219. doi: 10.1016/j.jsurg.2020.07.011. Epub 2020 Oct 8.

Crowdsourcing: a valid alternative to expert evaluation of robotic surgery skills.

Am J Obstet Gynecol. 2016 Nov;215(5):644.e1-644.e7. doi: 10.1016/j.ajog.2016.06.033. Epub 2016 Jun 27.

Crowd-Sourced Assessment of Technical Skill: A Valid Method for Discriminating Basic Robotic Surgery Skills.

J Endourol. 2015 Nov;29(11):1295-301. doi: 10.1089/end.2015.0191. Epub 2015 Aug 24.

C-SATS: Assessing Surgical Skills Among Urology Residency Applicants.

J Endourol. 2017 Apr;31(S1):S95-S100. doi: 10.1089/end.2016.0569. Epub 2016 Oct 11.

Crowdsourcing Assessment of Surgeon Dissection of Renal Artery and Vein During Robotic Partial Nephrectomy: A Novel Approach for Quantitative Assessment of Surgical Performance.

J Endourol. 2016 Apr;30(4):447-52. doi: 10.1089/end.2015.0665. Epub 2015 Dec 30.

Cited by

Dedicated Chalazion Clinic as a Tool for Early Surgical Education in Ophthalmology Residency.

J Acad Ophthalmol (2017). 2023 Jan 28;15(1):e36-e40. doi: 10.1055/s-0043-1761275. eCollection 2023 Jan.

Fluoroscopic image-based behavior analysis can objectively explain subjective expert assessment of wire navigation skill.

J Orthop Res. 2024 Feb;42(2):404-414. doi: 10.1002/jor.25685. Epub 2023 Sep 24.

Using the language of surgery to enhance ophthalmology surgical education.

Surg Open Sci. 2023 Jul 14;14:52-59. doi: 10.1016/j.sopen.2023.07.002. eCollection 2023 Aug.

A Virtual Reading Center Model Using Crowdsourcing to Grade Photographs for Trachoma: Validation Study.

J Med Internet Res. 2023 Apr 6;25:e41233. doi: 10.2196/41233.
