• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

众包评估白内障手术的手术技能熟练度。

Crowdsourced Assessment of Surgical Skill Proficiency in Cataract Surgery.

机构信息

Department of Ophthalmology and Visual Sciences, Washington University School of Medicine, Saint Louis, Missouri.

Graduate Medical Education, University of Minnesota, Minneapolis, Minnesota.

出版信息

J Surg Educ. 2021 Jul-Aug;78(4):1077-1088. doi: 10.1016/j.jsurg.2021.02.004. Epub 2021 Feb 25.

DOI:10.1016/j.jsurg.2021.02.004
PMID:33640326
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8217126/
Abstract

OBJECTIVE

To test whether crowdsourced lay raters can accurately assess cataract surgical skills.

DESIGN

Two-armed study: independent cross-sectional and longitudinal cohorts.

SETTING

Washington University Department of Ophthalmology.

PARTICIPANTS AND METHODS

Sixteen cataract surgeons with varying experience levels submitted cataract surgery videos to be graded by 5 experts and 300+ crowdworkers masked to surgeon experience. Cross-sectional study: 50 videos from surgeons ranging from first-year resident to attending physician, pooled by years of training. Longitudinal study: 28 videos obtained at regular intervals as residents progressed through 180 cases. Surgical skill was graded using the modified Objective Structured Assessment of Technical Skill (mOSATS). Main outcome measures were overall technical performance, reliability indices, and correlation between expert and crowd mean scores.

RESULTS

Experts demonstrated high interrater reliability and accurately predicted training level, establishing construct validity for the modified OSATS. Crowd scores were correlated with (r = 0.865, p < 0.0001) but consistently higher than expert scores for first, second, and third-year residents (p < 0.0001, paired t-test). Longer surgery duration negatively correlated with training level (r = -0.855, p < 0.0001) and expert score (r = -0.927, p < 0.0001). The longitudinal dataset reproduced cross-sectional study findings for crowd and expert comparisons. A regression equation transforming crowd score plus video length into expert score was derived from the cross-sectional dataset (r = 0.92) and demonstrated excellent predictive modeling when applied to the independent longitudinal dataset (r = 0.80). A group of student raters who had edited the cataract videos also graded them, producing scores that more closely approximated experts than the crowd.

CONCLUSIONS

Crowdsourced rankings correlated with expert scores, but were not equivalent; crowd scores overestimated technical competency, especially for novice surgeons. A novel approach of adjusting crowd scores with surgery duration generated a more accurate predictive model for surgical skill. More studies are needed before crowdsourcing can be reliably used for assessing surgical proficiency.

摘要

目的

测试众包的外行评估者是否能准确评估白内障手术技能。

设计

双臂研究:独立的横断面和纵向队列。

地点

华盛顿大学眼科系。

参与者和方法

16 名白内障外科医生,他们具有不同的经验水平,向 5 名专家和 300 多名对手术医生经验不知情的众包人员提交白内障手术视频进行评分。横断面研究:对从第一年住院医师到主治医生的外科医生的 50 个视频进行分组,这些视频按培训年限进行分组。纵向研究:作为住院医师完成 180 例手术的常规间隔时间获取 28 个视频。使用改良的客观结构化手术技能评估(mOSATS)来评估手术技能。主要观察指标是总体技术表现、可靠性指标以及专家和众包平均分数之间的相关性。

结果

专家表现出高度的组内可靠性,并准确预测了培训水平,从而建立了改良 OSATS 的结构有效性。众包分数与(r=0.865,p<0.0001)相关,但与第一、第二和第三年住院医师的专家评分相比始终较高(p<0.0001,配对 t 检验)。手术时间延长与培训水平(r=-0.855,p<0.0001)和专家评分(r=-0.927,p<0.0001)呈负相关。纵向数据集再现了众包和专家比较的横断面研究结果。从横断面数据集推导出了一个将众包分数加上视频长度转化为专家分数的回归方程(r=0.92),并将其应用于独立的纵向数据集时,表现出了极好的预测建模效果(r=0.80)。一组编辑过白内障视频的学生评估者也对其进行了评分,他们的评分比众包人员更接近专家。

结论

众包排名与专家评分相关,但并不等同;众包评分高估了技术能力,尤其是对新手外科医生。一种调整手术时间的众包评分的新方法为手术技能的预测模型提供了更准确的结果。在可以可靠地将众包用于评估手术熟练度之前,还需要进行更多的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/c8090241a025/nihms-1674550-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/b2717b844122/nihms-1674550-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/29f48d97803e/nihms-1674550-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/c8090241a025/nihms-1674550-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/b2717b844122/nihms-1674550-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/29f48d97803e/nihms-1674550-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d09/8217126/c8090241a025/nihms-1674550-f0003.jpg

相似文献

1
Crowdsourced Assessment of Surgical Skill Proficiency in Cataract Surgery.众包评估白内障手术的手术技能熟练度。
J Surg Educ. 2021 Jul-Aug;78(4):1077-1088. doi: 10.1016/j.jsurg.2021.02.004. Epub 2021 Feb 25.
2
A study of crowdsourced segment-level surgical skill assessment using pairwise rankings.一项使用成对排名进行众包的节段级手术技能评估研究。
Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1435-47. doi: 10.1007/s11548-015-1238-6. Epub 2015 Jun 30.
3
Vessel Ligation Fundamentals: A Comparison of Technical Evaluations by Crowdsourced Nonclinical Personnel and Surgical Faculty.血管结扎基础:众包非临床人员和外科教员技术评估的比较。
J Surg Educ. 2018 May-Jun;75(3):664-670. doi: 10.1016/j.jsurg.2017.09.030. Epub 2017 Dec 15.
4
Crowdsourced Assessment of Ureteroscopy with Laser Lithotripsy Video Feed Does Not Correlate with Trainee Experience.基于众包的输尿管镜激光碎石术视频反馈评估与学员经验无关。
J Endourol. 2019 Jan;33(1):42-49. doi: 10.1089/end.2018.0534. Epub 2018 Dec 22.
5
Crowd-sourced assessment of technical skills: an adjunct to urology resident surgical simulation training.技术技能的众包评估:泌尿外科住院医师手术模拟训练的辅助手段
J Endourol. 2015 May;29(5):604-9. doi: 10.1089/end.2014.0616. Epub 2015 Jan 7.
6
Crowd-Sourced and Attending Assessment of General Surgery Resident Operative Performance Using Global Ratings Scales.基于全球评分量表的众包和主治医生评估普通外科住院医师手术操作表现。
J Surg Educ. 2020 Nov-Dec;77(6):e214-e219. doi: 10.1016/j.jsurg.2020.07.011. Epub 2020 Oct 8.
7
Crowdsourcing: a valid alternative to expert evaluation of robotic surgery skills.众包:机器人手术技能专家评估的有效替代方法。
Am J Obstet Gynecol. 2016 Nov;215(5):644.e1-644.e7. doi: 10.1016/j.ajog.2016.06.033. Epub 2016 Jun 27.
8
Crowd-Sourced Assessment of Technical Skill: A Valid Method for Discriminating Basic Robotic Surgery Skills.技术技能的众包评估:一种区分基本机器人手术技能的有效方法。
J Endourol. 2015 Nov;29(11):1295-301. doi: 10.1089/end.2015.0191. Epub 2015 Aug 24.
9
C-SATS: Assessing Surgical Skills Among Urology Residency Applicants.C-SATS:评估泌尿外科住院医师申请者的手术技能
J Endourol. 2017 Apr;31(S1):S95-S100. doi: 10.1089/end.2016.0569. Epub 2016 Oct 11.
10
Crowdsourcing Assessment of Surgeon Dissection of Renal Artery and Vein During Robotic Partial Nephrectomy: A Novel Approach for Quantitative Assessment of Surgical Performance.机器人辅助肾部分切除术中外科医生对肾动脉和肾静脉解剖的众包评估:一种评估手术操作的新方法。
J Endourol. 2016 Apr;30(4):447-52. doi: 10.1089/end.2015.0665. Epub 2015 Dec 30.

引用本文的文献

1
Dedicated Chalazion Clinic as a Tool for Early Surgical Education in Ophthalmology Residency.专用睑板腺囊肿诊所作为眼科住院医师早期手术教育的工具。
J Acad Ophthalmol (2017). 2023 Jan 28;15(1):e36-e40. doi: 10.1055/s-0043-1761275. eCollection 2023 Jan.
2
Fluoroscopic image-based behavior analysis can objectively explain subjective expert assessment of wire navigation skill.基于荧光透视影像的行为分析可以客观地解释专家对导丝导航技能的主观评估。
J Orthop Res. 2024 Feb;42(2):404-414. doi: 10.1002/jor.25685. Epub 2023 Sep 24.
3
Using the language of surgery to enhance ophthalmology surgical education.

本文引用的文献

1
Associations Between Video Evaluations of Surgical Technique and Outcomes of Laparoscopic Sleeve Gastrectomy.手术技术视频评估与腹腔镜袖状胃切除术结局的相关性。
JAMA Surg. 2021 Feb 1;156(2):e205532. doi: 10.1001/jamasurg.2020.5532. Epub 2021 Feb 10.
2
Verifying Surgical Competence: Our Fiduciary Responsibility.验证手术能力:我们的信托责任。
Ophthalmology. 2020 Aug;127(8):997-999. doi: 10.1016/j.ophtha.2020.03.022.
3
Ophthalmology Resident Surgical Competence: A Survey of Program Directors.眼科住院医师的手术能力:项目主任的调查
运用外科手术语言提升眼科手术教育水平。
Surg Open Sci. 2023 Jul 14;14:52-59. doi: 10.1016/j.sopen.2023.07.002. eCollection 2023 Aug.
4
A Virtual Reading Center Model Using Crowdsourcing to Grade Photographs for Trachoma: Validation Study.基于众包的沙眼照片分级虚拟阅读中心模型:验证研究。
J Med Internet Res. 2023 Apr 6;25:e41233. doi: 10.2196/41233.
Ophthalmology. 2020 Aug;127(8):1123-1125. doi: 10.1016/j.ophtha.2020.02.017. Epub 2020 Feb 20.
4
Crowdsourcing Morphology Assessments in Oculoplastic Surgery: Reliability and Validity of Lay People Relative to Professional Image Analysts and Experts. crowdsourcing 形态评估在眼整形手术中:非专业人士与专业图像分析员和专家的可靠性和有效性。
Ophthalmic Plast Reconstr Surg. 2020 Mar/Apr;36(2):178-181. doi: 10.1097/IOP.0000000000001515.
5
Can we efficiently use structured rating scales to objectively assess global technical skill in cataract surgery?我们能否有效地使用结构化评分量表来客观评估白内障手术中的整体技术水平?
J Cataract Refract Surg. 2019 Nov;45(11):1682-1683. doi: 10.1016/j.jcrs.2019.07.030.
6
Current Status of Technical Skills Assessment Tools in Surgery: A Systematic Review.手术技术评估工具的现状:系统评价。
J Surg Res. 2020 Feb;246:342-378. doi: 10.1016/j.jss.2019.09.006. Epub 2019 Nov 2.
7
Objective assessment of intraoperative technical skill in capsulorhexis using videos of cataract surgery.使用白内障手术视频对晶状体囊外切除术的术中技术技能进行客观评估。
Int J Comput Assist Radiol Surg. 2019 Jun;14(6):1097-1105. doi: 10.1007/s11548-019-01956-8. Epub 2019 Apr 11.
8
Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。
JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.
9
Crowdsourced Assessment of Ureteroscopy with Laser Lithotripsy Video Feed Does Not Correlate with Trainee Experience.基于众包的输尿管镜激光碎石术视频反馈评估与学员经验无关。
J Endourol. 2019 Jan;33(1):42-49. doi: 10.1089/end.2018.0534. Epub 2018 Dec 22.
10
Assessing Progression of Resident Proficiency during Ophthalmology Residency Training: Utility of Serial Clinical Skill Evaluations.评估眼科住院医师培训期间住院医师熟练程度的进展:系列临床技能评估的效用。
J Med Educ Train. 2017;1(4). Epub 2017 Sep 9.