• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态人工智能在手术器械识别中的应用。

Use of Multimodal Artificial Intelligence in Surgical Instrument Recognition.

作者信息

Haider Syed Ali, Ho Olivia A, Borna Sahar, Gomez-Cabello Cesar A, Pressman Sophia M, Cole Dave, Sehgal Ajai, Leibovich Bradley C, Forte Antonio Jorge

机构信息

Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA.

Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA.

出版信息

Bioengineering (Basel). 2025 Jan 15;12(1):72. doi: 10.3390/bioengineering12010072.

DOI:10.3390/bioengineering12010072
PMID:39851346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11761865/
Abstract

Accurate identification of surgical instruments is crucial for efficient workflows and patient safety within the operating room, particularly in preventing complications such as retained surgical instruments. Artificial Intelligence (AI) models have shown the potential to automate this process. This study evaluates the accuracy of publicly available Large Language Models (LLMs)-ChatGPT-4, ChatGPT-4o, and Gemini-and a specialized commercial mobile application, Surgical-Instrument Directory (SID 2.0), in identifying surgical instruments from images. The study utilized a dataset of 92 high-resolution images of 25 surgical instruments (retractors, forceps, scissors, and trocars) photographed from multiple angles. Model performance was evaluated using accuracy, weighted precision, recall, and F1 score. ChatGPT-4o exhibited the highest accuracy (89.1%) in categorizing instruments (e.g., scissors, forceps). SID 2.0 (77.2%) and ChatGPT-4 (76.1%) achieved comparable accuracy, while Gemini (44.6%) demonstrated lower accuracy in this task. For precise subtype identification of instrument names (like "Mayo scissors" or "Kelly forceps"), all models had low accuracy, with SID 2.0 having an accuracy of 39.1%, followed by ChatGPT-4o (33.69%). Subgroup analysis revealed ChatGPT-4 and 4o recognized trocars in all instances. Similarly, Gemini identified surgical scissors in all instances. In conclusion, publicly available LLMs can reliably identify surgical instruments at the category level, with ChatGPT-4o demonstrating an overall edge. However, precise subtype identification remains a challenge for all models. These findings highlight the potential of AI-driven solutions to enhance surgical-instrument management and underscore the need for further refinements to improve accuracy and support patient safety.

摘要

准确识别手术器械对于手术室高效的工作流程和患者安全至关重要,特别是在预防诸如手术器械遗留等并发症方面。人工智能(AI)模型已显示出使这一过程自动化的潜力。本研究评估了公开可用的大语言模型(LLMs)——ChatGPT-4、ChatGPT-4o和Gemini——以及一款专门的商业移动应用程序“手术器械目录(SID 2.0)”从图像中识别手术器械的准确性。该研究使用了一个数据集,其中包含25种手术器械(牵开器、镊子、剪刀和套管针)从多个角度拍摄的92张高分辨率图像。使用准确率、加权精确率、召回率和F1分数来评估模型性能。ChatGPT-4o在对器械(如剪刀、镊子)进行分类时表现出最高的准确率(89.1%)。SID 2.0(77.2%)和ChatGPT-4(76.1%)达到了可比的准确率,而Gemini(44.6%)在这项任务中表现出较低的准确率。对于器械名称(如“梅奥剪刀”或“凯利镊子”)的精确亚型识别,所有模型的准确率都很低,其中SID 2.0的准确率为39.1%,其次是ChatGPT-4o(33.69%)。亚组分析显示,ChatGPT-4和4o在所有情况下都能识别套管针。同样,Gemini在所有情况下都能识别手术剪刀。总之,公开可用的大语言模型能够可靠地在类别层面识别手术器械,ChatGPT-4o总体上具有优势。然而,精确的亚型识别对所有模型来说仍然是一个挑战。这些发现凸显了人工智能驱动的解决方案在加强手术器械管理方面的潜力,并强调了进一步改进以提高准确性和保障患者安全的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/ee39d542c1b0/bioengineering-12-00072-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/3b5963dc79fa/bioengineering-12-00072-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/891107654b27/bioengineering-12-00072-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/330c8c7ed5db/bioengineering-12-00072-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/6647c18ad727/bioengineering-12-00072-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/ee39d542c1b0/bioengineering-12-00072-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/3b5963dc79fa/bioengineering-12-00072-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/891107654b27/bioengineering-12-00072-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/330c8c7ed5db/bioengineering-12-00072-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/6647c18ad727/bioengineering-12-00072-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8f/11761865/ee39d542c1b0/bioengineering-12-00072-g005.jpg

相似文献

1
Use of Multimodal Artificial Intelligence in Surgical Instrument Recognition.多模态人工智能在手术器械识别中的应用。
Bioengineering (Basel). 2025 Jan 15;12(1):72. doi: 10.3390/bioengineering12010072.
2
Comparative Analysis of ChatGPT-4o and Gemini Advanced Performance on Diagnostic Radiology In-Training Exams.ChatGPT-4o与Gemini在放射诊断学培训考试中的性能对比分析
Cureus. 2025 Mar 20;17(3):e80874. doi: 10.7759/cureus.80874. eCollection 2025 Mar.
3
Breaking Bones, Breaking Barriers: ChatGPT, DeepSeek, and Gemini in Hand Fracture Management.突破骨骼,突破障碍:ChatGPT、DeepSeek和Gemini在手部骨折管理中的应用
J Clin Med. 2025 Mar 14;14(6):1983. doi: 10.3390/jcm14061983.
4
Comparative performance of artificial intelligence models in rheumatology board-level questions: evaluating Google Gemini and ChatGPT-4o.人工智能模型在风湿病委员会级问题中的比较性能:评估 Google Gemini 和 ChatGPT-4o。
Clin Rheumatol. 2024 Nov;43(11):3507-3513. doi: 10.1007/s10067-024-07154-5. Epub 2024 Sep 28.
5
Evaluating ChatGPT and Google Gemini Performance and Implications in Turkish Dental Education.评估ChatGPT和谷歌Gemini在土耳其牙科教育中的性能及影响
Cureus. 2025 Jan 11;17(1):e77292. doi: 10.7759/cureus.77292. eCollection 2025 Jan.
6
Artificial intelligence-large language models (AI-LLMs) for reliable and accurate cardiotocography (CTG) interpretation in obstetric practice.用于产科实践中可靠且准确解读胎心监护(CTG)的人工智能大语言模型(AI-LLMs)。
Comput Struct Biotechnol J. 2025 Mar 18;27:1140-1147. doi: 10.1016/j.csbj.2025.03.026. eCollection 2025.
7
Comparative analysis of ChatGPT-4o mini, ChatGPT-4o and Gemini Advanced in the treatment of postmenopausal osteoporosis.ChatGPT-4o mini、ChatGPT-4o与Gemini Advanced在绝经后骨质疏松症治疗中的对比分析。
BMC Musculoskelet Disord. 2025 Apr 16;26(1):369. doi: 10.1186/s12891-025-08601-3.
8
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
9
A Comparative Analysis of Artificial Intelligence Platforms: ChatGPT-4o and Google Gemini in Answering Questions About Birth Control Methods.人工智能平台的比较分析:ChatGPT-4o与谷歌Gemini在回答避孕方法相关问题方面的表现
Cureus. 2025 Jan 1;17(1):e76745. doi: 10.7759/cureus.76745. eCollection 2025 Jan.
10
The Performance of Artificial Intelligence in One Anastomosis Gastric Bypass Surgery: Comparative Efficacy of ChatGPT-4.0, ChatGPT-Omni, and Gemini AI.人工智能在单吻合口胃旁路手术中的表现:ChatGPT-4.0、ChatGPT-Omni和Gemini AI的疗效比较
Obes Surg. 2025 Apr;35(4):1469-1475. doi: 10.1007/s11695-025-07794-9. Epub 2025 Mar 18.

引用本文的文献

1
Facial Analysis for Plastic Surgery in the Era of Artificial Intelligence: A Comparative Evaluation of Multimodal Large Language Models.人工智能时代整形外科的面部分析:多模态大语言模型的比较评估
J Clin Med. 2025 May 16;14(10):3484. doi: 10.3390/jcm14103484.
2
Advances in AI Technology in Healthcare.人工智能技术在医疗保健领域的进展。
Bioengineering (Basel). 2025 May 11;12(5):506. doi: 10.3390/bioengineering12050506.
3
The Transformative Role of Artificial Intelligence in Plastic and Reconstructive Surgery: Challenges and Opportunities.

本文引用的文献

1
Artificial intelligence model for automated surgical instrument detection and counting: an experimental proof-of-concept study.用于自动手术器械检测与计数的人工智能模型:一项实验性概念验证研究。
Patient Saf Surg. 2024 Jul 21;18(1):24. doi: 10.1186/s13037-024-00406-y.
2
Towards multimodal graph neural networks for surgical instrument anticipation.迈向用于手术器械预测的多模态图神经网络。
Int J Comput Assist Radiol Surg. 2024 Oct;19(10):1929-1937. doi: 10.1007/s11548-024-03226-8. Epub 2024 Jul 10.
3
Surgical Instrument Detection Algorithm Based on Improved YOLOv7x.
人工智能在整形与重建外科中的变革性作用:挑战与机遇
J Clin Med. 2025 Apr 15;14(8):2698. doi: 10.3390/jcm14082698.
4
From Promise to Practice: Harnessing AI's Power to Transform Medicine.从承诺到实践:利用人工智能的力量变革医学
J Clin Med. 2025 Feb 13;14(4):1225. doi: 10.3390/jcm14041225.
基于改进 YOLOv7x 的手术器械检测算法。
Sensors (Basel). 2023 May 24;23(11):5037. doi: 10.3390/s23115037.
4
Surgical instrument detection and tracking technologies: Automating dataset labeling for surgical skill assessment.手术器械检测与跟踪技术:实现手术技能评估数据集标注的自动化
Front Robot AI. 2022 Nov 4;9:1030846. doi: 10.3389/frobt.2022.1030846. eCollection 2022.
5
Multimodal machine learning in precision health: A scoping review.精准健康中的多模态机器学习:一项范围综述。
NPJ Digit Med. 2022 Nov 7;5(1):171. doi: 10.1038/s41746-022-00712-8.
6
Optimizing the surgical instrument tray to immediately increase efficiency and lower costs in the operating room.优化手术器械托盘,立即提高手术室效率,降低成本。
Can J Surg. 2022 Apr 12;65(2):E275-E281. doi: 10.1503/cjs.022720. Print 2022 Mar-Apr.
7
Lightweight Deep Neural Network for Articulated Joint Detection of Surgical Instrument in Minimally Invasive Surgical Robot.用于微创手术机器人中手术器械铰接关节检测的轻量化深度神经网络。
J Digit Imaging. 2022 Aug;35(4):923-937. doi: 10.1007/s10278-022-00616-9. Epub 2022 Mar 9.
8
Artificial Intelligence in Endoscopy.内镜检查中的人工智能
Visc Med. 2021 Dec;37(6):471-475. doi: 10.1159/000519407. Epub 2021 Nov 1.
9
Novel Computer-Aided Diagnosis Software for the Prevention of Retained Surgical Items.新型计算机辅助诊断软件预防手术器械遗留
J Am Coll Surg. 2021 Dec;233(6):686-696. doi: 10.1016/j.jamcollsurg.2021.08.689. Epub 2021 Sep 27.
10
Automation of surgical skill assessment using a three-stage machine learning algorithm.使用三阶段机器学习算法实现手术技能评估自动化。
Sci Rep. 2021 Mar 4;11(1):5197. doi: 10.1038/s41598-021-84295-6.