Center for Orthopaedic Biomechanics, University of Denver, 2155 E Wesley Ave, Denver, CO, 80210, USA.
Unmanned Systems Research Institute, University of Denver, 2155 E Wesley Ave, Denver, CO, 80210, USA.
Int J Comput Assist Radiol Surg. 2023 Dec;18(12):2125-2142. doi: 10.1007/s11548-023-02890-6. Epub 2023 Apr 30.
Depending on the associated speed and accuracy requirements, multiple applications in open surgical environments may benefit from adopting markerless computer vision. The current work evaluates vision models for 6-degree-of-freedom (6-DoF) pose estimation of surgical instruments in RGB scenes. Potential use cases are discussed based on the observed performance.
Convolutional neural networks (CNNs) were developed with simulated training data for 6-DoF pose estimation of a representative surgical instrument in RGB scenes. Trained models were evaluated on both simulated and real-world scenes. Real-world scenes were produced by using a robotic manipulator to procedurally generate a wide range of object poses.
CNNs trained in simulation transferred to real-world evaluation scenes with a mild decrease in pose accuracy. Model performance was sensitive to input image resolution and orientation prediction format. The model with highest accuracy demonstrated mean in-plane translation error of 13 mm and mean long-axis orientation error of 5° in simulated evaluation scenes. Similar errors of 29 mm and 8° were observed in real-world scenes.
6-DoF pose estimators can predict object pose in RGB scenes at real-time inference speeds. The observed pose accuracy suggests that applications such as coarse-grained guidance, surgical skill evaluation, or instrument tracking for tray optimization may benefit from markerless pose estimation.
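The abstract reports sensitivity to the orientation prediction format and a "long-axis orientation error" metric. As an illustrative sketch (the paper's exact format is not stated here), one common continuous orientation representation for CNN heads is a 6-D output that is mapped to a rotation matrix by Gram–Schmidt orthonormalization; the long-axis error can then be computed as the angle between the instrument's long axis under the predicted and ground-truth rotations. All function names and the choice of long axis below are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def sixd_to_rotation_matrix(sixd):
    """Map a raw 6-D orientation prediction (two unnormalized 3-vectors)
    to a valid rotation matrix via Gram-Schmidt orthonormalization.
    This is one common CNN orientation-head format, used here purely
    as an illustrative assumption."""
    a1, a2 = np.asarray(sixd[:3], float), np.asarray(sixd[3:], float)
    b1 = a1 / np.linalg.norm(a1)                 # first orthonormal column
    b2 = a2 - np.dot(b1, a2) * b1                # remove component along b1
    b2 = b2 / np.linalg.norm(b2)
    b3 = np.cross(b1, b2)                        # completes right-handed frame
    return np.stack([b1, b2, b3], axis=1)        # columns are the basis

def long_axis_error_deg(R_pred, R_true, axis=np.array([0.0, 0.0, 1.0])):
    """Angle in degrees between the instrument long axis under the
    predicted and ground-truth rotations (the long axis is assumed to
    be the local z-axis here)."""
    v_pred = R_pred @ axis
    v_true = R_true @ axis
    cos_ang = np.clip(np.dot(v_pred, v_true), -1.0, 1.0)
    return float(np.degrees(np.arccos(cos_ang)))
```

For example, the raw prediction [1, 0, 0, 0, 1, 0] maps to the identity rotation, giving a long-axis error of 0° against an identity ground truth.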