基于语言的微创手术内镜导航辅助中手术导航步骤的翻译和预测。

Language-based translation and prediction of surgical navigation steps for endoscopic wayfinding assistance in minimally invasive surgery.

机构信息

Innovation Center Computer Assisted Surgery (ICCAS), Leipzig University, Semmelweisstraße 14, 04103, Leipzig, Germany.

Department for Ear-, Nose- and Throat-Surgery, University of Leipzig Medical Center, Leipzig, Germany.

出版信息

Int J Comput Assist Radiol Surg. 2020 Dec;15(12):2089-2100. doi: 10.1007/s11548-020-02264-2. Epub 2020 Oct 10.

DOI:10.1007/s11548-020-02264-2

PMID:33037490

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7671992/

Abstract

PURPOSE

In the context of aviation and automotive navigation technology, assistance functions are associated with predictive planning and wayfinding tasks. In endoscopic minimally invasive surgery, however, assistance so far relies primarily on image-based localization and classification. We show that navigation workflows can be described and used for the prediction of navigation steps.

METHODS

A natural description vocabulary for observable anatomical landmarks in endoscopic images was defined to create 3850 navigation workflow sentences from 22 annotated functional endoscopic sinus surgery (FESS) recordings. Resulting FESS navigation workflows showed an imbalanced data distribution with over-represented landmarks in the ethmoidal sinus. A transformer model was trained to predict navigation sentences in sequence-to-sequence tasks. The training was performed with the Adam optimizer and label smoothing in a leave-one-out cross-validation study. The sentences were generated using an adapted beam search algorithm with exponential decay beam rescoring. The transformer model was compared to a standard encoder-decoder-model, as well as HMM and LSTM baseline models.

RESULTS

The transformer model reached the highest prediction accuracy for navigation steps at 0.53, followed by 0.35 of the LSTM and 0.32 for the standard encoder-decoder-network. With an accuracy of sentence generation of 0.83, the prediction of navigation steps at sentence-level benefits from the additional semantic information. While standard class representation predictions suffer from an imbalanced data distribution, the attention mechanism also considered underrepresented classes reasonably well.

CONCLUSION

We implemented a natural language-based prediction method for sentence-level navigation steps in endoscopic surgery. The sentence-level prediction method showed a potential that word relations to navigation tasks can be learned and used for predicting future steps. Further studies are needed to investigate the functionality of path prediction. The prediction approach is a first step in the field of visuo-linguistic navigation assistance for endoscopic minimally invasive surgery.

摘要

目的

在航空和汽车导航技术领域，辅助功能与预测规划和导航任务相关。然而，在内窥镜微创手术中，辅助功能主要依赖于基于图像的定位和分类。我们表明，导航工作流程可以被描述并用于预测导航步骤。

方法

定义了内窥镜图像中可观察到的解剖学标志的自然描述词汇，从 22 个标注的功能性内窥镜鼻窦手术 (FESS) 记录中创建了 3850 个导航工作流程句子。结果的 FESS 导航工作流程显示出数据分布不平衡，筛窦中地标过多。使用 Adam 优化器和标签平滑在留一交叉验证研究中对变压器模型进行了训练，以进行序列到序列任务的预测。使用自适应波束搜索算法和指数衰减波束重新评分生成句子。将变压器模型与标准编码器-解码器模型以及 HMM 和 LSTM 基线模型进行了比较。

结果

变压器模型在导航步骤预测方面达到了最高的准确度 0.53，其次是 LSTM 的 0.35 和标准编码器-解码器网络的 0.32。句子生成的准确度为 0.83，在句子级别预测导航步骤得益于额外的语义信息。虽然标准类别表示预测受到数据分布不平衡的影响，但注意力机制也相当合理地考虑了代表性不足的类别。

结论

我们实现了一种基于自然语言的内窥镜手术中句子级导航步骤预测方法。句子级预测方法显示出一种潜力，即可以学习并使用词与导航任务之间的关系来预测未来步骤。需要进一步的研究来调查路径预测的功能。该预测方法是内窥镜微创手术视觉语言导航辅助领域的一个初步步骤。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ca2/7671992/e981621b92df/11548_2020_2264_Fig1_HTML.jpg

相似文献

Language-based translation and prediction of surgical navigation steps for endoscopic wayfinding assistance in minimally invasive surgery.基于语言的微创手术内镜导航辅助中手术导航步骤的翻译和预测。

Int J Comput Assist Radiol Surg. 2020 Dec;15(12):2089-2100. doi: 10.1007/s11548-020-02264-2. Epub 2020 Oct 10.

[BIOPASS hybrid navigation for endoscopic sinus surgery - an assistance system].[用于鼻内镜鼻窦手术的BIOPASS混合导航——一种辅助系统]

Laryngorhinootologie. 2023 Jan;102(1):32-39. doi: 10.1055/a-1940-9723. Epub 2022 Nov 3.

Image-guided minimally invasive endopancreatic surgery using a computer-assisted navigation system.计算机辅助导航系统引导下的微创胰腺内视镜手术。

Surg Endosc. 2021 Apr;35(4):1610-1617. doi: 10.1007/s00464-020-07540-5. Epub 2020 Apr 6.

Minimally Invasive Full-Endoscopic Posterior Cervical Foraminotomy Assisted by O-Arm-Based Navigation.基于 O 型臂导航的微创全内窥镜下颈椎侧方椎间孔切开术。

Pain Physician. 2018 May;21(3):E215-E223.

Utilization of artificial intelligence in minimally invasive right adrenalectomy: recognition of anatomical landmarks with deep learning.人工智能在微创右肾上腺切除术的应用：深度学习识别解剖标志。

Acta Chir Belg. 2024 Dec;124(6):492-498. doi: 10.1080/00015458.2024.2363599. Epub 2024 Jun 10.

Workflow and simulation of image-to-physical registration of holes inside spongy bone.松质骨内孔洞的图像到物理配准的工作流程与模拟

Int J Comput Assist Radiol Surg. 2017 Aug;12(8):1425-1437. doi: 10.1007/s11548-017-1594-5. Epub 2017 May 6.

Endoscopic navigation for minimally invasive suturing.用于微创缝合的内镜导航

Med Image Comput Comput Assist Interv. 2007;10(Pt 2):620-7. doi: 10.1007/978-3-540-75759-7_75.

Rotatable flexible neck-model for the evaluation of minimally invasive operation procedures with the help of an ultrasound-based navigation system.借助基于超声的导航系统用于评估微创手术程序的可旋转柔性颈部模型。

Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:1140-3. doi: 10.1109/EMBC.2013.6609707.

A systematic review of common landmarks in navigated endoscopic sinus surgery (NESS).导航内镜鼻窦手术（NESS）常见标志的系统评价。

Comput Assist Surg (Abingdon). 2021 Dec;26(1):77-84. doi: 10.1080/24699322.2021.1992504.

Predicting Semantic Similarity Between Clinical Sentence Pairs Using Transformer Models: Evaluation and Representational Analysis.使用Transformer模型预测临床句子对之间的语义相似性：评估与表征分析

JMIR Med Inform. 2021 May 26;9(5):e23099. doi: 10.2196/23099.

引用本文的文献

Artificial Intelligence in Otology, Rhinology, and Laryngology: A Narrative Review of Its Current and Evolving Picture.人工智能在耳科学、鼻科学和喉科学中的应用：对其现状与发展态势的叙述性综述

Cureus. 2024 Aug 2;16(8):e66036. doi: 10.7759/cureus.66036. eCollection 2024 Aug.

A Scoping Review of Artificial Intelligence Research in Rhinology.鼻科学人工智能研究的范围综述。

Am J Rhinol Allergy. 2023 Jul;37(4):438-448. doi: 10.1177/19458924231162437. Epub 2023 Mar 9.

Keyword-augmented and semi-automatic generation of FESS reports: a proof-of-concept study.关键词增强和 FESS 报告半自动生成：概念验证研究。

Int J Comput Assist Radiol Surg. 2023 May;18(5):961-968. doi: 10.1007/s11548-022-02791-0. Epub 2022 Nov 17.

The Evolution and Application of Artificial Intelligence in Rhinology: A State of the Art Review.人工智能在鼻科学中的发展与应用：综述

Otolaryngol Head Neck Surg. 2023 Jul;169(1):21-30. doi: 10.1177/01945998221110076. Epub 2023 Jan 29.

本文引用的文献

Intraoperative surgery room management: A deep learning perspective.术中手术室管理：深度学习视角。

Int J Med Robot. 2020 Oct;16(5):1-12. doi: 10.1002/rcs.2136. Epub 2020 Jun 30.

Deep learning-based anatomical site classification for upper gastrointestinal endoscopy.基于深度学习的上消化道内镜解剖部位分类。

Int J Comput Assist Radiol Surg. 2020 Jul;15(7):1085-1094. doi: 10.1007/s11548-020-02148-5. Epub 2020 May 6.

EasyLabels: weak labels for scene segmentation in laparoscopic videos.EasyLabels：腹腔镜视频场景分割的弱标签。

Int J Comput Assist Radiol Surg. 2019 Jul;14(7):1247-1257. doi: 10.1007/s11548-019-02003-2. Epub 2019 Jun 4.

Video-based surgical skill assessment using 3D convolutional neural networks.基于视频的三维卷积神经网络手术技能评估。

Int J Comput Assist Radiol Surg. 2019 Jul;14(7):1217-1225. doi: 10.1007/s11548-019-01995-1. Epub 2019 May 18.

Prediction of laparoscopic procedure duration using unlabeled, multimodal sensor data.使用未标记的多模态传感器数据预测腹腔镜手术时间。

Int J Comput Assist Radiol Surg. 2019 Jun;14(6):1089-1095. doi: 10.1007/s11548-019-01966-6. Epub 2019 Apr 9.

"Deep-Onto" network for surgical workflow and context recognition.“Deep-Onto”网络用于手术流程和上下文识别。

Int J Comput Assist Radiol Surg. 2019 Apr;14(4):685-696. doi: 10.1007/s11548-018-1882-8. Epub 2018 Nov 16.

Advanced Endoscopic Navigation: Surgical Big Data, Methodology, and Applications.高级内镜导航：手术大数据、方法学与应用。

Annu Rev Biomed Eng. 2018 Jun 4;20:221-251. doi: 10.1146/annurev-bioeng-062117-120917. Epub 2018 Mar 5.

Image-Based Navigation for Functional Endoscopic Sinus Surgery Using Structure From Motion.基于运动结构的功能内镜鼻窦手术图像导航

Proc SPIE Int Soc Opt Eng. 2016 Feb-Mar;9784. doi: 10.1117/12.2217279. Epub 2016 Mar 21.

Endoscopic Image Classification and Retrieval using Clustered Convolutional Features.基于聚类卷积特征的内镜图像分类与检索。

J Med Syst. 2017 Oct 30;41(12):196. doi: 10.1007/s10916-017-0836-y.

Online scene association for endoscopic navigation.用于内镜导航的在线场景关联

Med Image Comput Comput Assist Interv. 2014;17(Pt 2):316-23. doi: 10.1007/978-3-319-10470-6_40.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于语言的微创手术内镜导航辅助中手术导航步骤的翻译和预测。

Language-based translation and prediction of surgical navigation steps for endoscopic wayfinding assistance in minimally invasive surgery.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献