• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于内镜图像仪器识别的多层次特征聚合网络。

Multi-level feature aggregation network for instrument identification of endoscopic images.

机构信息

Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081 People's Republic of China. Authors contribute equally to this article.

出版信息

Phys Med Biol. 2020 Aug 31;65(16):165004. doi: 10.1088/1361-6560/ab8dda.

DOI:10.1088/1361-6560/ab8dda
PMID:32344381
Abstract

Identification of surgical instruments is crucial in understanding surgical scenarios and providing an assistive process in endoscopic image-guided surgery. This study proposes a novel multilevel feature-aggregated deep convolutional neural network (MLFA-Net) for identifying surgical instruments in endoscopic images. First, a global feature augmentation layer is created on the top layer of the backbone to improve the localization ability of object identification by boosting the high-level semantic information to the feature flow network. Second, a modified interaction path of cross-channel features is proposed to increase the nonlinear combination of features in the same level and improve the efficiency of information propagation. Third, a multiview fusion branch of features is built to aggregate the location-sensitive information of the same level in different views, increase the information diversity of features, and enhance the localization ability of objects. By utilizing the latent information, the proposed network of multilevel feature aggregation can accomplish multitask instrument identification with a single network. Three tasks are handled by the proposed network, including object detection, which classifies the type of instrument and locates its border; mask segmentation, which detects the instrument shape; and pose estimation, which detects the keypoint of instrument parts. The experiments are performed on laparoscopic images from MICCAI 2017 Endoscopic Vision Challenge, and the mean average precision (AP) and average recall (AR) are utilized to quantify the segmentation and pose estimation results. For the bounding box regression, the AP and AR are 79.1% and 63.2%, respectively, while the AP and AR of mask segmentation are 78.1% and 62.1%, and the AP and AR of the pose estimation achieve 67.1% and 55.7%, respectively. The experiments demonstrate that our method efficiently improves the recognition accuracy of the instrument in endoscopic images, and outperforms the other state-of-the-art methods.

摘要

在理解手术场景和为内镜图像引导手术提供辅助过程中,识别手术器械至关重要。本研究提出了一种新的多级特征聚合深度卷积神经网络(MLFA-Net),用于识别内镜图像中的手术器械。首先,在骨干网络的顶层创建一个全局特征增强层,通过将高层语义信息提升到特征流网络,提高目标识别的定位能力。其次,提出了一种改进的交叉通道特征交互路径,以增加同层特征的非线性组合,提高信息传播效率。第三,构建了一个多视图特征融合分支,聚合不同视图中同层的位置敏感信息,增加特征的信息多样性,增强目标的定位能力。通过利用潜在信息,提出的多级特征聚合网络可以利用单个网络完成多任务器械识别。所提出的网络处理三个任务,包括对象检测,用于分类器械类型并定位其边界;掩模分割,用于检测器械形状;以及姿态估计,用于检测器械部分的关键点。实验在 MICCAI 2017 内镜视觉挑战赛上的腹腔镜图像上进行,使用平均精度(AP)和平均召回率(AR)来量化分割和姿态估计结果。对于边界框回归,AP 和 AR 分别为 79.1%和 63.2%,而掩模分割的 AP 和 AR 分别为 78.1%和 62.1%,姿态估计的 AP 和 AR 分别为 67.1%和 55.7%。实验表明,我们的方法有效地提高了内镜图像中器械的识别精度,优于其他最新方法。

相似文献

1
Multi-level feature aggregation network for instrument identification of endoscopic images.用于内镜图像仪器识别的多层次特征聚合网络。
Phys Med Biol. 2020 Aug 31;65(16):165004. doi: 10.1088/1361-6560/ab8dda.
2
A parallel network utilizing local features and global representations for segmentation of surgical instruments.一种利用局部特征和全局表示进行手术器械分割的并行网络。
Int J Comput Assist Radiol Surg. 2022 Oct;17(10):1903-1913. doi: 10.1007/s11548-022-02687-z. Epub 2022 Jun 10.
3
Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。
Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.
4
An attention-guided network for surgical instrument segmentation from endoscopic images.基于注意力引导的内窥镜图像手术器械分割网络。
Comput Biol Med. 2022 Dec;151(Pt A):106216. doi: 10.1016/j.compbiomed.2022.106216. Epub 2022 Oct 24.
5
An adaptive and fully automatic method for estimating the 3D position of bendable instruments using endoscopic images.一种使用内窥镜图像估计可弯曲器械三维位置的自适应全自动方法。
Int J Med Robot. 2017 Dec;13(4). doi: 10.1002/rcs.1812. Epub 2017 Apr 7.
6
GC-Net: Global context network for medical image segmentation.GC-Net:用于医学图像分割的全局上下文网络。
Comput Methods Programs Biomed. 2020 Jul;190:105121. doi: 10.1016/j.cmpb.2019.105121. Epub 2019 Oct 4.
7
An iterative multi-path fully convolutional neural network for automatic cardiac segmentation in cine MR images.基于迭代多路径全卷积神经网络的心脏电影磁共振图像自动分割方法。
Med Phys. 2019 Dec;46(12):5652-5665. doi: 10.1002/mp.13859. Epub 2019 Nov 1.
8
An integrated approach to endoscopic instrument tracking for augmented reality applications in surgical simulation training.用于手术模拟培训中增强现实应用的内镜器械跟踪的集成方法。
Int J Med Robot. 2013 Dec;9(4):e34-51. doi: 10.1002/rcs.1485. Epub 2013 Jan 25.
9
CGBA-Net: context-guided bidirectional attention network for surgical instrument segmentation.CGBA-Net:用于手术器械分割的上下文引导双向注意网络。
Int J Comput Assist Radiol Surg. 2023 Oct;18(10):1769-1781. doi: 10.1007/s11548-023-02906-1. Epub 2023 May 18.
10
Composited FishNet: Fish Detection and Species Recognition From Low-Quality Underwater Videos.复合鱼网:从低质量水下视频中进行鱼类检测和物种识别。
IEEE Trans Image Process. 2021;30:4719-4734. doi: 10.1109/TIP.2021.3074738. Epub 2021 May 3.

引用本文的文献

1
Deep Learning-Based Semantic Segmentation for Objective Colonoscopy Quality Assessment.基于深度学习的语义分割用于客观结肠镜检查质量评估
J Imaging. 2025 Mar 18;11(3):84. doi: 10.3390/jimaging11030084.
2
CLAD-Net: cross-layer aggregation attention network for real-time endoscopic instrument detection.CLAD-Net:用于实时内镜器械检测的跨层聚合注意力网络。
Health Inf Sci Syst. 2023 Nov 27;11(1):58. doi: 10.1007/s13755-023-00260-9. eCollection 2023 Dec.
3
Feature matching for texture-less endoscopy images via superpixel vector field consistency.
基于超像素向量场一致性的无纹理内镜图像特征匹配
Biomed Opt Express. 2022 Mar 18;13(4):2247-2265. doi: 10.1364/BOE.450259. eCollection 2022 Apr 1.
4
Detection of blood stains using computer vision-based algorithms and their association with postoperative outcomes in thoracoscopic lobectomies.基于计算机视觉算法的血渍检测及其与胸腔镜肺叶切除术后结果的关联。
Eur J Cardiothorac Surg. 2022 Oct 4;62(5). doi: 10.1093/ejcts/ezac154.