IEEE Trans Med Imaging. 2023 Nov;42(11):3408-3419. doi: 10.1109/TMI.2023.3288127. Epub 2023 Oct 27.
Surgical instrument segmentation is of great significance to robot-assisted surgery, but the noise caused by reflection, water mist, and motion blur during the surgery as well as the different forms of surgical instruments would greatly increase the difficulty of precise segmentation. A novel method called Branch Aggregation Attention network (BAANet) is proposed to address these challenges, which adopts a lightweight encoder and two designed modules, named Branch Balance Aggregation module (BBA) and Block Attention Fusion module (BAF), for efficient feature localization and denoising. By introducing the unique BBA module, features from multiple branches are balanced and optimized through a combination of addition and multiplication to complement strengths and effectively suppress noise. Furthermore, to fully integrate the contextual information and capture the region of interest, the BAF module is proposed in the decoder, which receives adjacent feature maps from the BBA module and localizes the surgical instruments from both global and local perspectives by utilizing a dual branch attention mechanism. According to the experimental results, the proposed method has the advantage of being lightweight while outperforming the second-best method by 4.03%, 1.53%, and 1.34% in mIoU scores on three challenging surgical instrument datasets, respectively, compared to the existing state-of-the-art methods. Code is available at https://github.com/SWT-1014/BAANet.
手术器械分割对于机器人辅助手术具有重要意义,但手术过程中反射、水雾和运动模糊造成的噪声以及手术器械的不同形态,极大地增加了精确分割的难度。提出了一种名为分支聚合注意力网络(BAANet)的新方法来应对这些挑战,该方法采用轻量级编码器和两个设计的模块,分别名为分支平衡聚合模块(BBA)和块注意力融合模块(BAF),用于高效的特征定位和去噪。通过引入独特的 BBA 模块,通过加法和乘法的组合平衡和优化来自多个分支的特征,以取长补短并有效抑制噪声。此外,为了充分整合上下文信息并捕捉感兴趣区域,在解码器中提出了 BAF 模块,它接收来自 BBA 模块的相邻特征图,并通过利用双分支注意力机制,从全局和局部视角定位手术器械。根据实验结果,与现有的最先进方法相比,所提出的方法在三个具有挑战性的手术器械数据集上的 mIoU 得分分别提高了 4.03%、1.53%和 1.34%,具有轻量级的优势。代码可在 https://github.com/SWT-1014/BAANet 上获得。