Faculty of Informatics, University of Debrecen, 4028 Debrecen, Hungary.
Department of Obstetrics and Gynaecology, Faculty of Medicine, University of Debrecen, 4032 Debrecen, Hungary.
Sensors (Basel). 2024 May 4;24(9):2926. doi: 10.3390/s24092926.
Performing a minimally invasive surgery comes with a significant advantage regarding rehabilitating the patient after the operation. But it also causes difficulties, mainly for the surgeon or expert who performs the surgical intervention, since only visual information is available and they cannot use their tactile senses during keyhole surgeries. This is the case with laparoscopic hysterectomy since some organs are also difficult to distinguish based on visual information, making laparoscope-based hysterectomy challenging. In this paper, we propose a solution based on semantic segmentation, which can create pixel-accurate predictions of surgical images and differentiate the uterine arteries, ureters, and nerves. We trained three binary semantic segmentation models based on the U-Net architecture with the EfficientNet-b3 encoder; then, we developed two ensemble techniques that enhanced the segmentation performance. Our pixel-wise ensemble examines the segmentation map of the binary networks on the lowest level of pixels. The other algorithm developed is a region-based ensemble technique that takes this examination to a higher level and makes the ensemble based on every connected component detected by the binary segmentation networks. We also introduced and trained a classic multi-class semantic segmentation model as a reference and compared it to the ensemble-based approaches. We used 586 manually annotated images from 38 surgical videos for this research and published this dataset.
微创手术在患者术后康复方面具有显著优势。但它也带来了困难,主要是对于执行手术干预的外科医生或专家来说,因为只有视觉信息可用,并且他们在微创手术中无法使用触觉。腹腔镜子宫切除术就是这种情况,因为仅基于视觉信息也很难区分某些器官,这使得基于腹腔镜的子宫切除术具有挑战性。在本文中,我们提出了一种基于语义分割的解决方案,该方案可以对手术图像进行像素级精确预测,并区分子宫动脉、输尿管和神经。我们基于 U-Net 架构和 EfficientNet-b3 编码器训练了三个二进制语义分割模型;然后,我们开发了两种集成技术来提高分割性能。我们的像素级集成在最低像素级别的二进制网络的分割图上进行检查。另一种开发的算法是基于区域的集成技术,它将这种检查提升到更高的水平,并根据二进制分割网络检测到的每个连通分量进行集成。我们还引入并训练了一个经典的多类语义分割模型作为参考,并将其与基于集成的方法进行了比较。我们使用了 38 个手术视频中的 586 张手动标注图像进行了这项研究,并发布了这个数据集。