Zhou Yihang, Towning Rebecca, Awad Zaid, Giannarou Stamatia
Hamlyn Centre for Robotic Surgery, Department of Surgery and Cancer Imperial College London London UK.
Imperial College Healthcare NHS Trust London UK.
Healthc Technol Lett. 2025 Jan 31;12(1):e70003. doi: 10.1049/htl2.70003. eCollection 2025 Jan-Dec.
Surgical scene segmentation is essential for enhancing surgical precision, yet it is frequently compromised by the scarcity and imbalance of available data. To address these challenges, semantic image synthesis methods based on generative adversarial networks and diffusion models have been developed. However, these models often yield non-diverse images and fail to capture small, critical tissue classes, limiting their effectiveness. In response, a class-aware semantic diffusion model (CASDM), a novel approach which utilizes segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance is proposed. Novel class-aware mean squared error and class-aware self-perceptual loss functions have been defined to prioritize critical, less visible classes, thereby enhancing image quality and relevance. Furthermore, to the authors' knowledge, they are the first to generate multi-class segmentation maps using text prompts in a novel fashion to specify their contents. These maps are then used by CASDM to generate surgical scene images, enhancing datasets for training and validating segmentation models. This evaluation assesses both image quality and downstream segmentation performance, demonstrates the strong effectiveness and generalisability of CASDM in producing realistic image-map pairs, significantly advancing surgical scene segmentation across diverse and challenging datasets.
手术场景分割对于提高手术精度至关重要,但它经常因可用数据的稀缺和不平衡而受到影响。为应对这些挑战,基于生成对抗网络和扩散模型的语义图像合成方法已被开发出来。然而,这些模型往往生成的图像缺乏多样性,并且无法捕捉到小的关键组织类别,从而限制了它们的有效性。作为回应,提出了一种类感知语义扩散模型(CASDM),这是一种新颖的方法,它利用分割图作为图像合成的条件来解决数据稀缺和不平衡问题。定义了新颖的类感知均方误差和类感知自感知损失函数,以优先处理关键的、不太明显的类别,从而提高图像质量和相关性。此外,据作者所知,他们首次以新颖的方式使用文本提示生成多类分割图来指定其内容。然后,CASDM使用这些分割图来生成手术场景图像,增强用于训练和验证分割模型的数据集。该评估同时评估了图像质量和下游分割性能,证明了CASDM在生成逼真的图像-分割图对方面的强大有效性和通用性,显著推进了跨不同且具有挑战性的数据集的手术场景分割。