Dhiyanesh B, Vijayalakshmi M, Saranya P, Viji D
Department of Computer Science and Engineering (ETech), SRM Institute of Science and Technology, Vadapalani Campus, Chennai, Tamil Nadu, India.
Department of Computing Technologies, SRM Institute of Science and Technology, Kattankulathur, Chennai, India.
Sci Rep. 2025 May 23;15(1):17892. doi: 10.1038/s41598-025-02470-5.
Semantic segmentation involves an imminent part in the investigation of medical images, particularly in the domain of microvascular decompression, where publicly available datasets are scarce, and expert annotation is demanding. In response to this challenge, this study presents a meticulously curated dataset comprising 2003 RGB microvascular decompression images, each intricately paired with annotated masks. Extensive data preprocessing and augmentation strategies were employed to fortify the training dataset, enhancing the robustness of proposed deep learning model. Numerous up-to-date semantic segmentation approaches, including DeepLabv3+, U-Net, DilatedFastFCN with JPU, DANet, and a custom Vanilla architecture, were trained and evaluated using diverse performance metrics. Among these models, DeepLabv3 + emerged as a strong contender, notably excelling in F1 score. Innovatively, ensemble techniques, such as stacking and bagging, were introduced to further elevate segmentation performance. Bagging, notably with the Naïve Bayes approach, exhibited significant improvements, underscoring the potential of ensemble methods in medical image segmentation. The proposed EnsembleEdgeFusion technique exhibited superior loss reduction during training compared to DeepLabv3 + and achieved maximum Mean Intersection over Union (MIoU) scores of 77.73%, surpassing other models. Category-wise analysis affirmed its superiority in accurately delineating various categories within the test dataset.
语义分割在医学图像研究中起着重要作用,特别是在微血管减压领域,那里公开可用的数据集稀缺,且专家标注要求很高。针对这一挑战,本研究提出了一个精心策划的数据集,包含2003张RGB微血管减压图像,每张图像都与注释掩码精确配对。采用了广泛的数据预处理和增强策略来强化训练数据集,提高所提出的深度学习模型的鲁棒性。使用多种性能指标对包括DeepLabv3+、U-Net、带JPU的扩张式快速全卷积网络(DilatedFastFCN)、DANet和一种自定义的朴素架构在内的众多最新语义分割方法进行了训练和评估。在这些模型中,DeepLabv3+成为有力的竞争者,在F1分数方面表现尤为出色。创新性地引入了堆叠和装袋等集成技术,以进一步提高分割性能。装袋,特别是采用朴素贝叶斯方法时,表现出显著的改进,突出了集成方法在医学图像分割中的潜力。与DeepLabv3+相比,所提出的EnsembleEdgeFusion技术在训练期间表现出更好的损失降低效果,并实现了77.73%的最大平均交并比(MIoU)分数,超过了其他模型。类别分析证实了其在准确描绘测试数据集中各类别方面的优越性。