一种用于语义分割的上下文感知多类损失函数，重点关注复杂区域和类别不平衡问题。

A context aware multiclass loss function for semantic segmentation with a focus on intricate areas and class imbalances.

作者信息

Ghanaei Zahra, Rouhani Modjtaba

机构信息

Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, Iran.

出版信息

Sci Rep. 2025 Jul 19;15(1):26279. doi: 10.1038/s41598-025-08234-5.

DOI:10.1038/s41598-025-08234-5

PMID:40683905

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12276218/

Abstract

Image segmentation models play an important role in many machine vision systems by providing a more interpretable representation of images to computers. The accuracy of these models is vital, as it can directly impact the overall performance of the systems. Therefore, making any progress in this component would be very critical. To improve this aspect, we have developed a new loss function, named SPix-WCE, to boost the performance of deep neural networks in image segmentation tasks. Our primary goal is to address imbalances in image datasets by identifying complicated areas in the images and bringing them more into focus during the model training process. This was achieved by utilizing the SLIC algorithm and analyzing each superpixel to detect key regions in images, followed by implementing a weighting scheme to control the influence of each area in the loss calculation. Subsequently, we carried out a series of experiments to validate our approach. These experiments involved three different models and four multiclass datasets with various degrees of imbalance. The models were trained and tested using the proposed loss function as well as other commonly used ones. The outcomes of our experiments demonstrate that using SPix-based losses led to better results in terms of IoU, F1-Score, and pixel accuracy metrics compared to other methods.

摘要

图像分割模型通过向计算机提供更具可解释性的图像表示，在许多机器视觉系统中发挥着重要作用。这些模型的准确性至关重要，因为它会直接影响系统的整体性能。因此，在这个组件上取得任何进展都非常关键。为了改进这一方面，我们开发了一种新的损失函数，名为SPix-WCE，以提高深度神经网络在图像分割任务中的性能。我们的主要目标是通过识别图像中的复杂区域并在模型训练过程中更关注这些区域，来解决图像数据集的不平衡问题。这是通过利用SLIC算法并分析每个超像素以检测图像中的关键区域，然后实施加权方案来控制损失计算中每个区域的影响来实现的。随后，我们进行了一系列实验来验证我们的方法。这些实验涉及三种不同的模型和四个具有不同程度不平衡的多类数据集。使用所提出的损失函数以及其他常用的损失函数对模型进行训练和测试。我们的实验结果表明，与其他方法相比，使用基于SPix的损失在交并比（IoU）、F1分数和像素准确率指标方面产生了更好的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3536/12276218/4c369b3c419d/41598_2025_8234_Fig1_HTML.jpg

相似文献

A context aware multiclass loss function for semantic segmentation with a focus on intricate areas and class imbalances.

Sci Rep. 2025 Jul 19;15(1):26279. doi: 10.1038/s41598-025-08234-5.

Short-Term Memory Impairment

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Interventions to reduce harm from continued tobacco use.

Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Artificial intelligence for diagnosing exudative age-related macular degeneration.

Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.

J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.

本文引用的文献

Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations.

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2017). 2017;2017:240-248. doi: 10.1007/978-3-319-67558-9_28. Epub 2017 Sep 9.

Loss odyssey in medical image segmentation.

Med Image Anal. 2021 Jul;71:102035. doi: 10.1016/j.media.2021.102035. Epub 2021 Mar 19.

Image Segmentation Using Deep Learning: A Survey.

IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3523-3542. doi: 10.1109/TPAMI.2021.3059968. Epub 2022 Jun 3.

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation.

Nat Methods. 2021 Feb;18(2):203-211. doi: 10.1038/s41592-020-01008-z. Epub 2020 Dec 7.

Boundary loss for highly unbalanced segmentation.

Med Image Anal. 2021 Jan;67:101851. doi: 10.1016/j.media.2020.101851. Epub 2020 Oct 6.

Automated volumetric assessment with artificial neural networks might enable a more accurate assessment of disease burden in patients with multiple sclerosis.

Eur Radiol. 2020 Apr;30(4):2356-2364. doi: 10.1007/s00330-019-06593-y. Epub 2020 Jan 3.

Asymmetric Loss Functions and Deep Densely Connected Networks for Highly Imbalanced Medical Image Segmentation: Application to Multiple Sclerosis Lesion Detection.

IEEE Access. 2019;7:721-1735. doi: 10.1109/ACCESS.2018.2886371. Epub 2018 Dec 12.

Reducing the Hausdorff Distance in Medical Image Segmentation With Convolutional Neural Networks.

IEEE Trans Med Imaging. 2020 Feb;39(2):499-513. doi: 10.1109/TMI.2019.2930068. Epub 2019 Jul 19.

Combo loss: Handling input and output imbalance in multi-organ segmentation.

Comput Med Imaging Graph. 2019 Jul;75:24-33. doi: 10.1016/j.compmedimag.2019.04.005. Epub 2019 May 9.

AnatomyNet: Deep learning for fast and fully automated whole-volume segmentation of head and neck anatomy.

Med Phys. 2019 Feb;46(2):576-589. doi: 10.1002/mp.13300. Epub 2018 Dec 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于语义分割的上下文感知多类损失函数，重点关注复杂区域和类别不平衡问题。

A context aware multiclass loss function for semantic segmentation with a focus on intricate areas and class imbalances.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献