• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自知识蒸馏的像素级自适应标签平滑的有限标注数据语义分割。

Semantic Segmentation Using Pixel-Wise Adaptive Label Smoothing via Self-Knowledge Distillation for Limited Labeling Data.

机构信息

Department of Electrical and Computer Engineering, Ajou University, Suwon 16449, Korea.

Department of Artificial Intelligence, Ajou University, Suwon 16449, Korea.

出版信息

Sensors (Basel). 2022 Mar 29;22(7):2623. doi: 10.3390/s22072623.

DOI:10.3390/s22072623
PMID:35408237
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9003518/
Abstract

To achieve high performance, most deep convolutional neural networks (DCNNs) require a significant amount of training data with ground truth labels. However, creating ground-truth labels for semantic segmentation requires more time, human effort, and cost compared with other tasks such as classification and object detection, because the ground-truth label of every pixel in an image is required. Hence, it is practically demanding to train DCNNs using a limited amount of training data for semantic segmentation. Generally, training DCNNs using a limited amount of data is problematic as it easily results in a decrease in the accuracy of the networks because of overfitting to the training data. Here, we propose a new regularization method called pixel-wise adaptive label smoothing (PALS) via self-knowledge distillation to stably train semantic segmentation networks in a practical situation, in which only a limited amount of training data is available. To mitigate the problem caused by limited training data, our method fully utilizes the internal statistics of pixels within an input image. Consequently, the proposed method generates a pixel-wise aggregated probability distribution using a similarity matrix that encodes the affinities between all pairs of pixels. To further increase the accuracy, we add one-hot encoded distributions with ground-truth labels to these aggregated distributions, and obtain our final soft labels. We demonstrate the effectiveness of our method for the Cityscapes dataset and the Pascal VOC2012 dataset using limited amounts of training data, such as 10%, 30%, 50%, and 100%. Based on various quantitative and qualitative comparisons, our method demonstrates more accurate results compared with previous methods. Specifically, for the Cityscapes test set, our method achieved mIoU improvements of 0.076%, 1.848%, 1.137%, and 1.063% for 10%, 30%, 50%, and 100% training data, respectively, compared with the method of the cross-entropy loss using one-hot encoding with ground truth labels.

摘要

为了实现高性能,大多数深度卷积神经网络 (DCNN) 需要大量具有真实标签的训练数据。然而,与分类和目标检测等其他任务相比,创建语义分割的真实标签需要更多的时间、人力和成本,因为需要图像中每个像素的真实标签。因此,在语义分割中使用有限数量的训练数据来训练 DCNN 在实践中是非常具有挑战性的。一般来说,使用有限数量的数据训练 DCNN 存在问题,因为它容易导致网络精度下降,因为网络过度拟合训练数据。在这里,我们提出了一种新的正则化方法,称为像素自适应标签平滑 (PALS),通过自我知识蒸馏,在只有有限数量的训练数据的实际情况下稳定地训练语义分割网络。为了解决有限训练数据带来的问题,我们的方法充分利用了输入图像中像素的内部统计信息。因此,该方法使用相似度矩阵生成像素级聚合概率分布,相似度矩阵编码所有像素对之间的亲和力。为了进一步提高精度,我们将具有真实标签的 One-hot 编码分布添加到这些聚合分布中,并获得最终的软标签。我们使用有限数量的训练数据(例如 10%、30%、50%和 100%)在 Cityscapes 数据集和 Pascal VOC2012 数据集上证明了我们方法的有效性。通过各种定量和定性比较,我们的方法与以前的方法相比表现出更准确的结果。具体来说,对于 Cityscapes 测试集,我们的方法在 10%、30%、50%和 100%的训练数据下,与使用真实标签的 One-hot 编码的交叉熵损失方法相比,分别提高了 mIoU 0.076%、1.848%、1.137%和 1.063%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/a00e60af2c6a/sensors-22-02623-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/b97347742a87/sensors-22-02623-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/66d4aa5d4f67/sensors-22-02623-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/2c3d7acd55c7/sensors-22-02623-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/df34853f22da/sensors-22-02623-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/64c21e0a0d80/sensors-22-02623-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/e2833f63e1d5/sensors-22-02623-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/cbcaf628aca3/sensors-22-02623-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/a00e60af2c6a/sensors-22-02623-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/b97347742a87/sensors-22-02623-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/66d4aa5d4f67/sensors-22-02623-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/2c3d7acd55c7/sensors-22-02623-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/df34853f22da/sensors-22-02623-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/64c21e0a0d80/sensors-22-02623-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/e2833f63e1d5/sensors-22-02623-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/cbcaf628aca3/sensors-22-02623-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d719/9003518/a00e60af2c6a/sensors-22-02623-g008.jpg

相似文献

1
Semantic Segmentation Using Pixel-Wise Adaptive Label Smoothing via Self-Knowledge Distillation for Limited Labeling Data.基于自知识蒸馏的像素级自适应标签平滑的有限标注数据语义分割。
Sensors (Basel). 2022 Mar 29;22(7):2623. doi: 10.3390/s22072623.
2
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
3
Methods for the frugal labeler: Multi-class semantic segmentation on heterogeneous labels.节俭标签器的方法:异构标签上的多类别语义分割。
PLoS One. 2022 Feb 8;17(2):e0263656. doi: 10.1371/journal.pone.0263656. eCollection 2022.
4
Group-Wise Learning for Weakly Supervised Semantic Segmentation.基于群体学习的弱监督语义分割。
IEEE Trans Image Process. 2022;31:799-811. doi: 10.1109/TIP.2021.3132834. Epub 2022 Jan 4.
5
DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation.DTS-Net:用于快速准确语义对象分割的深度到空间网络
Sensors (Basel). 2022 Jan 3;22(1):337. doi: 10.3390/s22010337.
6
A Performance Improvement Strategy for Concrete Damage Detection Using Stacking Ensemble Learning of Multiple Semantic Segmentation Networks.基于多语义分割网络堆叠集成学习的混凝土损伤检测性能提升策略。
Sensors (Basel). 2022 Apr 27;22(9):3341. doi: 10.3390/s22093341.
7
FE-Net: Feature enhancement segmentation network.FE-Net:特征增强分割网络。
Neural Netw. 2024 Jun;174:106232. doi: 10.1016/j.neunet.2024.106232. Epub 2024 Mar 11.
8
Knowledge Distillation for Semantic Segmentation Using Channel and Spatial Correlations and Adaptive Cross Entropy.基于通道与空间相关性及自适应交叉熵的语义分割知识蒸馏
Sensors (Basel). 2020 Aug 17;20(16):4616. doi: 10.3390/s20164616.
9
Semantic-Aware Contrastive Learning for Multi-Object Medical Image Segmentation.基于语义感知对比学习的多目标医学图像分割。
IEEE J Biomed Health Inform. 2023 Sep;27(9):4444-4453. doi: 10.1109/JBHI.2023.3285230. Epub 2023 Sep 6.
10
A Lightweight Semantic Segmentation Algorithm Based on Deep Convolutional Neural Networks.基于深度卷积神经网络的轻量级语义分割算法。
Comput Intell Neurosci. 2022 Sep 6;2022:5339664. doi: 10.1155/2022/5339664. eCollection 2022.

引用本文的文献

1
FIAEPI-KD: A novel knowledge distillation approach for precise detection of missing insulators in transmission lines.FIAEPI-KD:一种用于精确检测输电线路中缺失绝缘子的新型知识蒸馏方法。
PLoS One. 2025 May 30;20(5):e0324524. doi: 10.1371/journal.pone.0324524. eCollection 2025.

本文引用的文献

1
Self-Training for Class-Incremental Semantic Segmentation.用于类别增量语义分割的自训练
IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9116-9127. doi: 10.1109/TNNLS.2022.3155746. Epub 2023 Oct 27.
2
MSRF-Net: A Multi-Scale Residual Fusion Network for Biomedical Image Segmentation.MSRF-Net:一种用于生物医学图像分割的多尺度残差融合网络。
IEEE J Biomed Health Inform. 2022 May;26(5):2252-2263. doi: 10.1109/JBHI.2021.3138024. Epub 2022 May 5.
3
Knowledge Distillation for Semantic Segmentation Using Channel and Spatial Correlations and Adaptive Cross Entropy.
基于通道与空间相关性及自适应交叉熵的语义分割知识蒸馏
Sensors (Basel). 2020 Aug 17;20(16):4616. doi: 10.3390/s20164616.
4
Structured Knowledge Distillation for Dense Prediction.面向密集预测的结构化知识蒸馏。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7035-7049. doi: 10.1109/TPAMI.2020.3001940. Epub 2023 May 5.
5
Deep High-Resolution Representation Learning for Visual Recognition.用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.
6
Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency.具有高低级一致性的半监督语义分割
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1369-1379. doi: 10.1109/TPAMI.2019.2960224. Epub 2021 Mar 4.
7
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
8
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.