Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK.
School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China.
Sensors (Basel). 2022 Aug 18;22(16):6188. doi: 10.3390/s22166188.
RGB-D salient object detection (SOD) demonstrates superiority in detecting salient objects in complex environments due to the additional depth information in the data. Inevitably, an independent stream is introduced to extract features from depth images, leading to extra computation and parameters. This methodology sacrifices model size to improve detection accuracy, which may impede the practical application of SOD. To tackle this dilemma, we propose a dynamic knowledge distillation (DKD) method, along with a lightweight structure, which significantly reduces the computational burden while maintaining validity. This method considers the performance of both teacher and student during the training stage and dynamically assigns the distillation weight, instead of applying a fixed weight to the student model. We also investigate the issue of the RGB-D early fusion strategy in distillation and propose a simple noise elimination method to mitigate the impact of distorted training data caused by low-quality depth maps. Extensive experiments on five public datasets demonstrate that our method achieves competitive performance with a fast inference speed (136 FPS) compared to 12 prior methods.
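The abstract describes assigning the distillation weight dynamically from teacher and student performance rather than fixing it. The paper does not give the exact formula here, so the following is only a minimal illustrative sketch of the general idea, assuming a hypothetical scheme where the weight grows with the teacher's relative advantage over the student on the current batch:

```python
import math

def dynamic_distillation_weight(teacher_loss: float, student_loss: float,
                                eps: float = 1e-8) -> float:
    """Hypothetical dynamic weighting (not the paper's exact formula):
    trust the teacher more when it outperforms the student on the
    current batch. Returns a weight in (0, 1) for the distillation term."""
    # Relative advantage of the teacher over the student.
    advantage = (student_loss - teacher_loss) / (student_loss + eps)
    # Squash into (0, 1); 0.5 means teacher and student are comparable.
    return 1.0 / (1.0 + math.exp(-advantage))

def total_loss(task_loss: float, distill_loss: float,
               teacher_loss: float, student_loss: float) -> float:
    """Combined objective: supervised task loss plus a dynamically
    weighted distillation loss, replacing a fixed distillation weight."""
    w = dynamic_distillation_weight(teacher_loss, student_loss)
    return task_loss + w * distill_loss
```

With this sketch, a strong teacher (low `teacher_loss`) pushes the weight above 0.5, while a teacher no better than the student yields a weight near 0.5 or below, so the student is not forced to mimic an unhelpful teacher.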