ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection

Yang Jihan, Shi Shaoshuai, Wang Zhe, Li Hongsheng, Qi Xiaojuan

IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):6354-6371. doi: 10.1109/TPAMI.2022.3216606. Epub 2023 Apr 3.

In this paper, we present a self-training method, named ST3D++, with a holistic pseudo label denoising pipeline for unsupervised domain adaptation on 3D object detection. ST3D++ aims at reducing noise in pseudo label generation as well as alleviating the negative impacts of noisy pseudo labels on model training. First, ST3D++ pre-trains the 3D object detector on the labeled source domain with random object scaling (ROS) which is designed to reduce target domain pseudo label noise arising from object scale bias of the source domain. Then, the detector is progressively improved through alternating between generating pseudo labels and training the object detector with pseudo-labeled target domain data. Here, we equip the pseudo label generation process with a hybrid quality-aware triplet memory to improve the quality and stability of generated pseudo labels. Meanwhile, in the model training stage, we propose a source data assisted training strategy and a curriculum data augmentation policy to effectively rectify noisy gradient directions and avoid model over-fitting to noisy pseudo labeled data. These specific designs enable the detector to be trained on meticulously refined pseudo labeled target data with denoised training signals, and thus effectively facilitate adapting an object detector to a target domain without requiring annotations. Finally, our method is assessed on four 3D benchmark datasets (i.e., Waymo, KITTI, Lyft, and nuScenes) for three common categories (i.e., car, pedestrian and bicycle). ST3D++ achieves state-of-the-art performance on all evaluated settings, outperforming the corresponding baseline by a large margin (e.g., 9.6% ∼ 38.16% on Waymo → KITTI in terms of AP[Formula: see text]), and even surpasses the fully supervised oracle results on the KITTI 3D object detection benchmark with target prior. Code is available at https://github.com/CVMI-Lab/ST3D.
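As a rough illustration of the random object scaling (ROS) step named in the abstract, the sketch below scales one annotated box and the LiDAR points inside it by a shared random factor, so that a detector pre-trained on the source domain sees a wider range of object sizes. It is a minimal sketch under stated assumptions and not the authors' implementation: the function name `random_object_scaling`, the box layout `[cx, cy, cz, length, width, height, yaw]`, and the default `scale_range` are all hypothetical choices made for this example.

```python
import numpy as np


def random_object_scaling(points, box, scale_range=(0.75, 1.25), rng=None):
    """Scale one box and the points inside it by a shared random factor.

    points: (N, 3) float array of x, y, z coordinates.
    box:    (7,) float array [cx, cy, cz, length, width, height, yaw]
            (hypothetical layout chosen for this sketch).
    scale_range: assumed sampling interval, not a value taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    center, dims, yaw = box[:3], box[3:6], box[6]

    # Rotation about the z axis that maps box-local coordinates to world.
    c, s = np.cos(yaw), np.sin(yaw)
    rot = np.array([[c, -s, 0.0],
                    [s, c, 0.0],
                    [0.0, 0.0, 1.0]])

    # Express every point in the box's local frame (row-vector form of rot.T @ v).
    local = (points - center) @ rot

    # A point is inside the box if it lies within half the extent on every axis.
    inside = np.all(np.abs(local) <= dims / 2.0, axis=1)

    # Sample one factor and apply it to the inside points in the local frame.
    factor = rng.uniform(*scale_range)
    local[inside] *= factor

    # Map the scaled points back to world coordinates; outside points are untouched.
    new_points = points.copy()
    new_points[inside] = local[inside] @ rot.T + center

    # The box keeps its center and heading; only its dimensions change.
    new_box = box.copy()
    new_box[3:6] *= factor
    return new_points, new_box, factor


if __name__ == "__main__":
    pts = np.array([[0.5, 0.0, 0.0], [10.0, 10.0, 10.0]])
    gt_box = np.array([0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 0.0])
    scaled_pts, scaled_box, r = random_object_scaling(
        pts, gt_box, rng=np.random.default_rng(0)
    )
    print(r, scaled_pts, scaled_box)
```

Scaling in the box's local frame keeps the object centered and preserves its heading, so the enlarged or shrunken point cluster stays consistent with the rescaled box. A full pipeline would apply this to every annotated object in a scene before the usual augmentations.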


