Xu Xiuwei, Wang Ziwei, Zhou Jie, Lu Jiwen
IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1165-1180. doi: 10.1109/TPAMI.2023.3328880. Epub 2024 Jan 8.
In this paper, we propose a weakly-supervised approach to 3D object detection that makes it possible to train a strong 3D detector with position-level annotations (i.e., annotations of object centers and categories). To remedy the information loss from box annotations to centers, our method uses synthetic 3D shapes to convert the position-level annotations into virtual scenes with box-level annotations, and in turn exploits the fully-annotated virtual scenes to complement the real labels. Specifically, we first present a shape-guided label-enhancement method, which assembles 3D shapes into physically reasonable virtual scenes according to the coarse scene layout extracted from the position-level annotations. We then transfer the information contained in the virtual scenes back to the real ones with a virtual-to-real domain adaptation method, which refines the annotated object centers and additionally supervises the training of the detector with the virtual scenes. Since the shape-guided label-enhancement method generates virtual scenes from hand-crafted physical constraints, the layouts of the resulting fixed virtual scenes may be implausible for some object combinations. To address this, we further present differentiable label enhancement, which optimizes the virtual scenes, including object scales, orientations, and locations, in a data-driven manner. Moreover, we propose a label-assisted self-training strategy to fully exploit the capability of the detector: by reusing the position-level annotations and the virtual scenes, we fuse information from both domains and generate box-level pseudo labels on the real scenes, which allows us to train a detector directly in a fully-supervised manner.
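The core idea behind differentiable label enhancement, treating virtual-scene object parameters as optimizable variables refined by gradient descent on a layout objective, can be sketched as follows. This is an illustrative toy, not the paper's implementation: it optimizes only 2D object centers under two assumed loss terms (stay near the annotated centers; keep objects a minimum distance apart as a stand-in for physical-plausibility constraints), with hand-derived gradients.

```python
# Toy sketch of data-driven layout refinement (all names and the loss
# form are assumptions for illustration; the paper optimizes scales,
# orientations, and locations of full 3D scenes).

def layout_loss_grad(centers, anchors, margin=1.0, lam=1.0):
    """Return the layout loss and its analytic gradient w.r.t. each 2D center."""
    n = len(centers)
    grads = [[0.0, 0.0] for _ in range(n)]
    loss = 0.0
    # Attraction: squared distance to the annotated (position-level) center.
    for i in range(n):
        dx = centers[i][0] - anchors[i][0]
        dy = centers[i][1] - anchors[i][1]
        loss += dx * dx + dy * dy
        grads[i][0] += 2 * dx
        grads[i][1] += 2 * dy
    # Repulsion: hinge penalty on pairs closer than `margin` (objects
    # should not interpenetrate in a physically reasonable layout).
    for i in range(n):
        for j in range(i + 1, n):
            dx = centers[i][0] - centers[j][0]
            dy = centers[i][1] - centers[j][1]
            d = (dx * dx + dy * dy) ** 0.5
            if 1e-9 < d < margin:
                loss += lam * (margin - d) ** 2
                g = -2 * lam * (margin - d) / d  # chain rule through d
                grads[i][0] += g * dx
                grads[i][1] += g * dy
                grads[j][0] -= g * dx
                grads[j][1] -= g * dy
    return loss, grads

def refine_centers(anchors, steps=200, lr=0.05):
    """Gradient-descend the layout loss starting from the annotated centers."""
    centers = [list(a) for a in anchors]
    for _ in range(steps):
        _, grads = layout_loss_grad(centers, anchors)
        for c, g in zip(centers, grads):
            c[0] -= lr * g[0]
            c[1] -= lr * g[1]
    return centers

# Two annotated centers placed implausibly close: optimization pushes
# them apart toward the margin while keeping each near its annotation.
refined = refine_centers([(0.0, 0.0), (0.4, 0.0)])
```

In the example, the two centers settle where the attraction and repulsion gradients balance, a small-scale analogue of refining an implausible virtual layout in a data-driven rather than hand-crafted way.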
Extensive experiments on the widely used ScanNet and Matterport3D datasets show that our approach surpasses current weakly-supervised and semi-supervised methods by a large margin, and achieves detection performance comparable to some popular fully-supervised methods with less than 5% of the labeling effort.
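The pseudo-label generation step of the label-assisted self-training strategy, fusing position-level real annotations with box-level information from the virtual scenes, can be sketched as below. The interface and the nearest-neighbor matching rule are assumptions for illustration (orientation is omitted): each annotated real center borrows the box size of the nearest same-category virtual object.

```python
# Illustrative sketch (assumed interface, not the paper's code) of fusing
# the two domains into box-level pseudo labels on the real scenes.

def make_pseudo_labels(real_anns, virtual_boxes):
    """real_anns: [(center_xyz, category)] position-level labels.
    virtual_boxes: [(center_xyz, size_xyz, category)] from virtual scenes.
    Returns box-level pseudo labels [(center_xyz, size_xyz, category)]."""
    pseudo = []
    for center, cat in real_anns:
        # Candidate virtual objects of the same category.
        cands = [(c, s) for c, s, k in virtual_boxes if k == cat]
        if not cands:
            continue  # no virtual evidence for this category
        # The nearest virtual object (by center distance) supplies the size.
        def sq_dist(vc):
            return sum((a - b) ** 2 for a, b in zip(center, vc))
        _, size = min(cands, key=lambda cs: sq_dist(cs[0]))
        pseudo.append((center, size, cat))
    return pseudo

real = [((1.0, 0.0, 0.5), "chair"), ((3.0, 1.0, 0.4), "table")]
virtual = [((1.1, 0.1, 0.5), (0.6, 0.6, 1.0), "chair"),
           ((2.9, 1.2, 0.4), (1.5, 0.9, 0.8), "table")]
labels = make_pseudo_labels(real, virtual)
```

The resulting box-level pseudo labels keep the (refined) real centers and categories while taking extents from the virtual scenes, which is what allows a detector to then be trained directly in a fully-supervised manner.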