GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.

Author information

Lu Yan, Ma Xinzhu, Yang Lei, Zhang Tianzhu, Liu Yating, Chu Qi, He Tong, Li Yonghui, Ouyang Wanli

Publication information

IEEE Trans Pattern Anal Mach Intell. 2025 Feb;47(2):900-915. doi: 10.1109/TPAMI.2024.3475583. Epub 2025 Jan 9.

DOI:10.1109/TPAMI.2024.3475583

Abstract

Geometry plays a significant role in monocular 3D object detection. It can be used to estimate object depth through the perspective projection between an object's physical size and its 2D projection in the image plane, which introduces mathematical priors into deep models. However, this projection process also introduces error amplification: the error of the estimated height is amplified and reflected in the projected depth. This leads to unreliable depth inferences and also impairs training stability. To tackle this problem, we propose a novel Geometry Uncertainty Propagation Network (GUPNet++) that models geometry projection in a probabilistic manner. This ensures that depth predictions are well-bounded and associated with a reasonable uncertainty. The significance of introducing such geometric uncertainty is two-fold: (1) It models the uncertainty propagation relationship of the geometry projection during training, improving the stability and efficiency of end-to-end model learning. (2) It can be used to derive a highly reliable confidence that indicates the quality of the 3D detection result, enabling more reliable detection inference. Experiments show that the proposed approach not only obtains state-of-the-art (SOTA) performance in image-based monocular 3D detection but also demonstrates superior efficacy with a simplified framework. The code and model will be released at https://github.com/SuperMHP/GUPNet_Plus.
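The error amplification described in the abstract can be illustrated with the standard pinhole relation, depth = focal length × physical height / projected height. The sketch below is not the authors' implementation: it assumes the 2D height and focal length are exact, so that uncertainty in the 3D height scales linearly into depth, and the function names, the sample numbers, and the exp(-std) confidence mapping are all hypothetical choices made for illustration.

```python
import math


def project_depth(focal_px, height_3d_m, height_2d_px):
    """Pinhole relation: depth = focal * H_3d / h_2d."""
    return focal_px * height_3d_m / height_2d_px


def propagate_height_uncertainty(focal_px, height_3d_mean, height_3d_std, height_2d_px):
    """Propagate 3D-height uncertainty to depth.

    Assumption: focal length and 2D height are treated as exact, so depth is a
    linear function of the 3D height and both mean and std scale by focal / h_2d.
    """
    scale = focal_px / height_2d_px
    return scale * height_3d_mean, scale * height_3d_std


def confidence_from_std(depth_std):
    """Hypothetical mapping from depth uncertainty to a score in (0, 1]."""
    return math.exp(-depth_std)


# Illustrative numbers only (not taken from the paper).
focal_px = 700.0
height_2d_px = 25.0

# A 0.1 m error in the estimated 3D height becomes a depth error of roughly
# (focal / h_2d) * 0.1 = 28 * 0.1 m, which is the amplification effect.
depth_true = project_depth(focal_px, 1.5, height_2d_px)
depth_off = project_depth(focal_px, 1.6, height_2d_px)
depth_error = depth_off - depth_true

# The same scale factor applies to the standard deviation of the height estimate.
depth_mean, depth_std = propagate_height_uncertainty(focal_px, 1.5, 0.1, height_2d_px)
score = confidence_from_std(depth_std)

print(depth_true, depth_error, depth_mean, depth_std, score)
```

Because the scale factor is focal / h_2d, small (distant) objects with few pixels of projected height amplify height errors the most, which is consistent with the motivation for attaching an uncertainty to the projected depth.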

Similar articles

Vertex points are not enough: Monocular 3D object detection via intra- and inter-plane constraints.

Neural Netw. 2023 May;162:350-358. doi: 10.1016/j.neunet.2023.02.038. Epub 2023 Mar 2.

GAC3D: improving monocular 3D object detection with ground-guide model and adaptive convolution.

PeerJ Comput Sci. 2021 Oct 6;7:e686. doi: 10.7717/peerj-cs.686. eCollection 2021.

MonoGRNet: A General Framework for Monocular 3D Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5170-5184. doi: 10.1109/TPAMI.2021.3074363. Epub 2022 Aug 4.

MonoAux: Fully Exploiting Auxiliary Information and Uncertainty for Monocular 3D Object Detection.

Cyborg Bionic Syst. 2024 Mar 27;5:0097. doi: 10.34133/cbsystems.0097. eCollection 2024.

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection.

IEEE Trans Image Process. 2023 Nov 21;PP. doi: 10.1109/TIP.2023.3333225.

Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.

Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.

Toward 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose From Monocular Image.

IEEE Trans Image Process. 2023;32:3080-3091. doi: 10.1109/TIP.2023.3275535. Epub 2023 May 30.

Fine-Grained Multilevel Fusion for Anti-Occlusion Monocular 3D Object Detection.

IEEE Trans Image Process. 2022;31:4050-4061. doi: 10.1109/TIP.2022.3180210. Epub 2022 Jun 14.

Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image.

IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):6480-6494. doi: 10.1109/TPAMI.2022.3209968. Epub 2023 Apr 3.
