
Fine-Grained Multilevel Fusion for Anti-Occlusion Monocular 3D Object Detection.

Author Information

Liu He, Liu Huaping, Wang Yikai, Sun Fuchun, Huang Wenbing

Publication Information

IEEE Trans Image Process. 2022;31:4050-4061. doi: 10.1109/TIP.2022.3180210. Epub 2022 Jun 14.

Abstract

We propose a deep fine-grained multi-level fusion architecture for monocular 3D object detection, together with an additionally designed anti-occlusion optimization process. Conventional monocular 3D object detection methods usually leverage geometric constraints such as keypoints, object shape relationships, and 3D-to-2D optimizations to offset the lack of accurate depth information. However, these methods still struggle to extract rich information directly from depth estimation for fusion. To solve this problem, we integrate monocular 3D features with a pseudo-LiDAR filter generation network across fine-grained multi-level layers. Our network exploits the inherent multi-scale structure and promotes the flow of depth and semantic information across different stages. The new architecture can thus obtain features that incorporate more reliable depth information. At the same time, occlusion among objects is prevalent in natural scenes yet remains largely unsolved. We propose a novel loss function that aims to alleviate the occlusion problem. Extensive experiments show that the framework achieves competitive performance, especially in complex scenes with occlusion.
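The abstract does not detail the fusion operation, but the core idea of combining monocular image features with pseudo-LiDAR-derived depth features at each pyramid level can be illustrated generically. The sketch below is a hypothetical, dependency-free stand-in: channel-wise concatenation followed by a learned per-pixel linear projection (the role a 1x1 convolution would play in a real network). The function name `fuse_level` and all shapes are assumptions for illustration, not the authors' implementation.

```python
def fuse_level(img_feat, depth_feat, weights):
    """Fuse image and depth features at one pyramid level.

    img_feat, depth_feat: nested lists of shape (H, W, C), i.e. a
    per-pixel channel vector at each spatial location.
    weights: list of output-channel columns, each of length
    C_img + C_depth, acting as a 1x1-convolution stand-in.
    """
    out = []
    for row_i, row_d in zip(img_feat, depth_feat):
        out_row = []
        for px_i, px_d in zip(row_i, row_d):
            x = px_i + px_d  # concatenate channel vectors
            # Linear projection of the concatenated vector.
            out_row.append([sum(xi * wi for xi, wi in zip(x, col))
                            for col in weights])
        out.append(out_row)
    return out


# Toy usage: a 1x1 spatial map with 2 image channels and 1 depth
# channel, projected with an identity matrix so the fused vector
# simply carries both modalities forward.
img = [[[1.0, 2.0]]]
dep = [[[0.5]]]
w = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]
print(fuse_level(img, dep, w))  # → [[[1.0, 2.0, 0.5]]]
```

In the paper's multi-scale setting, such a fusion step would be repeated at each resolution level so that depth and semantic information mix throughout the feature hierarchy rather than only at the final stage.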

