Suppr超能文献

EPNet++:用于多模态 3D 目标检测的级联双向融合。

EPNet++: Cascade Bi-Directional Fusion for Multi-Modal 3D Object Detection.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8324-8341. doi: 10.1109/TPAMI.2022.3228806. Epub 2023 Jun 5.

Abstract

Recently, fusing the LiDAR point cloud and camera image to improve the performance and robustness of 3D object detection has received more and more attention, as these two modalities naturally possess strong complementarity. In this paper, we propose EPNet++ for multi-modal 3D object detection by introducing a novel Cascade Bi-directional Fusion (CB-Fusion) module and a Multi-Modal Consistency (MC) loss. More concretely, the proposed CB-Fusion module enhances point features with plentiful semantic information absorbed from the image features in a cascade bi-directional interaction fusion manner, leading to more powerful and discriminative feature representations. The MC loss explicitly guarantees the consistency between predicted scores from two modalities to obtain more comprehensive and reliable confidence scores. The experimental results on the KITTI, JRDB and SUN-RGBD datasets demonstrate the superiority of EPNet++ over the state-of-the-art methods. Besides, we emphasize a critical but easily overlooked problem, which is to explore the performance and robustness of a 3D detector in a sparser scene. Extensive experiments present that EPNet++ outperforms the existing SOTA methods with remarkable margins in highly sparse point cloud cases, which might be an available direction to reduce the expensive cost of LiDAR sensors.

摘要

最近,融合激光雷达点云和相机图像以提高 3D 目标检测的性能和鲁棒性受到了越来越多的关注,因为这两种模态自然具有很强的互补性。在本文中,我们通过引入一种新的级联双向融合(CB-Fusion)模块和多模态一致性(MC)损失,提出了用于多模态 3D 目标检测的 EPNet++。具体来说,所提出的 CB-Fusion 模块以级联双向交互融合的方式增强了具有丰富语义信息的点特征,从而产生更强大和有区分性的特征表示。MC 损失明确保证了来自两种模态的预测得分之间的一致性,以获得更全面和可靠的置信得分。在 KITTI、JRDB 和 SUN-RGBD 数据集上的实验结果表明,EPNet++ 优于最先进的方法。此外,我们强调了一个关键但容易被忽视的问题,即探索 3D 检测器在稀疏场景中的性能和鲁棒性。大量实验表明,在高度稀疏的点云情况下,EPNet++ 优于现有的 SOTA 方法,这可能是降低激光雷达传感器昂贵成本的一个可行方向。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验