• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多视图和 3D 可变形部件模型。

Multi-view and 3D deformable part models.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.

DOI:10.1109/TPAMI.2015.2408347
PMID:26440264
Abstract

As objects are inherently 3D, they have been modeled in 3D in the early days of computer vision. Due to the ambiguities arising from mapping 2D features to 3D models, 3D object representations have been neglected and 2D feature-based models are the predominant paradigm in object detection nowadays. While such models have achieved outstanding bounding box detection performance, they come with limited expressiveness, as they are clearly limited in their capability of reasoning about 3D shape or viewpoints. In this work, we bring the worlds of 3D and 2D object representations closer, by building an object detector which leverages the expressive power of 3D object representations while at the same time can be robustly matched to image evidence. To that end, we gradually extend the successful deformable part model [1] to include viewpoint information and part-level 3D geometry information, resulting in several different models with different level of expressiveness. We end up with a 3D object model, consisting of multiple object parts represented in 3D and a continuous appearance model. We experimentally verify that our models, while providing richer object hypotheses than the 2D object models, provide consistently better joint object localization and viewpoint estimation than the state-of-the-art multi-view and 3D object detectors on various benchmarks (KITTI [2] , 3D object classes [3] , Pascal3D+ [4] , Pascal VOC 2007 [5] , EPFL multi-view cars[6] ).

摘要

由于物体本质上是三维的,因此在计算机视觉的早期阶段就已经对其进行了三维建模。由于将二维特征映射到三维模型会产生歧义,因此三维物体表示被忽视了,而二维特征基模型现在是物体检测中的主要范例。虽然这些模型在边界框检测性能方面取得了出色的成绩,但它们的表达能力有限,因为它们在推理三维形状或视角方面的能力显然有限。在这项工作中,我们通过构建一个利用三维物体表示的表达能力的物体检测器,使三维和二维物体表示的世界更加接近,同时可以稳健地与图像证据匹配。为此,我们逐渐扩展了成功的可变形部件模型[1],以包含视点信息和部件级别的三维几何信息,从而产生了具有不同表达能力的几个不同模型。最终得到了一个 3D 物体模型,由多个以 3D 表示的物体部件和连续的外观模型组成。我们通过实验验证了我们的模型,虽然它们提供了比二维物体模型更丰富的物体假设,但与各种基准(KITTI[2]、3D 物体类别[3]、Pascal3D+[4]、Pascal VOC 2007[5]、EPFL 多视图汽车[6])上的最新多视图和 3D 物体检测器相比,提供了更一致的物体定位和视角估计。

相似文献

1
Multi-view and 3D deformable part models.多视图和 3D 可变形部件模型。
IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.
2
Detailed 3D representations for object recognition and modeling.用于目标识别和建模的详细 3D 表示。
IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2608-23. doi: 10.1109/TPAMI.2013.87.
3
Learning 3D Object Templates by Quantizing Geometry and Appearance Spaces.通过量化几何和外观空间来学习 3D 对象模板。
IEEE Trans Pattern Anal Mach Intell. 2015 Jun;37(6):1190-205. doi: 10.1109/TPAMI.2014.2362141.
4
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild.基于小样本的野外目标检测与视角估计
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3090-3106. doi: 10.1109/TPAMI.2022.3174072. Epub 2023 Feb 3.
5
Novel Views of Objects from a Single Image.从单张图像看新视角。
IEEE Trans Pattern Anal Mach Intell. 2017 Aug;39(8):1576-1590. doi: 10.1109/TPAMI.2016.2601093. Epub 2016 Aug 17.
6
Learning Category-Specific Deformable 3D Models for Object Reconstruction.学习用于目标重建的类别特定可变形 3D 模型。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):719-731. doi: 10.1109/TPAMI.2016.2574713. Epub 2016 Jun 1.
7
Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.学习与或模型以表示用于汽车检测和视角估计的上下文与遮挡情况。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep 1;38(9):1829-1843. doi: 10.1109/TPAMI.2015.2497699. Epub 2015 Nov 4.
8
A New 3D Object Pose Detection Method Using LIDAR Shape Set.一种使用激光雷达形状集的新型三维物体姿态检测方法。
Sensors (Basel). 2018 Mar 16;18(3):882. doi: 10.3390/s18030882.
9
Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes.基于KITTI城市场景中3D感知特征的视觉目标识别
Sensors (Basel). 2015 Apr 20;15(4):9228-50. doi: 10.3390/s150409228.
10
Relation Graph Network for 3D Object Detection in Point Clouds.用于点云三维目标检测的关系图网络
IEEE Trans Image Process. 2021;30:92-107. doi: 10.1109/TIP.2020.3031371. Epub 2020 Nov 18.

引用本文的文献

1
Improving object detection quality with structural constraints.利用结构约束提高目标检测质量。
PLoS One. 2022 May 18;17(5):e0267863. doi: 10.1371/journal.pone.0267863. eCollection 2022.
2
A comprehensive swarming intelligent method for optimizing deep learning-based object detection by unmanned ground vehicles.一种用于优化基于深度学习的无人地面车辆目标检测的综合群体智能方法。
PLoS One. 2021 May 13;16(5):e0251339. doi: 10.1371/journal.pone.0251339. eCollection 2021.
3
Estimation of Pedestrian Pose Orientation Using Soft Target Training Based on Teacher⁻Student Framework.
基于师生框架的软目标训练估计行人姿态方向。
Sensors (Basel). 2019 Mar 6;19(5):1147. doi: 10.3390/s19051147.
4
Object Detection and Classification by Decision-Level Fusion for Intelligent Vehicle Systems.用于智能车辆系统的基于决策级融合的目标检测与分类
Sensors (Basel). 2017 Jan 22;17(1):207. doi: 10.3390/s17010207.