• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于小样本的野外目标检测与视角估计

Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3090-3106. doi: 10.1109/TPAMI.2022.3174072. Epub 2023 Feb 3.

DOI:10.1109/TPAMI.2022.3174072
PMID:35536822
Abstract

Detecting objects and estimating their viewpoints in images are key tasks of 3D scene understanding. Recent approaches have achieved excellent results on very large benchmarks for object detection and viewpoint estimation. However, performances are still lagging behind for novel object categories with few samples. In this paper, we tackle the problems of few-shot object detection and few-shot viewpoint estimation. We demonstrate on both tasks the benefits of guiding the network prediction with class-representative features extracted from data in different modalities: image patches for object detection, and aligned 3D models for viewpoint estimation. Despite its simplicity, our method outperforms state-of-the-art methods by a large margin on a range of datasets, including PASCAL and COCO for few-shot object detection, and Pascal3D+ and ObjectNet3D for few-shot viewpoint estimation. Furthermore, when the 3D model is not available, we introduce a simple category-agnostic viewpoint estimation method by exploiting geometrical similarities and consistent pose labeling across different classes. While it moderately reduces performance, this approach still obtains better results than previous methods in this setting. Last, for the first time, we tackle the combination of both few-shot tasks, on three challenging benchmarks for viewpoint estimation in the wild, ObjectNet3D, Pascal3D+ and Pix3D, showing very promising results.

摘要

在图像中检测目标并估计其视角是 3D 场景理解的关键任务。最近的方法在用于目标检测和视角估计的大型基准测试中取得了优异的结果。然而,对于新的、样本较少的物体类别,性能仍然落后。在本文中,我们解决了少样本目标检测和少样本视角估计的问题。我们在这两个任务中都展示了从不同模态的数据中提取出具有代表性的特征来指导网络预测的好处:用于目标检测的图像补丁,以及用于视角估计的对齐的 3D 模型。尽管我们的方法很简单,但它在一系列数据集上的性能都优于最先进的方法,包括用于少样本目标检测的 PASCAL 和 COCO 数据集,以及用于少样本视角估计的 Pascal3D+ 和 ObjectNet3D 数据集。此外,当 3D 模型不可用时,我们通过利用不同类别之间的几何相似性和一致的姿态标注,引入了一种简单的类别不可知的视角估计方法。虽然它适度降低了性能,但在这种设置下,这种方法仍然比以前的方法获得了更好的结果。最后,我们首次解决了这两个少样本任务的组合问题,在三个具有挑战性的野外视角估计基准上,即 ObjectNet3D、Pascal3D+ 和 Pix3D,取得了非常有前景的结果。

相似文献

1
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild.基于小样本的野外目标检测与视角估计
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3090-3106. doi: 10.1109/TPAMI.2022.3174072. Epub 2023 Feb 3.
2
Multi-view and 3D deformable part models.多视图和 3D 可变形部件模型。
IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.
3
Composite Object Relation Modeling for Few-Shot Scene Recognition.用于少样本场景识别的复合对象关系建模
IEEE Trans Image Process. 2023;32:5678-5691. doi: 10.1109/TIP.2023.3321475. Epub 2023 Oct 17.
4
Improved region proposal network for enhanced few-shot object detection.改进的区域提议网络,用于增强少样本目标检测。
Neural Netw. 2024 Dec;180:106699. doi: 10.1016/j.neunet.2024.106699. Epub 2024 Sep 3.
5
Towards Generalized Few-Shot Open-Set Object Detection.迈向广义少样本开放集目标检测
IEEE Trans Image Process. 2024 Feb 15;PP. doi: 10.1109/TIP.2024.3364495.
6
Semantics-Guided Contrastive Network for Zero-Shot Object Detection.用于零样本目标检测的语义引导对比网络
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1530-1544. doi: 10.1109/TPAMI.2021.3140070. Epub 2024 Feb 6.
7
Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation.用于少样本分割的双分支多级语义学习
IEEE Trans Image Process. 2024;33:1432-1447. doi: 10.1109/TIP.2024.3364056. Epub 2024 Feb 21.
8
Few-Shot Object Detection: Application to Medieval Musicological Studies.少样本目标检测:在中世纪音乐学研究中的应用。
J Imaging. 2022 Jan 19;8(2):18. doi: 10.3390/jimaging8020018.
9
Capturing the Geometry of Object Categories from Video Supervision.从视频监督中捕捉对象类别的几何信息。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):261-275. doi: 10.1109/TPAMI.2018.2871117. Epub 2018 Sep 19.
10
Generative Multi-Label Zero-Shot Learning.生成式多标签零样本学习
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14611-14624. doi: 10.1109/TPAMI.2023.3295772. Epub 2023 Nov 3.

引用本文的文献

1
Learning from Demonstrations via Deformable Residual Multi-Attention Domain-Adaptive Meta-Learning.通过可变形残差多注意力域自适应元学习从示范中学习
Biomimetics (Basel). 2025 Feb 11;10(2):103. doi: 10.3390/biomimetics10020103.
2
Extreme R-CNN: Few-Shot Object Detection via Sample Synthesis and Knowledge Distillation.极端区域卷积神经网络:通过样本合成与知识蒸馏实现少样本目标检测
Sensors (Basel). 2024 Dec 7;24(23):7833. doi: 10.3390/s24237833.
3
Few-Shot Object Detection in Remote Sensing Images via Data Clearing and Stationary Meta-Learning.
通过数据清理和静态元学习实现遥感图像中的少样本目标检测
Sensors (Basel). 2024 Jun 15;24(12):3882. doi: 10.3390/s24123882.
4
Towards Stabilized Few-Shot Object Detection with Less Forgetting via Sample Normalization.通过样本归一化实现少遗忘的稳定少样本目标检测
Sensors (Basel). 2024 May 27;24(11):3456. doi: 10.3390/s24113456.
5
MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning.MSFFAL:基于多尺度特征融合和注意力学习的小样本目标检测。
Sensors (Basel). 2023 Mar 30;23(7):3609. doi: 10.3390/s23073609.