使用轮廓片段的多尺度分类目标识别

Multiscale categorical object recognition using contour fragments.

作者信息

Shotton Jamie, Blake Andrew, Cipolla Roberto

机构信息

Toshiba Corporate R&D Center, Kawasaki, Japan.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1270-81. doi: 10.1109/TPAMI.2007.70772.

DOI:10.1109/TPAMI.2007.70772

PMID:18550908

Abstract

Psychophysical studies [9], [17] show that we can recognize objects using fragments of outline contour alone. This paper proposes a new automatic visual recognition system based only on local contour features, capable of localizing objects in space and scale. The system first builds a class-specific codebook of local fragments of contour using a novel formulation of chamfer matching. These local fragments allow recognition that is robust to within-class variation, pose changes, and articulation. Boosting combines these fragments into a cascaded sliding-window classifier, and mean shift is used to select strong responses as a final set of detections. We show how learning can be performed iteratively on both training and test sets to boot-strap an improved classifier. We compare with other methods based on contour and local descriptors in our detailed evaluation over 17 challenging categories, and obtain highly competitive results. The results confirm that contour is indeed a powerful cue for multi-scale and multi-class visual object recognition.

摘要

心理物理学研究[9]、[17]表明，我们仅使用轮廓线的片段就能识别物体。本文提出了一种全新的自动视觉识别系统，该系统仅基于局部轮廓特征，能够在空间和尺度上对物体进行定位。该系统首先使用一种新颖的倒角匹配公式构建特定类别的轮廓局部片段码本。这些局部片段使得识别对类内变化、姿态变化和关节运动具有鲁棒性。提升算法将这些片段组合成一个级联滑动窗口分类器，均值漂移用于选择强响应作为最终的检测集。我们展示了如何在训练集和测试集上迭代地进行学习，以引导改进分类器。在对17个具有挑战性的类别进行的详细评估中，我们将其与基于轮廓和局部描述符的其他方法进行了比较，并获得了极具竞争力的结果。结果证实，轮廓确实是多尺度和多类视觉物体识别的有力线索。

相似文献

Multiscale categorical object recognition using contour fragments.

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1270-81. doi: 10.1109/TPAMI.2007.70772.

Generic object recognition with boosting.

IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):416-31. doi: 10.1109/TPAMI.2006.54.

Robustness of shape descriptors to incomplete contour representations.

IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1793-804. doi: 10.1109/TPAMI.2005.225.

Sparse representation for coarse and fine object recognition.

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):555-67. doi: 10.1109/TPAMI.2006.84.

Signature detection and matching for document image retrieval.

IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2015-31. doi: 10.1109/TPAMI.2008.237.

Discriminative learning and recognition of image set classes using canonical correlations.

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1005-18. doi: 10.1109/TPAMI.2007.1037.

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1282-92. doi: 10.1109/TPAMI.2007.70769.

Context-based object-class recognition and retrieval by generalized correlograms.

IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1818-33. doi: 10.1109/TPAMI.2007.1098.

Groups of adjacent contour segments for object detection.

IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):36-51. doi: 10.1109/TPAMI.2007.1144.

Multicue HMM-UKF for real-time contour tracking.

IEEE Trans Pattern Anal Mach Intell. 2006 Sep;28(9):1525-9. doi: 10.1109/TPAMI.2006.190.

引用本文的文献

Ship detection based on semantic aggregation for video surveillance images with complex backgrounds.

PeerJ Comput Sci. 2024 Dec 23;10:e2624. doi: 10.7717/peerj-cs.2624. eCollection 2024.

Failure Handling of Robotic Pick and Place Tasks With Multimodal Cues Under Partial Object Occlusion.

Front Neurorobot. 2021 Mar 8;15:570507. doi: 10.3389/fnbot.2021.570507. eCollection 2021.

Bayesian Edge Detector Using Deformable Directivity-Aware Sampling Window.

Entropy (Basel). 2020 Sep 25;22(10):1080. doi: 10.3390/e22101080.

A Deep-Learning Model with Task-Specific Bounding Box Regressors and Conditional Back-Propagation for Moving Object Detection in ADAS Applications.

Sensors (Basel). 2020 Sep 15;20(18):5269. doi: 10.3390/s20185269.

Orientation-Constrained System for Lamp Detection in Buildings Based on Computer Vision.

Sensors (Basel). 2019 Mar 28;19(7):1516. doi: 10.3390/s19071516.

Shape retrieval using hierarchical total Bregman soft clustering.

IEEE Trans Pattern Anal Mach Intell. 2012 Dec;34(12):2407-19. doi: 10.1109/TPAMI.2012.44.

Detection of neuron membranes in electron microscopy images using a serial neural network architecture.

Med Image Anal. 2010 Dec;14(6):770-83. doi: 10.1016/j.media.2010.06.002. Epub 2010 Jun 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用轮廓片段的多尺度分类目标识别

Multiscale categorical object recognition using contour fragments.

作者信息

Shotton Jamie, Blake Andrew, Cipolla Roberto

机构信息

Toshiba Corporate R&D Center, Kawasaki, Japan.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1270-81. doi: 10.1109/TPAMI.2007.70772.

DOI:10.1109/TPAMI.2007.70772

PMID:18550908

Abstract

摘要

使用轮廓片段的多尺度分类目标识别

Multiscale categorical object recognition using contour fragments.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用轮廓片段的多尺度分类目标识别

Multiscale categorical object recognition using contour fragments.

作者信息

机构信息

出版信息

相似文献

引用本文的文献