S-CNN：用于目标检测的子类别感知卷积网络

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.

作者信息

Chen Tao, Lu Shijian, Fan Jiayuan

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Oct;40(10):2522-2528. doi: 10.1109/TPAMI.2017.2756936. Epub 2017 Sep 26.

DOI:10.1109/TPAMI.2017.2756936

Abstract

The marriage between the deep convolutional neural network (CNN) and region proposals has made breakthroughs for object detection in recent years. While the discriminative object features are learned via a deep CNN for classification, the large intra-class variation and deformation still limit the performance of the CNN based object detection. We propose a subcategory-aware CNN (S-CNN) to solve the object intra-class variation problem. In the proposed technique, the training samples are first grouped into multiple subcategories automatically through a novel instance sharing maximum margin clustering process. A multi-component Aggregated Channel Feature (ACF) detector is then trained to produce more latent training samples, where each ACF component corresponds to one clustered subcategory. The produced latent samples together with their subcategory labels are further fed into a CNN classifier to filter out false proposals for object detection. An iterative learning algorithm is designed for the joint optimization of image subcategorization, multi-component ACF detector, and subcategory-aware CNN classifier. Experiments on INRIA Person dataset, Pascal VOC 2007 dataset and MS COCO dataset show that the proposed technique clearly outperforms the state-of-the-art methods for generic object detection.

摘要

近年来，深度卷积神经网络（CNN）与区域建议的结合在目标检测方面取得了突破。虽然通过深度CNN学习判别性目标特征用于分类，但类内的较大变化和变形仍然限制了基于CNN的目标检测性能。我们提出了一种子类别感知CNN（S-CNN）来解决目标类内变化问题。在所提出的技术中，首先通过一种新颖的实例共享最大间隔聚类过程将训练样本自动分组为多个子类别。然后训练一个多组件聚合通道特征（ACF）检测器以产生更多潜在训练样本，其中每个ACF组件对应一个聚类的子类别。所产生的潜在样本及其子类别标签进一步输入到CNN分类器中，以过滤掉用于目标检测的错误建议。设计了一种迭代学习算法用于图像子分类、多组件ACF检测器和子类别感知CNN分类器的联合优化。在INRIA Person数据集、Pascal VOC 2007数据集和MS COCO数据集上的实验表明，所提出的技术在通用目标检测方面明显优于当前的先进方法。

相似文献

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2018 Oct;40(10):2522-2528. doi: 10.1109/TPAMI.2017.2756936. Epub 2017 Sep 26.

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection.

IEEE Trans Image Process. 2019 Jan;28(1):265-278. doi: 10.1109/TIP.2018.2867198.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

HCP: A Flexible CNN Framework for Multi-label Image Classification.

IEEE Trans Pattern Anal Mach Intell. 2016 Sep 1;38(9):1901-1907. doi: 10.1109/TPAMI.2015.2491929. Epub 2015 Oct 26.

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2020 Jan;42(1):176-191. doi: 10.1109/TPAMI.2018.2876304. Epub 2018 Oct 16.

Co-trained convolutional neural networks for automated detection of prostate cancer in multi-parametric MRI.

Med Image Anal. 2017 Dec;42:212-227. doi: 10.1016/j.media.2017.08.006. Epub 2017 Aug 24.

Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.

IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):1914-1927. doi: 10.1109/TPAMI.2019.2957780. Epub 2021 May 11.

Object Detection Networks on Convolutional Feature Maps.

IEEE Trans Pattern Anal Mach Intell. 2017 Jul;39(7):1476-1481. doi: 10.1109/TPAMI.2016.2601099. Epub 2016 Aug 17.

Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection.

IEEE Trans Image Process. 2020;29(1):2052-2065. doi: 10.1109/TIP.2019.2947792. Epub 2019 Oct 22.

引用本文的文献

Application of Artificial Intelligence in Anatomical Structure Recognition of Standard Section of Fetal Heart.

Comput Math Methods Med. 2023 Jan 24;2023:5650378. doi: 10.1155/2023/5650378. eCollection 2023.

Categorization of Images Using Autoencoder Hashing and Training of Intra Bin Classifiers for Image Classification and Annotation.

J Med Syst. 2018 Jun 11;42(7):132. doi: 10.1007/s10916-018-0986-6.

Deep Learning for Computer Vision: A Brief Review.

Comput Intell Neurosci. 2018 Feb 1;2018:7068349. doi: 10.1155/2018/7068349. eCollection 2018.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

S-CNN：用于目标检测的子类别感知卷积网络

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献