基于自监督与协作分类器的少样本目标检测

Few-Shot Object Detection With Self-Supervising and Cooperative Classifier.

作者信息

Qi Di, Hu Jilin, Shen Jianbing

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5435-5446. doi: 10.1109/TNNLS.2022.3204597. Epub 2024 Apr 4.

DOI:10.1109/TNNLS.2022.3204597

Abstract

Few-shot object detection (FSOD), which detects novel objects with only a few training instances, has recently attracted more attention. Previous works focus on making the most use of label information of objects. Still, they fail to consider the structural and semantic information of the image itself and solve the misclassification between data-abundant base classes and data-scarce novel classes efficiently. In this article, we propose FSOD with Self-Supervising and Cooperative Classifier ( [Formula: see text]) approach to deal with those concerns. Specifically, we analyze the underlying performance degradation of novel classes in FSOD and discover that false-positive samples are the main reason. By looking into these false-positive samples, we further notice that misclassifying novel classes as base classes are the main cause. Thus, we introduce double RoI heads into the existing Fast-RCNN to learn more specific features for novel classes. We also consider using self-supervised learning (SSL) to learn more structural and semantic information. Finally, we propose a cooperative classifier (CC) with the base-novel regularization to maximize the interclass variance between base and novel classes. In the experiment, [Formula: see text] outperforms all the latest baselines in most cases on PASCAL VOC and COCO.

摘要

少样本目标检测（FSOD）能够仅通过少量训练实例来检测新目标，近来受到了更多关注。先前的工作专注于充分利用目标的标签信息。然而，它们未能考虑图像本身的结构和语义信息，也未能有效解决数据丰富的基础类别与数据稀缺的新类别之间的误分类问题。在本文中，我们提出了具有自监督和协作分类器（[公式：见正文]）的FSOD方法来处理这些问题。具体而言，我们分析了FSOD中新类别潜在的性能下降情况，并发现误报样本是主要原因。通过研究这些误报样本，我们进一步注意到将新类别误分类为基础类别是主要原因。因此，我们在现有的Fast - RCNN中引入双感兴趣区域（RoI）头，以学习新类别的更特定特征。我们还考虑使用自监督学习（SSL）来学习更多的结构和语义信息。最后，我们提出了一种具有基础 - 新类别正则化的协作分类器（CC），以最大化基础类别和新类别之间的类间方差。在实验中，[公式：见正文]在PASCAL VOC和COCO数据集上的大多数情况下优于所有最新的基线方法。

相似文献

Few-Shot Object Detection With Self-Supervising and Cooperative Classifier.

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5435-5446. doi: 10.1109/TNNLS.2022.3204597. Epub 2024 Apr 4.

Improved region proposal network for enhanced few-shot object detection.

Neural Netw. 2024 Dec;180:106699. doi: 10.1016/j.neunet.2024.106699. Epub 2024 Sep 3.

ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection.

IEEE Trans Image Process. 2024;33:5564-5576. doi: 10.1109/TIP.2024.3411771. Epub 2024 Oct 4.

Proposal Distribution Calibration for Few-Shot Object Detection.

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1911-1918. doi: 10.1109/TNNLS.2023.3331648. Epub 2025 Jan 7.

Category Knowledge-Guided Parameter Calibration for Few-Shot Object Detection.

IEEE Trans Image Process. 2023;32:1092-1107. doi: 10.1109/TIP.2023.3239197. Epub 2023 Feb 3.

Efficient Few-Shot Object Detection via Knowledge Inheritance.

IEEE Trans Image Process. 2023;32:321-334. doi: 10.1109/TIP.2022.3228162. Epub 2022 Dec 21.

Expandable-RCNN: toward high-efficiency incremental few-shot object detection.

Front Artif Intell. 2024 Apr 23;7:1377337. doi: 10.3389/frai.2024.1377337. eCollection 2024.

Prediction Calibration for Generalized Few-Shot Semantic Segmentation.

IEEE Trans Image Process. 2023;32:3311-3323. doi: 10.1109/TIP.2023.3282070. Epub 2023 Jun 12.

A Survey of Self-Supervised and Few-Shot Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4071-4089. doi: 10.1109/TPAMI.2022.3199617. Epub 2023 Mar 7.

Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection.

IEEE Trans Image Process. 2023;32:1992-2002. doi: 10.1109/TIP.2023.3261752. Epub 2023 Apr 4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于自监督与协作分类器的少样本目标检测

Few-Shot Object Detection With Self-Supervising and Cooperative Classifier.

作者信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献