• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MSFFAL:基于多尺度特征融合和注意力学习的小样本目标检测。

MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning.

机构信息

Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China.

School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Sensors (Basel). 2023 Mar 30;23(7):3609. doi: 10.3390/s23073609.

DOI:10.3390/s23073609
PMID:37050671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10099036/
Abstract

Few-shot object detection (FSOD) is proposed to solve the application problem of traditional detectors in scenarios lacking training samples. The meta-learning methods have attracted the researchers' attention for their excellent generalization performance. They usually select the same class of support features according to the query labels to weight the query features. However, the model cannot possess the ability of active identification only by using the same category support features, and feature selection causes difficulties in the testing process without labels. The single-scale feature of the model also leads to poor performance in small object detection. In addition, the hard samples in the support branch impact the backbone's representation of the support features, thus impacting the feature weighting process. To overcome these problems, we propose a multi-scale feature fusion and attentive learning (MSFFAL) framework for few-shot object detection. We first design the backbone with multi-scale feature fusion and channel attention mechanism to improve the model's detection accuracy on small objects and the representation of hard support samples. Based on this, we propose an attention loss to replace the feature weighting module. The loss allows the model to consistently represent the objects of the same category in the two branches and realizes the active recognition of the model. The model no longer depends on query labels to select features when testing, optimizing the model testing process. The experiments show that MSFFAL outperforms the state-of-the-art (SOTA) by 0.7-7.8% on the Pascal VOC and exhibits 1.61 times the result of the baseline model in MS COCO's small objects detection.

摘要

少样本目标检测 (FSOD) 旨在解决传统探测器在缺乏训练样本的场景下的应用问题。元学习方法因其出色的泛化性能引起了研究人员的关注。它们通常根据查询标签选择相同类别的支持特征来对查询特征进行加权。然而,模型仅通过使用相同类别的支持特征无法具备主动识别的能力,并且在没有标签的测试过程中特征选择会带来困难。模型的单尺度特征也导致在小目标检测中性能较差。此外,支持分支中的硬样本会影响骨干网络对支持特征的表示,从而影响特征加权过程。为了解决这些问题,我们提出了一种用于少样本目标检测的多尺度特征融合和注意学习 (MSFFAL) 框架。我们首先设计了具有多尺度特征融合和通道注意力机制的骨干网络,以提高模型在小目标上的检测精度和硬支持样本的表示能力。在此基础上,我们提出了一种注意力损失来替代特征加权模块。该损失使模型能够在两个分支中一致地表示同一类别的物体,从而实现模型的主动识别。模型在测试时不再依赖查询标签来选择特征,优化了模型的测试过程。实验表明,在 Pascal VOC 上,MSFFAL 比最先进的方法 (SOTA) 提高了 0.7-7.8%,在 MS COCO 的小目标检测中,比基线模型提高了 1.61 倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/feb7084dcbaf/sensors-23-03609-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/968db21fff74/sensors-23-03609-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/3bd4aee93708/sensors-23-03609-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/8cfb41def9df/sensors-23-03609-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/82d18257b965/sensors-23-03609-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/b084de9bcafc/sensors-23-03609-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/9cfddb8e52f8/sensors-23-03609-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/feb7084dcbaf/sensors-23-03609-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/968db21fff74/sensors-23-03609-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/3bd4aee93708/sensors-23-03609-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/8cfb41def9df/sensors-23-03609-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/82d18257b965/sensors-23-03609-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/b084de9bcafc/sensors-23-03609-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/9cfddb8e52f8/sensors-23-03609-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fec1/10099036/feb7084dcbaf/sensors-23-03609-g007.jpg

相似文献

1
MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning.MSFFAL:基于多尺度特征融合和注意力学习的小样本目标检测。
Sensors (Basel). 2023 Mar 30;23(7):3609. doi: 10.3390/s23073609.
2
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection.ECEA:用于少样本目标检测的可扩展共存注意力机制
IEEE Trans Image Process. 2024;33:5564-5576. doi: 10.1109/TIP.2024.3411771. Epub 2024 Oct 4.
3
Few-shot learning approach with multi-scale feature fusion and attention for plant disease recognition.基于多尺度特征融合与注意力机制的少样本学习方法用于植物病害识别
Front Plant Sci. 2022 Sep 16;13:907916. doi: 10.3389/fpls.2022.907916. eCollection 2022.
4
Improved region proposal network for enhanced few-shot object detection.改进的区域提议网络,用于增强少样本目标检测。
Neural Netw. 2024 Dec;180:106699. doi: 10.1016/j.neunet.2024.106699. Epub 2024 Sep 3.
5
Decoupled Metric Network for Single-Stage Few-Shot Object Detection.用于单阶段少样本目标检测的解耦度量网络
IEEE Trans Cybern. 2023 Jan;53(1):514-525. doi: 10.1109/TCYB.2022.3149825. Epub 2022 Dec 23.
6
Research on a Traffic Sign Recognition Method under Small Sample Conditions.基于小样本条件的交通标志识别方法研究。
Sensors (Basel). 2023 May 26;23(11):5091. doi: 10.3390/s23115091.
7
Lightweight Feature Enhancement Network for Single-Shot Object Detection.轻量级特征增强网络用于单目标检测。
Sensors (Basel). 2021 Feb 4;21(4):1066. doi: 10.3390/s21041066.
8
Single-Shot Object Detection via Feature Enhancement and Channel Attention.基于特征增强与通道注意力的单阶段目标检测
Sensors (Basel). 2022 Sep 10;22(18):6857. doi: 10.3390/s22186857.
9
Few-Shot Object Detection in Remote Sensing Images via Data Clearing and Stationary Meta-Learning.通过数据清理和静态元学习实现遥感图像中的少样本目标检测
Sensors (Basel). 2024 Jun 15;24(12):3882. doi: 10.3390/s24123882.
10
Improving Single Shot Object Detection With Feature Scale Unmixing.通过特征尺度解混改进单阶段目标检测
IEEE Trans Image Process. 2021;30:2708-2721. doi: 10.1109/TIP.2020.3048630. Epub 2021 Feb 10.

本文引用的文献

1
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild.基于小样本的野外目标检测与视角估计
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3090-3106. doi: 10.1109/TPAMI.2022.3174072. Epub 2023 Feb 3.
2
Learning to Match Anchors for Visual Object Detection.学习用于视觉目标检测的锚匹配。
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3096-3109. doi: 10.1109/TPAMI.2021.3050494. Epub 2022 May 5.