• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种具有混合编码器和迭代样本特征学习的高效少样本类别无关计数框架。

An efficient low-shot class-agnostic counting framework with hybrid encoder and iterative exemplar feature learning.

作者信息

Yang Qinghua, Liu Bin, Tian Yan, Shi Yangming, Du Xinxin, He Fangyuan, Guo Jikun

机构信息

School of Artificial Intelligence, China University of Mining and Technology (Beijing), Beijing, China.

Energy Ningxia Coal Industry Co., Ltd., YangChangWan Coal Mine, YingChuan, NingXia, China.

出版信息

PLoS One. 2025 Jun 6;20(6):e0322360. doi: 10.1371/journal.pone.0322360. eCollection 2025.

DOI:10.1371/journal.pone.0322360
PMID:40478809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12143539/
Abstract

Few-shot learning techniques have enabled the rapid adaptation of a general AI model to various tasks using limited data. In this study, we focus on class-agnostic low-shot object counting, a challenging problem that aims to achieve accurate object counting with only a few annotated samples (few-shot) or even in the absence of any annotated data (zero-shot). In existing methods, the primary focus is often on enhancing performance, while relatively little attention is given to inference time-an equally critical factor in many practical applications. We propose a model that achieves real-time inference without compromising performance. Specifically, we design a multi-scale hybrid encoder to enhance feature representation and optimize computational efficiency. This encoder applies self-attention exclusively to high-level features and cross-scale fusion modules to integrate adjacent features, reducing training costs. Additionally, we introduce a learnable shape embedding and an iterative exemplar feature learning module, that progressively enriches exemplar features with class-level characteristics by learning from similar objects within the image, which are essential for improving subsequent matching performance. Extensive experiments on the FSC147, Val-COCO, Test-COCO, CARPK, and ShanghaiTech datasets demonstrate our model's effectiveness and generalizability compared to state-of-the-art methods.

摘要

少样本学习技术使通用人工智能模型能够利用有限的数据快速适应各种任务。在本研究中,我们专注于类别无关的少样本目标计数,这是一个具有挑战性的问题,旨在仅使用少量带注释样本(少样本)甚至在没有任何带注释数据(零样本)的情况下实现准确的目标计数。在现有方法中,主要关注点通常是提高性能,而在许多实际应用中同样关键的推理时间却相对很少受到关注。我们提出了一种在不影响性能的情况下实现实时推理的模型。具体而言,我们设计了一种多尺度混合编码器来增强特征表示并优化计算效率。该编码器仅将自注意力应用于高级特征,并使用跨尺度融合模块来整合相邻特征,从而降低训练成本。此外,我们引入了一个可学习的形状嵌入和一个迭代示例特征学习模块,该模块通过从图像中的相似对象学习来逐步丰富具有类别级特征的示例特征,这对于提高后续匹配性能至关重要。在FSC147、Val-COCO、Test-COCO、CARPK和上海科技数据集上进行的大量实验表明,与现有技术方法相比,我们的模型具有有效性和通用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/291d38a0e51e/pone.0322360.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/37732192bd0f/pone.0322360.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/5f31b905c01f/pone.0322360.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/7540530e5d6f/pone.0322360.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/9185b5e6b2d9/pone.0322360.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/dbde5dea9037/pone.0322360.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/cc05158f1877/pone.0322360.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/11db607d10ac/pone.0322360.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/96d97b682595/pone.0322360.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/358fd1f8a19e/pone.0322360.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/c67aca8cdaf5/pone.0322360.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/3407d7ae69ae/pone.0322360.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/f728b0fa3f85/pone.0322360.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/6ed10f12b6df/pone.0322360.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/291d38a0e51e/pone.0322360.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/37732192bd0f/pone.0322360.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/5f31b905c01f/pone.0322360.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/7540530e5d6f/pone.0322360.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/9185b5e6b2d9/pone.0322360.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/dbde5dea9037/pone.0322360.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/cc05158f1877/pone.0322360.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/11db607d10ac/pone.0322360.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/96d97b682595/pone.0322360.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/358fd1f8a19e/pone.0322360.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/c67aca8cdaf5/pone.0322360.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/3407d7ae69ae/pone.0322360.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/f728b0fa3f85/pone.0322360.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/6ed10f12b6df/pone.0322360.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12143539/291d38a0e51e/pone.0322360.g014.jpg

相似文献

1
An efficient low-shot class-agnostic counting framework with hybrid encoder and iterative exemplar feature learning.一种具有混合编码器和迭代样本特征学习的高效少样本类别无关计数框架。
PLoS One. 2025 Jun 6;20(6):e0322360. doi: 10.1371/journal.pone.0322360. eCollection 2025.
2
SATCount: A scale-aware transformer-based class-agnostic counting framework.SATCount:一种基于注意力机制的Transformer 无类别计数框架。
Neural Netw. 2024 Apr;172:106126. doi: 10.1016/j.neunet.2024.106126. Epub 2024 Jan 12.
3
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting.SAVE:用于零样本通用目标计数的视觉嵌入自注意力机制
J Imaging. 2025 Feb 10;11(2):52. doi: 10.3390/jimaging11020052.
4
Accurate few-shot object counting with Hough matching feature enhancement.基于霍夫匹配特征增强的精确少样本目标计数
Front Comput Neurosci. 2023 Mar 30;17:1145219. doi: 10.3389/fncom.2023.1145219. eCollection 2023.
5
Enhancing few-shot image classification through learnable multi-scale embedding and attention mechanisms.通过可学习的多尺度嵌入和注意力机制增强少样本图像分类
Neural Netw. 2025 Jul;187:107339. doi: 10.1016/j.neunet.2025.107339. Epub 2025 Mar 11.
6
Self-Supervised Image Segmentation Using Meta-Learning and Multi-Backbone Feature Fusion.基于元学习和多骨干特征融合的自监督图像分割
Int J Neural Syst. 2025 May;35(5):2550012. doi: 10.1142/S0129065725500121. Epub 2025 Feb 3.
7
Attention-Assisted Feature Comparison and Feature Enhancement for Class-Agnostic Counting.用于类别无关计数的注意力辅助特征比较与特征增强
Sensors (Basel). 2023 Nov 11;23(22):9126. doi: 10.3390/s23229126.
8
PFMNet: Prototype-based feature mapping network for few-shot domain adaptation in medical image segmentation.PFMNet:基于原型的特征映射网络,用于医学图像分割中的少样本领域自适应。
Comput Med Imaging Graph. 2024 Sep;116:102406. doi: 10.1016/j.compmedimag.2024.102406. Epub 2024 May 28.
9
MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning.MSFFAL:基于多尺度特征融合和注意力学习的小样本目标检测。
Sensors (Basel). 2023 Mar 30;23(7):3609. doi: 10.3390/s23073609.
10
Visual-guided attentive attributes embedding for zero-shot learning.基于视觉引导的注意力属性嵌入的零样本学习。
Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.

本文引用的文献

1
DEA-Net: Single Image Dehazing Based on Detail-Enhanced Convolution and Content-Guided Attention.DEA-Net:基于细节增强卷积和内容引导注意力的单图像去雾
IEEE Trans Image Process. 2024;33:1002-1015. doi: 10.1109/TIP.2024.3354108. Epub 2024 Jan 26.
2
Study on the evolution of Chinese characters based on few-shot learning: From oracle bone inscriptions to regular script.基于少样本学习的汉字演变研究:从甲骨文到楷书。
PLoS One. 2022 Aug 19;17(8):e0272974. doi: 10.1371/journal.pone.0272974. eCollection 2022.
3
Population estimation beyond counts-Inferring demographic characteristics.
超越计数的人口估计——推断人口特征。
PLoS One. 2022 Apr 5;17(4):e0266484. doi: 10.1371/journal.pone.0266484. eCollection 2022.
4
Channel-spatial attention network for fewshot classification.基于通道-空间注意力网络的小样本分类。
PLoS One. 2019 Dec 12;14(12):e0225426. doi: 10.1371/journal.pone.0225426. eCollection 2019.
5
Deep learning approach to peripheral leukocyte recognition.深度学习方法识别外周白细胞。
PLoS One. 2019 Jun 25;14(6):e0218808. doi: 10.1371/journal.pone.0218808. eCollection 2019.
6
UAS imagery reveals new survey opportunities for counting hippos.UAS 影像为河马计数提供了新的调查机会。
PLoS One. 2018 Nov 14;13(11):e0206413. doi: 10.1371/journal.pone.0206413. eCollection 2018.
7
TasselNet: counting maize tassels in the wild via local counts regression network.TasselNet:通过局部计数回归网络对野外玉米雄穗进行计数
Plant Methods. 2017 Nov 1;13:79. doi: 10.1186/s13007-017-0224-0. eCollection 2017.
8
Multiagent cooperation and competition with deep reinforcement learning.基于深度强化学习的多智能体合作与竞争
PLoS One. 2017 Apr 5;12(4):e0172395. doi: 10.1371/journal.pone.0172395. eCollection 2017.
9
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.