• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多尺度注意力和层次级别增强的域泛化中的人群计数

Crowd counting in domain generalization based on multi-scale attention and hierarchy level enhancement.

作者信息

Zhou Jiarui, Zhang Jianming, Gui Yan

机构信息

School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, 410114, China.

出版信息

Sci Rep. 2025 Jan 2;15(1):155. doi: 10.1038/s41598-024-83725-5.

DOI:10.1038/s41598-024-83725-5
PMID:39747398
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11696471/
Abstract

In order to solve the problem of weak single domain generalization ability in existing crowd counting methods, this study proposes a new crowd counting framework called Multi-scale Attention and Hierarchy level Enhancement (MAHE). Firstly, the model can focus on both the detailed features and the macro information of structural position changes through the fusion of channel attention and spatial attention. Secondly, the addition of multi-head attention feature module facilitates the model's capacity to effectively capture complex dependency relationships between sequence elements. In addition, the three-stage encoding and decoding processing mode enables the model to effectively represent crowd density information. Finally, the fusion of multi-scale features derived from different receptive fields is further enhanced through multi-scale hierarchy level feature fusion, thereby enabling the model to learn high-level semantic information and low-level multi-scale visual field feature information. This method enhances the model's capacity to capture key feature information, even in highly differentiated datasets, thereby improving the model's generalization ability on a single domain. The model has demonstrated strong generalization capabilities through extensive experiments on different datasets. This study not only improves the accuracy of crowd counting, but also introduces a new research approach for single domain generalization of crowd counting.

摘要

为了解决现有人群计数方法中单域泛化能力较弱的问题,本研究提出了一种名为多尺度注意力与层次增强(MAHE)的新型人群计数框架。首先,该模型通过通道注意力和空间注意力的融合,能够同时关注结构位置变化的细节特征和宏观信息。其次,多头注意力特征模块的加入有助于模型有效捕捉序列元素之间复杂的依赖关系。此外,三阶段编码和解码处理模式使模型能够有效表示人群密度信息。最后,通过多尺度层次特征融合进一步增强了来自不同感受野的多尺度特征的融合,从而使模型能够学习高级语义信息和低级多尺度视野特征信息。该方法增强了模型捕捉关键特征信息的能力,即使在高度分化的数据集中也是如此,从而提高了模型在单域上的泛化能力。通过在不同数据集上的大量实验,该模型展示了强大的泛化能力。本研究不仅提高了人群计数的准确性,还为人群计数的单域泛化引入了一种新的研究方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/39e2ea144cb4/41598_2024_83725_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5befa2941ba2/41598_2024_83725_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/bc5e2308ae65/41598_2024_83725_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/3af3f3835298/41598_2024_83725_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/8ae8614c0045/41598_2024_83725_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/22f7bf13b9c0/41598_2024_83725_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5a164241d6c1/41598_2024_83725_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5b858c49df1b/41598_2024_83725_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/39e2ea144cb4/41598_2024_83725_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5befa2941ba2/41598_2024_83725_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/bc5e2308ae65/41598_2024_83725_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/3af3f3835298/41598_2024_83725_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/8ae8614c0045/41598_2024_83725_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/22f7bf13b9c0/41598_2024_83725_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5a164241d6c1/41598_2024_83725_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/5b858c49df1b/41598_2024_83725_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c130/11696471/39e2ea144cb4/41598_2024_83725_Fig8_HTML.jpg

相似文献

1
Crowd counting in domain generalization based on multi-scale attention and hierarchy level enhancement.基于多尺度注意力和层次级别增强的域泛化中的人群计数
Sci Rep. 2025 Jan 2;15(1):155. doi: 10.1038/s41598-024-83725-5.
2
COMAL: compositional multi-scale feature enhanced learning for crowd counting.COMAL:用于人群计数的组合多尺度特征增强学习
Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.
3
An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.一种基于深度信息的自适应多尺度人群计数网络
Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.
4
DCAM-NET:A novel domain generalization optic cup and optic disc segmentation pipeline with multi-region and multi-scale convolution attention mechanism.DCAM-NET:一种具有多区域和多尺度卷积注意力机制的新型领域泛化视杯和视盘分割管道。
Comput Biol Med. 2023 Sep;163:107076. doi: 10.1016/j.compbiomed.2023.107076. Epub 2023 May 28.
5
Feature diffusion reconstruction mechanism network for crop spike head detection.用于作物穗头检测的特征扩散重建机制网络。
Front Plant Sci. 2024 Oct 1;15:1459515. doi: 10.3389/fpls.2024.1459515. eCollection 2024.
6
Crowd Counting Based on Multiscale Spatial Guided Perception Aggregation Network.基于多尺度空间引导感知聚合网络的人群计数
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17465-17478. doi: 10.1109/TNNLS.2023.3304348. Epub 2024 Dec 2.
7
HRANet: Hierarchical region-aware network for crowd counting.HRANet:用于人群计数的分层区域感知网络。
Appl Intell (Dordr). 2022;52(11):12191-12205. doi: 10.1007/s10489-021-03030-w. Epub 2022 Feb 2.
8
MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network.MRSNet:基于多分辨率尺度特征融合的通用密度计数网络。
Sensors (Basel). 2024 Sep 14;24(18):5974. doi: 10.3390/s24185974.
9
Optimizing CNN for pavement distress detection via edge-enhanced multi-scale feature fusion.通过边缘增强多尺度特征融合优化卷积神经网络用于路面病害检测
PLoS One. 2025 Apr 9;20(4):e0319299. doi: 10.1371/journal.pone.0319299. eCollection 2025.
10
HADF-Crowd: A Hierarchical Attention-Based Dense Feature Extraction Network for Single-Image Crowd Counting.基于分层注意力的密集特征提取网络用于单幅图像人群计数(HADF-Crowd)。
Sensors (Basel). 2021 May 17;21(10):3483. doi: 10.3390/s21103483.

本文引用的文献

1
SRCD: Semantic Reasoning With Compound Domains for Single-Domain Generalized Object Detection.
IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):12497-12506. doi: 10.1109/TNNLS.2024.3480120.
2
One-Shot Any-Scene Crowd Counting With Local-to-Global Guidance.基于局部到全局引导的一次性任意场景人群计数
IEEE Trans Image Process. 2024;33:6622-6632. doi: 10.1109/TIP.2024.3420713. Epub 2024 Dec 3.
3
Learning Discriminative Features for Crowd Counting.
IEEE Trans Image Process. 2024;33:3749-3764. doi: 10.1109/TIP.2024.3408609. Epub 2024 Jun 13.
4
Hybrid Perturbation Strategy for Semi-Supervised Crowd Counting.用于半监督人群计数的混合扰动策略
IEEE Trans Image Process. 2024;33:1227-1240. doi: 10.1109/TIP.2024.3361730. Epub 2024 Feb 13.
5
Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting.虚拟分类:为多领域人群计数调制特定领域知识
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2958-2972. doi: 10.1109/TNNLS.2024.3350363. Epub 2025 Feb 6.
6
CapMatch: Semi-Supervised Contrastive Transformer Capsule With Feature-Based Knowledge Distillation for Human Activity Recognition.CapMatch:用于人类活动识别的基于特征知识蒸馏的半监督对比变压器胶囊网络
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2690-2704. doi: 10.1109/TNNLS.2023.3344294. Epub 2025 Feb 6.
7
Efficient Crowd Counting via Dual Knowledge Distillation.通过双重知识蒸馏实现高效人群计数
IEEE Trans Image Process. 2023 Dec 21;PP. doi: 10.1109/TIP.2023.3343609.
8
Confusion Region Mining for Crowd Counting.用于人群计数的混淆区域挖掘
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):18039-18051. doi: 10.1109/TNNLS.2023.3311020. Epub 2024 Dec 2.
9
Redesigning Multi-Scale Neural Network for Crowd Counting.重新设计用于人群计数的多尺度神经网络。
IEEE Trans Image Process. 2023;32:3664-3678. doi: 10.1109/TIP.2023.3289290. Epub 2023 Jul 4.
10
Deformable Density Estimation via Adaptive Representation.通过自适应表示进行可变形密度估计
IEEE Trans Image Process. 2023;32:1134-1144. doi: 10.1109/TIP.2023.3240839. Epub 2023 Feb 10.