• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于偏移解耦的可变形卷积的高效人群计数。

Offset-decoupled deformable convolution for efficient crowd counting.

机构信息

Department of Educational Technology, Ocean University of China, Qingdao, 266100, China.

Department of Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.

出版信息

Sci Rep. 2022 Jul 18;12(1):12229. doi: 10.1038/s41598-022-16415-9.

DOI:10.1038/s41598-022-16415-9
PMID:35851829
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9293988/
Abstract

Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv.

摘要

人群计数被认为是计算机视觉中的一个具有挑战性的问题。人群计数中最关键的挑战之一是考虑尺度变化的影响。与其他方法相比,基于 CNN 的方法可以获得更好的性能。然而,由于固定几何结构的限制,头部尺度特征并没有被完全获取。带有附加偏移量的可变形卷积在图像分类和模式识别等领域得到了广泛应用,因为它可以成功地利用空间信息的潜力。然而,由于网络初始化中偏移量的随机生成参数,可变形卷积的采样点是无序堆叠的,这削弱了特征提取的有效性。为了解决偏移量的无效学习和可变形卷积的低效利用问题,本文提出了一种偏移量解耦可变形卷积(ODConv)。它可以完全获取采样点有效区域内的信息,从而获得更好的性能。在广泛的实验中,我们的方法在 ShanghaiTech A、ShanghaiTech B、UCF-QNRF 和 UCF_CC_50 数据集上的平均 MAE 分别达到了 62.3、8.3、91.9 和 159.3,优于最先进的方法,验证了所提出的 ODConv 的有效性。

相似文献

1
Offset-decoupled deformable convolution for efficient crowd counting.基于偏移解耦的可变形卷积的高效人群计数。
Sci Rep. 2022 Jul 18;12(1):12229. doi: 10.1038/s41598-022-16415-9.
2
An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.一种基于深度信息的自适应多尺度人群计数网络
Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.
3
Crowd Counting Based on Multiscale Spatial Guided Perception Aggregation Network.基于多尺度空间引导感知聚合网络的人群计数
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17465-17478. doi: 10.1109/TNNLS.2023.3304348. Epub 2024 Dec 2.
4
HADF-Crowd: A Hierarchical Attention-Based Dense Feature Extraction Network for Single-Image Crowd Counting.基于分层注意力的密集特征提取网络用于单幅图像人群计数(HADF-Crowd)。
Sensors (Basel). 2021 May 17;21(10):3483. doi: 10.3390/s21103483.
5
COMAL: compositional multi-scale feature enhanced learning for crowd counting.COMAL:用于人群计数的组合多尺度特征增强学习
Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.
6
SPCANet: congested crowd counting strip pooling combined attention network.SPCANet:拥堵人群计数的条带池化联合注意力网络。
PeerJ Comput Sci. 2024 Sep 18;10:e2273. doi: 10.7717/peerj-cs.2273. eCollection 2024.
7
Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting.上下文感知多尺度聚合网络用于拥挤人群计数。
Sensors (Basel). 2022 Apr 22;22(9):3233. doi: 10.3390/s22093233.
8
Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting.将视角分析嵌入多列卷积神经网络进行人群计数
IEEE Trans Image Process. 2021;30:1395-1407. doi: 10.1109/TIP.2020.3043122. Epub 2020 Dec 29.
9
A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting.一种用于拥挤人群计数的跨层上下文信息的扩张卷积神经网络。
Sensors (Basel). 2024 Mar 12;24(6):1816. doi: 10.3390/s24061816.
10
PaDNet: Pan-Density Crowd Counting.PaDNet:全景密度人群计数
IEEE Trans Image Process. 2019 Nov 12. doi: 10.1109/TIP.2019.2952083.

引用本文的文献

1
Multiscale regional calibration network for crowd counting.用于人群计数的多尺度区域校准网络。
Sci Rep. 2025 Jan 22;15(1):2866. doi: 10.1038/s41598-025-86247-w.
2
Apple Grading Based on Multi-Dimensional View Processing and Deep Learning.基于多维度视图处理和深度学习的苹果分级
Foods. 2023 May 24;12(11):2117. doi: 10.3390/foods12112117.

本文引用的文献

1
MH-MetroNet-A Multi-Head CNN for Passenger-Crowd Attendance Estimation.MH-MetroNet——一种用于乘客人群出勤估计的多头卷积神经网络
J Imaging. 2020 Jul 2;6(7):62. doi: 10.3390/jimaging6070062.
2
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization.西工大人群计数数据集:大规模人群计数和定位基准数据集
IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):2141-2149. doi: 10.1109/TPAMI.2020.3013269. Epub 2021 May 11.