基于偏移解耦的可变形卷积的高效人群计数。

Offset-decoupled deformable convolution for efficient crowd counting.

机构信息

Department of Educational Technology, Ocean University of China, Qingdao, 266100, China.

Department of Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.

出版信息

Sci Rep. 2022 Jul 18;12(1):12229. doi: 10.1038/s41598-022-16415-9.

DOI:10.1038/s41598-022-16415-9

PMID:35851829

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9293988/

Abstract

Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv.

摘要

人群计数被认为是计算机视觉中的一个具有挑战性的问题。人群计数中最关键的挑战之一是考虑尺度变化的影响。与其他方法相比，基于 CNN 的方法可以获得更好的性能。然而，由于固定几何结构的限制，头部尺度特征并没有被完全获取。带有附加偏移量的可变形卷积在图像分类和模式识别等领域得到了广泛应用，因为它可以成功地利用空间信息的潜力。然而，由于网络初始化中偏移量的随机生成参数，可变形卷积的采样点是无序堆叠的，这削弱了特征提取的有效性。为了解决偏移量的无效学习和可变形卷积的低效利用问题，本文提出了一种偏移量解耦可变形卷积（ODConv）。它可以完全获取采样点有效区域内的信息，从而获得更好的性能。在广泛的实验中，我们的方法在 ShanghaiTech A、ShanghaiTech B、UCF-QNRF 和 UCF_CC_50 数据集上的平均 MAE 分别达到了 62.3、8.3、91.9 和 159.3，优于最先进的方法，验证了所提出的 ODConv 的有效性。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于偏移解耦的可变形卷积的高效人群计数。

Offset-decoupled deformable convolution for efficient crowd counting.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

基于偏移解耦的可变形卷积的高效人群计数。

Offset-decoupled deformable convolution for efficient crowd counting.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献