具有多分辨率协作表示的级联并行人群计数网络

Cascaded parallel crowd counting network with multi-resolution collaborative representation.

作者信息

Lyu Lei, Han Run, Chen Ziming

机构信息

School of Information Science and Engineering, Shandong Normal University, Jinan, 250358 China.

Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, 250358 China.

出版信息

Appl Intell (Dordr). 2023;53(3):3002-3016. doi: 10.1007/s10489-022-03639-5. Epub 2022 May 19.

DOI:10.1007/s10489-022-03639-5

PMID:35607431

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9117858/

Abstract

Accurately estimating the size and density distribution of a crowd from images is of great importance to public safety and crowd management during the COVID-19 pandemic, but it is very challenging as it is affected by many complex factors, including perspective distortion and background noise information. In this paper, we propose a novel multi-resolution collaborative representation framework called the cascaded parallel network (CP-Net), consisting of three parallel scale-specific branches connected in a cascading mode. In the framework, the three cascaded multi-resolution branches efficiently capture multi-scale features through their specific receptive fields. Additionally, multi-level feature fusion and information filtering are performed continuously on each branch to resist noise interference and perspective distortion. Moreover, we design an information exchange module across independent branches to refine the features extracted by each specific branch and deal with perspective distortion by using complementary information of multiple resolutions. To further improve the robustness of the network to scale variance and generate high-quality density maps, we construct a multi-receptive field fusion module to aggregate multi-scale features more comprehensively. The performance of our proposed CP-Net is verified on the challenging counting datasets (UCF_CC_50, UCF-QNRF, Shanghai Tech A&B, and WorldExpo'10), and the experimental results demonstrate the superiority of the proposed method.

摘要

在新冠疫情期间，从图像中准确估计人群的规模和密度分布对于公共安全和人群管理至关重要，但由于受到包括透视畸变和背景噪声信息在内的许多复杂因素影响，这极具挑战性。在本文中，我们提出了一种名为级联并行网络（CP-Net）的新型多分辨率协作表示框架，它由以级联模式连接的三个并行的特定尺度分支组成。在该框架中，三个级联的多分辨率分支通过其特定的感受野有效地捕捉多尺度特征。此外，在每个分支上持续进行多级特征融合和信息过滤，以抵抗噪声干扰和透视畸变。而且，我们设计了一个跨独立分支的信息交换模块，以细化每个特定分支提取的特征，并利用多分辨率的互补信息处理透视畸变。为了进一步提高网络对尺度变化的鲁棒性并生成高质量的密度图，我们构建了一个多感受野融合模块，以更全面地聚合多尺度特征。我们提出的CP-Net的性能在具有挑战性的计数数据集（UCF_CC_50、UCF-QNRF、上海科技大学A&B和世博10）上得到了验证，实验结果证明了所提方法的优越性。