• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于人群计数的轻量级上下文融合卷积神经网络方案的设计与分析

Design and Analysis of a Lightweight Context Fusion CNN Scheme for Crowd Counting.

作者信息

Yu Yang, Huang Jifeng, Du Wen, Xiong Naixue

机构信息

College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China.

DS Information Technology Co., Ltd., Shanghai 200032, China.

出版信息

Sensors (Basel). 2019 Apr 29;19(9):2013. doi: 10.3390/s19092013.

DOI:10.3390/s19092013
PMID:31035697
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6539683/
Abstract

Crowd counting, which is widely used in disaster management, traffic monitoring, and other fields of urban security, is a challenging task that is attracting increasing interest from researchers. For better accuracy, most methods have attempted to handle the scale variation explicitly. which results in huge scale changes of the object size. However, earlier methods based on convolutional neural networks (CNN) have focused primarily on improving accuracy while ignoring the complexity of the model. This paper proposes a novel method based on a lightweight CNN-based network for estimating crowd counting and generating density maps under resource constraints. The network is composed of three components: a basic feature extractor (BFE), a stacked à trous convolution module (SACM), and a context fusion module (CFM). The BFE encodes basic feature information with reduced spatial resolution for further refining. Various pieces of contextual information are generated through a short pipeline in SACM. To generate a context fusion density map, CFM distills feature maps from the above components. The whole network is trained in an end-to-end fashion and uses a compression factor to restrict its size. Experiments on three highly-challenging datasets demonstrate that the proposed method delivers attractive performance.

摘要

人群计数在灾害管理、交通监控和城市安全等其他领域有着广泛应用,是一项具有挑战性的任务,正吸引着研究人员越来越多的关注。为了获得更高的准确性,大多数方法都试图显式地处理尺度变化,这导致了目标尺寸的巨大尺度变化。然而,早期基于卷积神经网络(CNN)的方法主要集中在提高准确性上,而忽略了模型的复杂性。本文提出了一种基于轻量级CNN网络的新方法,用于在资源受限的情况下估计人群数量并生成密度图。该网络由三个组件组成:一个基本特征提取器(BFE)、一个堆叠空洞卷积模块(SACM)和一个上下文融合模块(CFM)。BFE以降低的空间分辨率对基本特征信息进行编码,以便进一步细化。通过SACM中的一个短管道生成各种上下文信息。为了生成上下文融合密度图,CFM从上述组件中提取特征图。整个网络以端到端的方式进行训练,并使用一个压缩因子来限制其大小。在三个极具挑战性的数据集上进行的实验表明,所提出的方法具有出色的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/400f69314685/sensors-19-02013-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/0ae3c5f7d49f/sensors-19-02013-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/516e8d94cac0/sensors-19-02013-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/ca8172dfb5cc/sensors-19-02013-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/0e8032355c5e/sensors-19-02013-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/fa398ce327b7/sensors-19-02013-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/bbad687ea866/sensors-19-02013-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/400f69314685/sensors-19-02013-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/0ae3c5f7d49f/sensors-19-02013-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/516e8d94cac0/sensors-19-02013-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/ca8172dfb5cc/sensors-19-02013-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/0e8032355c5e/sensors-19-02013-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/fa398ce327b7/sensors-19-02013-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/bbad687ea866/sensors-19-02013-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a47/6539683/400f69314685/sensors-19-02013-g007.jpg

相似文献

1
Design and Analysis of a Lightweight Context Fusion CNN Scheme for Crowd Counting.用于人群计数的轻量级上下文融合卷积神经网络方案的设计与分析
Sensors (Basel). 2019 Apr 29;19(9):2013. doi: 10.3390/s19092013.
2
An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.一种基于深度信息的自适应多尺度人群计数网络
Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.
3
An effective modular approach for crowd counting in an image using convolutional neural networks.基于卷积神经网络的图像人群计数的有效模块化方法。
Sci Rep. 2022 Apr 6;12(1):5795. doi: 10.1038/s41598-022-09685-w.
4
HADF-Crowd: A Hierarchical Attention-Based Dense Feature Extraction Network for Single-Image Crowd Counting.基于分层注意力的密集特征提取网络用于单幅图像人群计数(HADF-Crowd)。
Sensors (Basel). 2021 May 17;21(10):3483. doi: 10.3390/s21103483.
5
Redesigned Skip-Network for Crowd Counting with Dilated Convolution and Backward Connection.用于人群计数的重新设计的跳跃网络:带空洞卷积和反向连接
J Imaging. 2020 May 2;6(5):28. doi: 10.3390/jimaging6050028.
6
Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting.上下文感知多尺度聚合网络用于拥挤人群计数。
Sensors (Basel). 2022 Apr 22;22(9):3233. doi: 10.3390/s22093233.
7
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
8
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
9
Smart Camera Aware Crowd Counting via Multiple Task Fractional Stride Deep Learning.基于多任务分数步长深度学习的智能相机人群计数。
Sensors (Basel). 2019 Mar 18;19(6):1346. doi: 10.3390/s19061346.
10
COMAL: compositional multi-scale feature enhanced learning for crowd counting.COMAL:用于人群计数的组合多尺度特征增强学习
Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.

引用本文的文献

1
Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting.上下文感知多尺度聚合网络用于拥挤人群计数。
Sensors (Basel). 2022 Apr 22;22(9):3233. doi: 10.3390/s22093233.
2
Congested Crowd Counting via Adaptive Multi-Scale Context Learning.基于自适应多尺度上下文学习的拥挤人群计数。
Sensors (Basel). 2021 May 29;21(11):3777. doi: 10.3390/s21113777.
3
Counting Crowds with Perspective Distortion Correction via Adaptive Learning.利用自适应学习进行透视失真校正的人群计数。

本文引用的文献

1
Body Structure Aware Deep Crowd Counting.人体结构感知的深度学习人群计数。
IEEE Trans Image Process. 2018 Mar;27(3):1049-1059. doi: 10.1109/TIP.2017.2740160. Epub 2017 Aug 14.
2
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
3
Visual Tracking: An Experimental Survey.视觉跟踪:实验综述。
Sensors (Basel). 2020 Jul 6;20(13):3781. doi: 10.3390/s20133781.
IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1442-68. doi: 10.1109/TPAMI.2013.230.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
5
Pedestrian detection: an evaluation of the state of the art.行人检测:现状评估。
IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):743-61. doi: 10.1109/TPAMI.2011.155.
6
Hough forests for object detection, tracking, and action recognition.用于目标检测、跟踪和动作识别的 Hough 森林。
IEEE Trans Pattern Anal Mach Intell. 2011 Nov;33(11):2188-202. doi: 10.1109/TPAMI.2011.70.