• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度学习对闭路电视监控录像中的感兴趣区域内的人员进行计数。

Counting people inside a region-of-interest in CCTV footage with deep learning.

作者信息

Pardamean Bens, Abid Faizal, Cenggoro Tjeng Wawan, Elwirehardja Gregorius Natanael, Muljo Hery Harjono

机构信息

Computer Science Department, BINUS Graduate Program - Master of Computer Science Program, Bina Nusantara University, Jakarta, Indonesia.

Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia.

出版信息

PeerJ Comput Sci. 2022 Sep 22;8:e1067. doi: 10.7717/peerj-cs.1067. eCollection 2022.

DOI:10.7717/peerj-cs.1067
PMID:36262152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9575879/
Abstract

In recent years, the performance of people-counting models has been dramatically increased that they can be implemented in practical cases. However, the current models can only count all of the people captured in the inputted closed circuit television (CCTV) footage. Oftentimes, we only want to count people in a specific Region-of-Interest (RoI) in the footage. Unfortunately, simple approaches such as covering the area outside of the RoI are not applicable without degrading the performance of the models. Therefore, we developed a novel learning strategy that enables a deep-learning-based people counting model to count people only in a certain RoI. In the proposed method, the people counting model has two heads that are attached on top of a crowd counting backbone network. These two heads respectively learn to count people inside the RoI and negate the people count outside the RoI. We named this proposed method Gap Regularizer and tested it on ResNet-50, ResNet-101, CSRNet, and SFCN. The experiment results showed that Gap Regularizer can reduce the mean absolute error (MAE), root mean square error (RMSE), and grid average mean error (GAME) of ResNet-50, which is the smallest CNN model, with the highest reduction of 45.2%, 41.25%, and 46.43%, respectively. On shallow models such as the CSRNet, the regularizer can also drastically increase the SSIM by up to 248.65% in addition to reducing the MAE, RMSE, and GAME. The Gap Regularizer can also improve the performance of SFCN which is a deep CNN model with back-end features by up to 17.22% and 10.54% compared to its standard version. Moreover, the impacts of the Gap Regularizer on these two models are also generally statistically significant (-value < 0.05) on the MOT17-09, MOT20-02, and RHC datasets. However, it has a limitation in which it is unable to make significant impacts on deep models without back-end features such as the ResNet-101.

摘要

近年来,人数统计模型的性能有了显著提升,使其能够应用于实际案例中。然而,当前的模型只能对输入的闭路电视(CCTV)画面中捕捉到的所有人进行计数。通常情况下,我们只想对画面中特定的感兴趣区域(RoI)内的人员进行计数。不幸的是,诸如覆盖RoI之外区域的简单方法在不降低模型性能的情况下并不适用。因此,我们开发了一种新颖的学习策略,使基于深度学习的人数统计模型能够仅对特定RoI内的人员进行计数。在所提出的方法中,人数统计模型有两个头部,连接在人群计数骨干网络之上。这两个头部分别学习对RoI内的人员进行计数,并对RoI外的人员计数进行否定。我们将此方法命名为间隙正则化器,并在ResNet-50、ResNet-101、CSRNet和SFCN上进行了测试。实验结果表明,间隙正则化器可以降低ResNet-50(最小的卷积神经网络模型)的平均绝对误差(MAE)、均方根误差(RMSE)和网格平均平均误差(GAME),降幅分别高达45.2%、41.25%和46.43%。在诸如CSRNet这样的浅层模型上,除了降低MAE、RMSE和GAME之外,该正则化器还能将结构相似性指数(SSIM)大幅提高高达248.65%。与标准版本相比,间隙正则化器还能将具有后端特征的深度卷积神经网络模型SFCN的性能提高高达17.22%和10.54%。此外,间隙正则化器对这两个模型的影响在MOT17-09、MOT20-02和RHC数据集上通常也具有统计学意义(p值<0.05)。然而,它有一个局限性,即对没有后端特征的深度模型(如ResNet-101)无法产生显著影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/cf580322c098/peerj-cs-08-1067-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/bd584b7b808f/peerj-cs-08-1067-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/dd8693aba8f7/peerj-cs-08-1067-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/6df7b1379141/peerj-cs-08-1067-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/f785ad91a4df/peerj-cs-08-1067-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/ef9bfb771fd6/peerj-cs-08-1067-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/0b637c052d89/peerj-cs-08-1067-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/acc796608159/peerj-cs-08-1067-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/39b13aa5cc34/peerj-cs-08-1067-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/6b19ca893443/peerj-cs-08-1067-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/63665f812ffc/peerj-cs-08-1067-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/832b942324ca/peerj-cs-08-1067-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/cf580322c098/peerj-cs-08-1067-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/bd584b7b808f/peerj-cs-08-1067-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/dd8693aba8f7/peerj-cs-08-1067-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/6df7b1379141/peerj-cs-08-1067-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/f785ad91a4df/peerj-cs-08-1067-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/ef9bfb771fd6/peerj-cs-08-1067-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/0b637c052d89/peerj-cs-08-1067-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/acc796608159/peerj-cs-08-1067-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/39b13aa5cc34/peerj-cs-08-1067-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/6b19ca893443/peerj-cs-08-1067-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/63665f812ffc/peerj-cs-08-1067-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/832b942324ca/peerj-cs-08-1067-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2db7/9575879/cf580322c098/peerj-cs-08-1067-g012.jpg

相似文献

1
Counting people inside a region-of-interest in CCTV footage with deep learning.利用深度学习对闭路电视监控录像中的感兴趣区域内的人员进行计数。
PeerJ Comput Sci. 2022 Sep 22;8:e1067. doi: 10.7717/peerj-cs.1067. eCollection 2022.
2
Counting manatee aggregations using deep neural networks and Anisotropic Gaussian Kernel.使用深度神经网络和各向异性高斯核来计数海牛聚集数量。
Sci Rep. 2023 Nov 13;13(1):19793. doi: 10.1038/s41598-023-45507-3.
3
ACF: An Armed CCTV Footage Dataset for Enhancing Weapon Detection.ACF:用于增强武器检测的武装 CCTV 视频数据集。
Sensors (Basel). 2022 Sep 21;22(19):7158. doi: 10.3390/s22197158.
4
Deep-learning-based direct inversion for material decomposition.基于深度学习的材料分解直接反演
Med Phys. 2020 Dec;47(12):6294-6309. doi: 10.1002/mp.14523. Epub 2020 Oct 30.
5
Smart Camera Aware Crowd Counting via Multiple Task Fractional Stride Deep Learning.基于多任务分数步长深度学习的智能相机人群计数。
Sensors (Basel). 2019 Mar 18;19(6):1346. doi: 10.3390/s19061346.
6
Projection-domain scatter correction for cone beam computed tomography using a residual convolutional neural network.基于残差卷积神经网络的锥形束 CT 投影域散射校正。
Med Phys. 2019 Jul;46(7):3142-3155. doi: 10.1002/mp.13583. Epub 2019 Jun 5.
7
A transfer learning-based CNN and LSTM hybrid deep learning model to classify motor imagery EEG signals.一种基于迁移学习的卷积神经网络和长短期记忆网络混合深度学习模型,用于对运动想象脑电信号进行分类。
Comput Biol Med. 2022 Apr;143:105288. doi: 10.1016/j.compbiomed.2022.105288. Epub 2022 Feb 10.
8
Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation.基于深度学习 Bi-LSTM 方法的河流水质评估:预测与验证。
Environ Sci Pollut Res Int. 2022 Feb;29(9):12875-12889. doi: 10.1007/s11356-021-13875-w. Epub 2021 May 14.
9
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
10
Crowd density estimation using deep learning for Hajj pilgrimage video analytics.基于深度学习的朝觐视频分析中的人群密度估计。
F1000Res. 2021 Nov 24;10:1190. doi: 10.12688/f1000research.73156.2. eCollection 2021.

引用本文的文献

1
Comparative analysis of deep learning models for detecting face mask.用于检测口罩的深度学习模型的比较分析。
Procedia Comput Sci. 2023;216:48-56. doi: 10.1016/j.procs.2022.12.110. Epub 2023 Jan 10.

本文引用的文献

1
Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.
2
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
3
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
4
The influence of craniofacial to standing height proportion on perceived attractiveness.颅面与身高比例对感知吸引力的影响。
Int J Oral Maxillofac Surg. 2008 Oct;37(10):877-85. doi: 10.1016/j.ijom.2008.07.022. Epub 2008 Sep 7.
5
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.