• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于多尺度人脸检测的混合金字塔卷积网络

Hybrid Pyramid Convolutional Network for Multiscale Face Detection.

作者信息

Hou Shaoqi, Fang Dongdong, Pan Yixi, Li Ye, Yin Guangqiang

机构信息

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China.

Glasgow College, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Comput Intell Neurosci. 2021 May 5;2021:9963322. doi: 10.1155/2021/9963322. eCollection 2021.

DOI:10.1155/2021/9963322
PMID:34035802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8116149/
Abstract

Face detection remains a challenging problem due to the high variability of scale and occlusion despite the strong representational power of deep convolutional neural networks and their implicit robustness. To handle hard face detection under extreme circumstances especially tiny faces detection, in this paper, we proposed a multiscale Hybrid Pyramid Convolutional Network (HPCNet), which is a one-stage fully convolutional network. Our HPCNet consists of three newly presented modules: firstly, we designed the Hybrid Dilated Convolution (HDC) module to replace the fully connected layers in VGG16, which enlarges receptive field and reduces its loss of local information; secondly, we constructed the Hybrid Feature Pyramid (HFP) to combine semantic information from higher layers together with details from lower layers; and thirdly, to deal with the problem of occlusion and blurring effectively, we introduced Context Information Extractor (CIE) in HPCNet. In addition, we presented an improved Online Hard Example Mining (OHEM) strategy, which can enhance the average precision of face detection by balancing the number of positive and negative samples. Our method has achieved an accuracy of 0.933, 0.924, and 0.848 on the Easy, Medium, and Hard subset of WIDER FACE, respectively, which surpasses most of the advanced algorithms.

摘要

尽管深度卷积神经网络具有强大的表征能力和内在的鲁棒性,但由于尺度和遮挡的高度变化性,人脸检测仍然是一个具有挑战性的问题。为了处理极端情况下的硬人脸检测,特别是微小人脸检测,在本文中,我们提出了一种多尺度混合金字塔卷积网络(HPCNet),它是一种单阶段全卷积网络。我们的HPCNet由三个新提出的模块组成:首先,我们设计了混合扩张卷积(HDC)模块来取代VGG16中的全连接层,这扩大了感受野并减少了局部信息的损失;其次,我们构建了混合特征金字塔(HFP),将来自较高层的语义信息与来自较低层的细节结合起来;第三,为了有效处理遮挡和模糊问题,我们在HPCNet中引入了上下文信息提取器(CIE)。此外,我们提出了一种改进的在线难例挖掘(OHEM)策略,该策略可以通过平衡正负样本数量来提高人脸检测的平均精度。我们的方法在WIDER FACE的Easy、Medium和Hard子集上分别达到了0.933、0.924和0.848的准确率,超过了大多数先进算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/d51210fc2ffa/CIN2021-9963322.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/60465e9e592a/CIN2021-9963322.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/5adaa15c603d/CIN2021-9963322.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/983818f56e31/CIN2021-9963322.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/c460d04a963a/CIN2021-9963322.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/09c9f7b4e44c/CIN2021-9963322.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/e11af3e20297/CIN2021-9963322.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/90ff442fae4c/CIN2021-9963322.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/6986af355db0/CIN2021-9963322.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/ca3fb24791f4/CIN2021-9963322.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/a17296e2ab6a/CIN2021-9963322.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/c06efec82a6f/CIN2021-9963322.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/72445ef5687a/CIN2021-9963322.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/03028dea5263/CIN2021-9963322.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/2f4adbf060fc/CIN2021-9963322.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/d51210fc2ffa/CIN2021-9963322.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/60465e9e592a/CIN2021-9963322.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/5adaa15c603d/CIN2021-9963322.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/983818f56e31/CIN2021-9963322.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/c460d04a963a/CIN2021-9963322.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/09c9f7b4e44c/CIN2021-9963322.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/e11af3e20297/CIN2021-9963322.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/90ff442fae4c/CIN2021-9963322.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/6986af355db0/CIN2021-9963322.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/ca3fb24791f4/CIN2021-9963322.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/a17296e2ab6a/CIN2021-9963322.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/c06efec82a6f/CIN2021-9963322.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/72445ef5687a/CIN2021-9963322.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/03028dea5263/CIN2021-9963322.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/2f4adbf060fc/CIN2021-9963322.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e84b/8116149/d51210fc2ffa/CIN2021-9963322.015.jpg

相似文献

1
Hybrid Pyramid Convolutional Network for Multiscale Face Detection.用于多尺度人脸检测的混合金字塔卷积网络
Comput Intell Neurosci. 2021 May 5;2021:9963322. doi: 10.1155/2021/9963322. eCollection 2021.
2
DMA-HPCNet: Dual Multi-Level Attention Hybrid Pyramid Convolution Neural Network for Alzheimer's Disease Classification.
IEEE Trans Neural Syst Rehabil Eng. 2024;32:1955-1964. doi: 10.1109/TNSRE.2024.3398640. Epub 2024 May 22.
3
MR-FPN: Multi-Level Residual Feature Pyramid Text Detection Network Based on Self-Attention Environment.MR-FPN:基于自注意力环境的多尺度残差特征金字塔文本检测网络。
Sensors (Basel). 2022 Apr 27;22(9):3337. doi: 10.3390/s22093337.
4
Lightweight YOLOv4 with Multiple Receptive Fields for Detection of Pulmonary Tuberculosis.基于多感受野的轻量化 YOLOv4 用于肺结核检测。
Comput Intell Neurosci. 2022 Mar 31;2022:9465646. doi: 10.1155/2022/9465646. eCollection 2022.
5
Industrial equipment detection algorithm under complex working conditions based on ROMS R-CNN.基于 ROMS R-CNN 的复杂工况下工业设备检测算法。
PLoS One. 2022 Apr 7;17(4):e0266444. doi: 10.1371/journal.pone.0266444. eCollection 2022.
6
Sonar Objective Detection Based on Dilated Separable Densely Connected CNNs and Quantum-Behaved PSO Algorithm.基于扩张可分离密集连接卷积神经网络和量子行为粒子群算法的声纳目标检测
Comput Intell Neurosci. 2021 Jan 18;2021:6235319. doi: 10.1155/2021/6235319. eCollection 2021.
7
Multilevel and Multiscale Feature Aggregation in Deep Networks for Facial Constitution Classification.深度学习网络中的多层次多尺度特征聚合用于面部特征分类。
Comput Math Methods Med. 2019 Dec 20;2019:1258782. doi: 10.1155/2019/1258782. eCollection 2019.
8
PyDiNet: Pyramid Dilated Network for medical image segmentation.PyDiNet:用于医学图像分割的金字塔扩张网络。
Neural Netw. 2021 Aug;140:274-281. doi: 10.1016/j.neunet.2021.03.023. Epub 2021 Mar 26.
9
Optimization Algorithm of Moving Object Detection Using Multiscale Pyramid Convolutional Neural Networks.基于多尺度金字塔卷积神经网络的运动目标检测优化算法。
Comput Intell Neurosci. 2023 Mar 10;2023:3320547. doi: 10.1155/2023/3320547. eCollection 2023.
10
SEFPN: Scale-Equalizing Feature Pyramid Network for Object Detection.SEFPN:用于目标检测的尺度均衡特征金字塔网络。
Sensors (Basel). 2021 Oct 27;21(21):7136. doi: 10.3390/s21217136.

本文引用的文献

1
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
2
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.