• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于鲁棒人脸检测的宽高比匹配

Wide aspect ratio matching for robust face detection.

作者信息

Luo Shi, Li Xiongfei, Zhang Xiaoli

机构信息

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 China.

College of Computer Science and Technology, Jilin University, Changchun, 130012 China.

出版信息

Multimed Tools Appl. 2023;82(7):10535-10552. doi: 10.1007/s11042-022-13667-5. Epub 2022 Sep 6.

DOI:10.1007/s11042-022-13667-5
PMID:36090154
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9444702/
Abstract

Recently, anchor-based methods have achieved great progress in face detection. They adopt standard anchor matching strategy to sample positive anchors according to predefined IoU threshold. However, the max IoUs of extreme aspect ratio faces are still lower than fixed positive threshold, leading to the sampling failure from these faces. To construct a more robust detection model, more positive anchors from extreme aspect ratio faces need to be sampled and participate in the training phase. The goal of the present research is to improve the detection performance by reasonably extending sampling range of face aspect ratio. In this paper, we firstly explore the factors that affect the max IoU of each face in theory. Then, anchor matching simulation is performed to evaluate the sampling range of face aspect ratio. Finally, we propose a Wide Aspect Ratio Matching (WARM) strategy to collect more representative positive anchors from ground-truth faces across a wide range of aspect ratios. Besides, we present a novel feature enhancement module, named Receptive Field Diversity (RFD) module, to provide diverse receptive field corresponding to different aspect ratios. Extensive experiments have been conducted on popular benchmarks to show the effectiveness of our method, which can help detectors better capture extreme aspect ratio faces. Our method achieves promising APs on WIDER FACE validation dataset (easy: 0.965, medium: 0.955, hard: 0.904) and impressive generalization capability on FDDB dataset.

摘要

最近,基于锚点的方法在人脸检测方面取得了巨大进展。它们采用标准的锚点匹配策略,根据预定义的交并比(IoU)阈值对正锚点进行采样。然而,极端宽高比人脸的最大IoU仍然低于固定的正阈值,导致这些人脸采样失败。为了构建一个更强大的检测模型,需要从极端宽高比人脸中采样更多的正锚点并参与训练阶段。本研究的目标是通过合理扩展人脸宽高比的采样范围来提高检测性能。在本文中,我们首先从理论上探索影响每个人脸最大IoU的因素。然后,进行锚点匹配模拟以评估人脸宽高比的采样范围。最后,我们提出了一种宽高比匹配(WARM)策略,以从各种宽高比的真实人脸中收集更具代表性的正锚点。此外,我们提出了一种新颖的特征增强模块,称为感受野多样性(RFD)模块,以提供对应于不同宽高比的多样化感受野。我们在流行的基准上进行了大量实验,以证明我们方法的有效性,该方法可以帮助检测器更好地捕捉极端宽高比的人脸。我们的方法在WIDER FACE验证数据集上取得了可观的平均精度(AP)(简单:0.965,中等:0.955,困难:0.904),并且在FDDB数据集上具有令人印象深刻的泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/3cbfb4fca138/11042_2022_13667_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/295f23de492b/11042_2022_13667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/0d3062bc7afa/11042_2022_13667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/2e5d92e8335c/11042_2022_13667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/7eb5e2e90e0c/11042_2022_13667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/3c8db5b78915/11042_2022_13667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/3cbfb4fca138/11042_2022_13667_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/295f23de492b/11042_2022_13667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/0d3062bc7afa/11042_2022_13667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/2e5d92e8335c/11042_2022_13667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/7eb5e2e90e0c/11042_2022_13667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/3c8db5b78915/11042_2022_13667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f22/9444702/3cbfb4fca138/11042_2022_13667_Fig7_HTML.jpg

相似文献

1
Wide aspect ratio matching for robust face detection.用于鲁棒人脸检测的宽高比匹配
Multimed Tools Appl. 2023;82(7):10535-10552. doi: 10.1007/s11042-022-13667-5. Epub 2022 Sep 6.
2
RefineFace: Refinement Neural Network for High Performance Face Detection.RefineFace:用于高性能人脸检测的精化神经网络。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4008-4020. doi: 10.1109/TPAMI.2020.2997456. Epub 2021 Oct 1.
3
Face Detection With Different Scales Based on Faster R-CNN.基于更快区域卷积神经网络的不同尺度人脸检测
IEEE Trans Cybern. 2019 Nov;49(11):4017-4028. doi: 10.1109/TCYB.2018.2859482. Epub 2018 Aug 14.
4
SCPM-Net: An anchor-free 3D lung nodule detection network using sphere representation and center points matching.SCPM-Net:一种使用球体表示和中心点匹配的无锚点3D肺结节检测网络。
Med Image Anal. 2022 Jan;75:102287. doi: 10.1016/j.media.2021.102287. Epub 2021 Oct 22.
5
The Lightweight Anchor Dynamic Assignment Algorithm for Object Detection.用于目标检测的轻量级锚点动态分配算法
Sensors (Basel). 2023 Jul 11;23(14):6306. doi: 10.3390/s23146306.
6
Group Sampling for Scale Invariant Face Detection.基于群组抽样的尺度不变人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):985-1001. doi: 10.1109/TPAMI.2020.3012414. Epub 2022 Jan 7.
7
Learning to Match Anchors for Visual Object Detection.学习用于视觉目标检测的锚匹配。
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3096-3109. doi: 10.1109/TPAMI.2021.3050494. Epub 2022 May 5.
8
Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUs.通过结合预测交并比和锚框交并比进行目标检测的动态标签分配
J Imaging. 2022 Jul 11;8(7):193. doi: 10.3390/jimaging8070193.
9
Anchor Cascade for Efficient Face Detection.用于高效人脸检测的锚级联
IEEE Trans Image Process. 2018 Dec 14. doi: 10.1109/TIP.2018.2886790.
10
Hybrid Pyramid Convolutional Network for Multiscale Face Detection.用于多尺度人脸检测的混合金字塔卷积网络
Comput Intell Neurosci. 2021 May 5;2021:9963322. doi: 10.1155/2021/9963322. eCollection 2021.

引用本文的文献

1
An open dataset for intelligent recognition and classification of abnormal condition in longwall mining.综采工作面异常状态智能识别与分类的公开数据集
Sci Data. 2023 Jun 27;10(1):416. doi: 10.1038/s41597-023-02322-9.
2
A convolutional neural network for face mask detection in IoT-based smart healthcare systems.用于基于物联网的智能医疗系统中口罩检测的卷积神经网络。
Front Physiol. 2023 Mar 31;14:1143249. doi: 10.3389/fphys.2023.1143249. eCollection 2023.

本文引用的文献

1
Transfer learning for image classification using VGG19: Caltech-101 image data set.使用VGG19进行图像分类的迁移学习:加州理工学院101图像数据集。
J Ambient Intell Humaniz Comput. 2023;14(4):3609-3620. doi: 10.1007/s12652-021-03488-z. Epub 2021 Sep 17.
2
Face mask detection using YOLOv3 and faster R-CNN models: COVID-19 environment.使用YOLOv3和更快的R-CNN模型进行口罩检测:COVID-19环境
Multimed Tools Appl. 2021;80(13):19753-19768. doi: 10.1007/s11042-021-10711-8. Epub 2021 Mar 1.
3
RefineFace: Refinement Neural Network for High Performance Face Detection.
RefineFace:用于高性能人脸检测的精化神经网络。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4008-4020. doi: 10.1109/TPAMI.2020.2997456. Epub 2021 Oct 1.
4
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
5
A Fast and Accurate Unconstrained Face Detector.一种快速准确的无约束人脸检测器。
IEEE Trans Pattern Anal Mach Intell. 2016 Feb;38(2):211-23. doi: 10.1109/TPAMI.2015.2448075.
6
Fast Feature Pyramids for Object Detection.快速目标检测特征金字塔。
IEEE Trans Pattern Anal Mach Intell. 2014 Aug;36(8):1532-45. doi: 10.1109/TPAMI.2014.2300479.