复杂环境下的密集行人面部检测

Dense pedestrian face detection in complex environments.

作者信息

Gao Qiang, Ding Bingru, Jia Xu, Xie Yinghong, Han Xiaowei

机构信息

Institute of Innovation Science and Technology, Shenyang University, Shenyang, 110044, China.

School of Information Engineering, Shenyang University, Shenyang, 110044, China.

出版信息

Sci Rep. 2024 Sep 13;14(1):21460. doi: 10.1038/s41598-024-72523-8.

DOI:10.1038/s41598-024-72523-8

PMID:39271932

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11399418/

Abstract

To address the problem of dense crowd face detection in complex environments, this paper proposes a face detection model named Deep and Compact Face Detection (DCFD), which adopts an improved lightweight EfficientNetV2 network to replace the backbone network of RetinaFace. A large kernel attention mechanism is introduced to address the face detection task more accurately. The backbone network, an improved efficient channel attention (ECA) mechanism, is added to further improve the algorithm performance. The feature fusion module is an improved neural architecture search feature pyramid network (NAS-FPN) that significantly improves the face detection accuracy in different scenes. To balance the training process of positive and negative samples, we use the focus loss function to replace the traditional cross-entropy loss function. In different environments, the DCFD algorithm has shown efficient face detection performance. This algorithm provides not only a feasible and effective solution for solving the problem of face detection in dense groups but also an important basis for improving the accuracy of face detection models in practical applications.

摘要

为解决复杂环境下密集人群面部检测问题，本文提出一种名为深度紧凑面部检测（DCFD）的面部检测模型，该模型采用改进的轻量级EfficientNetV2网络替换RetinaFace的主干网络。引入大内核注意力机制以更准确地处理面部检测任务。在主干网络中添加了改进的高效通道注意力（ECA）机制，以进一步提高算法性能。特征融合模块是改进的神经架构搜索特征金字塔网络（NAS-FPN），可显著提高不同场景下的面部检测精度。为平衡正负样本的训练过程，我们使用焦点损失函数替代传统的交叉熵损失函数。在不同环境下，DCFD算法均展现出高效的面部检测性能。该算法不仅为解决密集人群中的面部检测问题提供了可行有效的解决方案，也为在实际应用中提高面部检测模型的精度提供了重要依据。