Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 461-701, Gyeonggi-do, Republic of Korea.
Department of Artificial Intelligence, Tashkent State University of Economics, Tashkent 100066, Uzbekistan.
Sensors (Basel). 2023 Jan 2;23(1):502. doi: 10.3390/s23010502.
Most facial recognition and face analysis systems start with facial detection. Early techniques, such as Haar cascades and histograms of directed gradients, mainly rely on features that had been manually developed from particular images. However, these techniques are unable to correctly synthesize images taken in untamed situations. However, deep learning's quick development in computer vision has also sped up the development of a number of deep learning-based face detection frameworks, many of which have significantly improved accuracy in recent years. When detecting faces in face detection software, the difficulty of detecting small, scale, position, occlusion, blurring, and partially occluded faces in uncontrolled conditions is one of the problems of face identification that has been explored for many years but has not yet been entirely resolved. In this paper, we propose Retina net baseline, a single-stage face detector, to handle the challenging face detection problem. We made network improvements that boosted detection speed and accuracy. In Experiments, we used two popular datasets, such as WIDER FACE and FDDB. Specifically, on the WIDER FACE benchmark, our proposed method achieves AP of 41.0 at speed of 11.8 FPS with a single-scale inference strategy and AP of 44.2 with multi-scale inference strategy, which are results among one-stage detectors. Then, we trained our model during the implementation using the PyTorch framework, which provided an accuracy of 95.6% for the faces, which are successfully detected. Visible experimental results show that our proposed model outperforms seamless detection and recognition results achieved using performance evaluation matrices.
大多数人脸识别和面部分析系统都从面部检测开始。早期的技术,如 Haar 级联和定向梯度直方图,主要依赖于从特定图像手动开发的特征。然而,这些技术无法正确合成在自然环境中拍摄的图像。然而,计算机视觉中深度学习的快速发展也加速了许多基于深度学习的面部检测框架的发展,近年来,其中许多框架的准确性都有了显著提高。在面部检测软件中进行人脸检测时,在不受控制的条件下检测小尺寸、位置、遮挡、模糊和部分遮挡人脸的难度是人脸识别中多年来一直在探索但尚未完全解决的问题之一。在本文中,我们提出了 Retina net 基线,这是一种单级人脸检测器,用于处理具有挑战性的人脸检测问题。我们对网络进行了改进,提高了检测速度和准确性。在实验中,我们使用了两个流行的数据集,如 WIDER FACE 和 FDDB。具体来说,在 WIDER FACE 基准上,我们提出的方法在单尺度推理策略下以 11.8 FPS 的速度实现了 41.0 的 AP,在多尺度推理策略下实现了 44.2 的 AP,这是单级检测器中的结果。然后,我们在实施过程中使用 PyTorch 框架训练我们的模型,该模型对成功检测到的人脸的准确率为 95.6%。可见的实验结果表明,我们提出的模型在性能评估矩阵方面的无缝检测和识别结果表现优于其他模型。