Al-Sarrar Haifa M, Al-Baity Heyam H
Information Technology Department, Collage of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.
PeerJ Comput Sci. 2023 Mar 27;9:e1265. doi: 10.7717/peerj-cs.1265. eCollection 2023.
Face and face mask detection are one of the most popular topics in computer vision literature. Face mask detection refers to the detection of people's faces in digital images and determining whether they are wearing a face mask. It can be of great benefit in different domains by ensuring public safety through the monitoring of face masks. Current research details a range of proposed face mask detection models, but most of them are mainly based on convolutional neural network models. These models have some drawbacks, such as their not being robust enough for low quality images and their being unable to capture long-range dependencies. These shortcomings can be overcome using transformer neural networks. Transformer is a type of deep learning that is based on the self-attention mechanism, and its strong capabilities have attracted the attention of computer vision researchers who apply this advanced neural network architecture to visual data as it can handle long-range dependencies between input sequence elements. In this study, we developed an automatic hybrid face mask detection model that is a combination of a transformer neural network and a convolutional neural network models which can be used to detect and determine whether people are wearing face masks. The proposed hybrid model's performance was evaluated and compared to other state-of-the-art face mask detection models, and the experimental results proved the proposed model's ability to achieve a highest average precision of 89.4% with an execution time of 2.8 s. Thus, the proposed hybrid model is fit for a practical, real-time trial and can contribute towards public healthcare in terms of infectious disease control.
面部和口罩检测是计算机视觉文献中最热门的话题之一。口罩检测是指在数字图像中检测人脸,并确定他们是否佩戴口罩。通过监测口罩来确保公共安全,这在不同领域可能会有很大益处。当前的研究详细介绍了一系列提出的口罩检测模型,但其中大多数主要基于卷积神经网络模型。这些模型存在一些缺点,例如对低质量图像的鲁棒性不足,以及无法捕捉长距离依赖关系。使用变压器神经网络可以克服这些缺点。变压器是一种基于自注意力机制的深度学习类型,其强大的能力吸引了计算机视觉研究人员的关注,他们将这种先进的神经网络架构应用于视觉数据,因为它可以处理输入序列元素之间的长距离依赖关系。在本研究中,我们开发了一种自动混合口罩检测模型,它是变压器神经网络和卷积神经网络模型的组合,可用于检测和确定人们是否佩戴口罩。对所提出的混合模型的性能进行了评估,并与其他先进的口罩检测模型进行了比较,实验结果证明了所提出的模型能够以2.8秒的执行时间实现最高89.4%的平均精度。因此,所提出的混合模型适合进行实际的实时试验,并在传染病控制方面有助于公共医疗保健。