Vignesh Baalaji S, Sandhya S, Sajidha S A, Nisha V M, Vimalapriya M D, Tyagi Amit Kumar
Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India.
Post Graduate Department of Computer Science and Technology, Women's Christian College, Chennai, India.
J Ambient Intell Humaniz Comput. 2023 Jun 7:1-11. doi: 10.1007/s12652-023-04624-7.
The COVID-19 pandemic poses a global health challenge. The World Health Organization states that face masks are proven to be effective, especially in public areas. Real-time monitoring of face masks is challenging and exhaustive for humans. To reduce human effort and to provide an enforcement mechanism, an autonomous system has been proposed to detect non-masked people and retrieve their identity using computer vision. The proposed method introduces a novel and efficient method that involves fine-tuning the pre-trained ResNet-50 model with a new head layer for classification between masked and non-masked people. The classifier is trained using adaptive momentum optimization algorithm with decaying learning rate and binary cross-entropy loss. Data augmentation and dropout regularization are employed to achieve best convergence. During real-time application of our classifier on videos, a Caffe face detector model based on Single Shot MultiBox Detector is used to extract the face regions of interest from each frame, on which the trained classifier is applied for detecting the non-masked people. The faces of these people are then captured, which is passed on to a deep siamese neural network, based on VGG-Face model for face matching. The captured faces are compared with the reference images from the database, by extracting the features and calculating cosine distance. If the faces match, the details of that person are retrieved from the database and displayed on the web application. The proposed method has secured best results where the trained classifier has achieved 99.74% accuracy, and the identity retrieval model achieved 98.24% accuracy.
新冠疫情带来了全球健康挑战。世界卫生组织指出,口罩已被证明是有效的,尤其是在公共场所。对口罩佩戴情况进行实时监测对人类来说既具有挑战性又很繁琐。为了减轻人力负担并提供一种执行机制,人们提出了一种自主系统,利用计算机视觉来检测未戴口罩的人员并获取其身份信息。所提出的方法引入了一种新颖且高效的方法,即使用新的头部层对预训练的ResNet - 50模型进行微调,以对戴口罩和未戴口罩的人员进行分类。分类器使用具有衰减学习率的自适应动量优化算法和二元交叉熵损失进行训练。采用数据增强和随机失活正则化来实现最佳收敛。在将我们的分类器实时应用于视频时,基于单阶段多框检测器的Caffe人脸检测器模型用于从每一帧中提取感兴趣的人脸区域,在这些区域上应用训练好的分类器来检测未戴口罩的人员。然后捕捉这些人员的面部图像,并将其传递给基于VGG - Face模型的深度暹罗神经网络进行人脸匹配。通过提取特征并计算余弦距离,将捕捉到的人脸与数据库中的参考图像进行比较。如果人脸匹配,则从数据库中检索该人员的详细信息并显示在网络应用程序上。所提出的方法取得了最佳结果,训练好的分类器准确率达到了99.74%,身份检索模型准确率达到了98.24%。