A novel hybrid face mask detection approach using Transformer and convolutional neural network models.

Author information

Al-Sarrar Haifa M, Al-Baity Heyam H

Affiliation

Information Technology Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia.

Publication information

PeerJ Comput Sci. 2023 Mar 27;9:e1265. doi: 10.7717/peerj-cs.1265. eCollection 2023.

DOI:10.7717/peerj-cs.1265

PMID:37346550

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10280424/

Abstract

Face detection and face mask detection are among the most popular topics in the computer vision literature. Face mask detection refers to detecting people's faces in digital images and determining whether they are wearing a face mask. It can benefit many domains by supporting public safety through the monitoring of face mask use. Current research describes a range of proposed face mask detection models, but most of them are based on convolutional neural networks. These models have drawbacks: they are not robust enough for low-quality images, and they cannot capture long-range dependencies. These shortcomings can be overcome using Transformer neural networks. The Transformer is a deep learning architecture based on the self-attention mechanism. Because it can handle long-range dependencies between the elements of an input sequence, computer vision researchers have begun applying it to visual data. In this study, we developed an automatic hybrid face mask detection model that combines a Transformer neural network with a convolutional neural network to detect faces and determine whether people are wearing face masks. The proposed hybrid model's performance was evaluated and compared with other state-of-the-art face mask detection models. In the experiments, the proposed model achieved a best average precision of 89.4% with an execution time of 2.8 s. Thus, the proposed hybrid model is fit for a practical, real-time trial and can contribute to public healthcare in terms of infectious disease control.
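To make the abstract's point about long-range dependencies concrete, the following is a minimal sketch of scaled dot-product self-attention in plain Python. It is not the authors' model: the identity query/key/value projections, the `self_attention` helper, and the toy `patches` sequence are assumptions chosen only for illustration. It shows that each element's attention weights depend on content similarity rather than on distance in the sequence.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    Assumption: queries, keys, and values are the raw token vectors;
    a real Transformer would apply learned linear projections first.
    """
    dim = len(tokens[0])
    outputs, weights = [], []
    for query in tokens:
        # Similarity of this token to every token, near or far.
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in tokens]
        row = softmax(scores)
        weights.append(row)
        # Output is the attention-weighted average of all value vectors.
        outputs.append([sum(w * v[d] for w, v in zip(row, tokens))
                        for d in range(dim)])
    return outputs, weights

# Hypothetical toy sequence: the first and last elements are similar,
# the two in between are not.
patches = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
outputs, weights = self_attention(patches)
```

In this toy input, the first element attends to the last element as strongly as to itself, even though they sit at opposite ends of the sequence. A convolution with a small kernel would only mix neighbouring elements in a single layer, which is the limitation the abstract refers to.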

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8185/10280424/c081abad7f7e/peerj-cs-09-1265-g001.jpg
