Sertic Peter, Alahmar Ayman, Akilan Thangarajah, Javorac Marko, Gupta Yash
Department of Software Engineering, Lakehead University, Thunder Bay, ON P7B 5E1, Canada.
Healthcare (Basel). 2022 May 9;10(5):873. doi: 10.3390/healthcare10050873.
This paper proposes and implements a dedicated hardware accelerated real-time face-mask detection system using deep learning (DL). The proposed face-mask detection model (MaskDetect) was benchmarked on three embedded platforms: Raspberry PI 4B with either Google Coral USB TPU or Intel Neural Compute Stick 2 VPU, and NVIDIA Jetson Nano. The MaskDetect was independently quantised and optimised for each hardware accelerated implementation. An ablation study was carried out on the proposed model and its quantised implementations on the embedded hardware configurations above as a comparison to other popular transfer-learning models, such as VGG16, ResNet-50V2, and InceptionV3, which are compatible with these acceleration hardware platforms. The ablation study revealed that MaskDetect achieved excellent average face-mask detection performance with accuracy above 94% across all embedded platforms except for Coral, which achieved an average accuracy of nearly 90%. With respect to detection performance (accuracy), inference speed (frames per second (FPS)), and product cost, the ablation study revealed that implementation on Jetson Nano is the best choice for real-time face-mask detection. It achieved 94.2% detection accuracy and twice greater FPS when compared to its desktop hardware counterpart.
本文提出并实现了一种使用深度学习(DL)的专用硬件加速实时面部口罩检测系统。所提出的面部口罩检测模型(MaskDetect)在三个嵌入式平台上进行了基准测试:配备谷歌珊瑚USB TPU或英特尔神经计算棒2 VPU的树莓派4B,以及英伟达Jetson Nano。MaskDetect针对每个硬件加速实现进行了独立量化和优化。针对上述嵌入式硬件配置,对所提出的模型及其量化实现进行了消融研究,并与其他流行的迁移学习模型(如与这些加速硬件平台兼容的VGG16、ResNet-50V2和InceptionV3)进行了比较。消融研究表明,MaskDetect在所有嵌入式平台上都实现了出色的平均面部口罩检测性能,除了珊瑚平台,其平均准确率接近90%,其他平台的准确率均高于94%。关于检测性能(准确率)、推理速度(每秒帧数(FPS))和产品成本,消融研究表明,在Jetson Nano上实现是实时面部口罩检测的最佳选择。与桌面硬件相比,它实现了94.2%的检测准确率,并且FPS提高了一倍。