California Partners for Advanced Transportation Technology, University of California, Berkeley, CA 94804, USA.
Sensors (Basel). 2022 Aug 30;22(17):6529. doi: 10.3390/s22176529.
Drowsiness is one of the leading causes of traffic accidents. For those who operate large machinery or motor vehicles, incidents due to lack of sleep can cause property damage and sometimes lead to grave consequences of injuries and fatality. This study aims to design learning models to recognize drowsiness through human facial features. In addition, this work analyzes the attentions of individual neurons in the learning model to understand how neural networks interpret drowsiness. For this analysis, gradient-weighted class activation mapping (Grad-CAM) is implemented in the neural networks to display the attention of neurons. The eye and face images are processed separately to the model for the training process. The results initially show that better results can be obtained by delivering eye images alone. The effect of Grad-CAM is also more reasonable using eye images alone. Furthermore, this work proposed a feature analysis method, K-nearest neighbors Sigma (KNN-Sigma), to estimate the homogeneous concentration and heterogeneous separation of the extracted features. In the end, we found that the fusion of face and eye signals gave the best results for recognition accuracy and KNN-sigma. The area under the curve (AUC) of using face, eye, and fusion images are 0.814, 0.897, and 0.935, respectively.
困意是导致交通事故的主要原因之一。对于操作大型机械或机动车的人来说,因睡眠不足而导致的事故可能会造成财产损失,有时甚至会导致重伤或死亡等严重后果。本研究旨在设计通过人脸特征识别困意的学习模型。此外,本工作还分析了学习模型中个别神经元的注意力,以了解神经网络如何解释困意。为此,在神经网络中实现了梯度加权类激活映射(Grad-CAM),以显示神经元的注意力。模型的训练过程中分别处理眼睛和脸部图像。结果初步表明,仅提供眼部图像可以获得更好的结果。仅使用眼部图像时,Grad-CAM 的效果也更为合理。此外,本工作还提出了一种特征分析方法,K-最近邻 Sigma(KNN-Sigma),用于估计提取特征的同质性浓度和异质性分离。最后,我们发现融合人脸和眼部信号的方法在识别准确率和 KNN-sigma 方面效果最佳。使用人脸、眼睛和融合图像的曲线下面积(AUC)分别为 0.814、0.897 和 0.935。