School of Computer Science and Engineering, University of New South Wales (UNSW), Australia; Centre for Accident Research and Road Safety - Queensland (CARRS-Q), Queensland University of Technology (QUT), Australia.
Centre for Accident Research and Road Safety - Queensland (CARRS-Q), Queensland University of Technology (QUT), Australia; School of Psychology and Wellbeing, University of Southern Queensland (USQ), Australia; School of Exercise and Nutrition Sciences, Queensland University of Technology (QUT), Australia.
Comput Methods Programs Biomed. 2024 Jan;243:107925. doi: 10.1016/j.cmpb.2023.107925. Epub 2023 Nov 8.
Drowsiness behind the wheel is a major road safety issue with efforts focused on developing drowsy driving detection systems. However, most drowsy driving detection studies using physiological signals have focused on developing a 'black box' machine learning classifier, with much less focus on 'robustness' and 'explainability'-two crucial properties of a trustworthy machine learning model. Therefore, this study has focused on using multiple validation techniques to evaluate the overall performance of such a system using multiple supervised machine learning-based classifiers and then unbox the black box model using explainable machine learning.
Driving was simulated via a 30-minute psychomotor vigilance task while the participants reported their level of subjective sleepiness with their physiological signals: electroencephalogram (EEG), electrooculogram (EOG) and electrocardiogram (ECG) being recorded. Six different techniques, comprising subject-dependent and independent techniques were applied for model validation and robustness testing with three supervised machine learning classifiers, namely K-nearest neighbours (KNN), support vector machines (SVM) and random forest (RF), and two explainable methods, namely SHapley Additive exPlanation (SHAP) analysis and partial dependency analysis (PDA) were leveraged for model interpretation.
The study identified the leave one participant out, a subject-independent validation technique to be most useful, with the best sensitivity of 70.3 %, specificity of 82.2 %, and an accuracy of 80.1 % using the random forest classifier in addressing the autocorrelation issue due to inter-individual differences in physiological signals. Moreover, the explainable results suggest most important physiological features for drowsiness detection, with a clear cut-off in the decision boundary.
The implication of the study will ensure a rigorous validation for robustness testing and an explainable machine learning approach to developing a trustworthy drowsiness detection system and enhancing road safety. The explainable machine learning-based results show promise in real-life deployment of the physiological-signal based in-vehicle trustworthy drowsiness detection system, with higher reliability and explainability, along with a lower system cost.
驾驶时困倦是一个重大的道路安全问题,因此人们致力于开发困倦驾驶检测系统。然而,大多数使用生理信号的困倦驾驶检测研究都侧重于开发一个“黑箱”机器学习分类器,而很少关注“稳健性”和“可解释性”——这是可信机器学习模型的两个关键特性。因此,本研究侧重于使用多种验证技术,使用多种基于监督机器学习的分类器评估该系统的整体性能,然后使用可解释的机器学习来“打开黑箱”模型。
通过 30 分钟的精神运动警觉性任务模拟驾驶,同时参与者使用生理信号报告他们的主观困倦程度:记录脑电图(EEG)、眼电图(EOG)和心电图(ECG)。应用了六种不同的技术,包括基于受试者的和独立的技术,用于模型验证和稳健性测试,使用了三个监督机器学习分类器,即 K-最近邻(KNN)、支持向量机(SVM)和随机森林(RF),以及两种可解释方法,即 Shapley 加法解释(SHAP)分析和偏依赖分析(PDA),用于模型解释。
研究发现,在解决由于生理信号个体间差异引起的自相关问题时,独立于受试者的留一参与者验证技术最为有用,使用随机森林分类器的敏感性为 70.3%,特异性为 82.2%,准确性为 80.1%。此外,可解释的结果表明了对于困倦检测最重要的生理特征,在决策边界上有明显的划分。
该研究的意义在于为稳健性测试提供严格的验证,并采用可解释的机器学习方法开发值得信赖的困倦检测系统,从而提高道路安全。基于可解释机器学习的结果显示,基于生理信号的车内可信困倦检测系统具有更高的可靠性和可解释性,以及更低的系统成本,有望在实际应用中部署。