Kaczorowska Monika, Plechawska-Wójcik Małgorzata, Tokovarov Mikhail
Department of Computer Science, Lublin University of Technology, 20-618 Lublin, Poland.
Brain Sci. 2021 Feb 9;11(2):210. doi: 10.3390/brainsci11020210.
The paper is focussed on the assessment of cognitive workload level using selected machine learning models. In the study, eye-tracking data were gathered from 29 healthy volunteers during examination with three versions of the computerised version of the digit symbol substitution test (DSST). Understanding cognitive workload is of great importance in analysing human mental fatigue and the performance of intellectual tasks. It is also essential in the context of explanation of the brain cognitive process. Eight three-class classification machine learning models were constructed and analysed. Furthermore, the technique of interpretable machine learning model was applied to obtain the measures of feature importance and its contribution to the brain cognitive functions. The measures allowed improving the quality of classification, simultaneously lowering the number of applied features to six or eight, depending on the model. Moreover, the applied method of explainable machine learning provided valuable insights into understanding the process accompanying various levels of cognitive workload. The main classification performance metrics, such as F1, recall, precision, accuracy, and the area under the Receiver operating characteristic curve (ROC AUC) were used in order to assess the quality of classification quantitatively. The best result obtained on the complete feature set was as high as 0.95 (F1); however, feature importance interpretation allowed increasing the result up to 0.97 with only seven of 20 features applied.
本文聚焦于使用选定的机器学习模型评估认知工作量水平。在该研究中,在对29名健康志愿者进行三种版本的数字化符号替换测试(DSST)计算机化版本检查期间收集了眼动追踪数据。理解认知工作量在分析人类精神疲劳和智力任务表现方面非常重要。在解释大脑认知过程的背景下,这也是必不可少的。构建并分析了八个三类分类机器学习模型。此外,应用可解释机器学习模型技术来获得特征重要性的度量及其对大脑认知功能的贡献。这些度量有助于提高分类质量,同时根据模型将应用特征的数量减少到六个或八个。此外,所应用的可解释机器学习方法为理解伴随不同认知工作量水平的过程提供了有价值的见解。为了定量评估分类质量,使用了主要的分类性能指标,如F1、召回率、精确率、准确率和接收器操作特征曲线下面积(ROC AUC)。在完整特征集上获得的最佳结果高达0.95(F1);然而,通过仅应用20个特征中的七个,特征重要性解释使结果提高到了0.97。