Dept. of Computer and Software, Hanyang University, Seongdong-gu, Seoul, South Korea.
PLoS One. 2022 Aug 16;17(8):e0267282. doi: 10.1371/journal.pone.0267282. eCollection 2022.
eXplainable Artificial Intelligence (XAI) is a new trend of machine learning. Machine learning models are used to predict or decide something, and they derive output based on a large volume of data set. Here, the problem is that it is hard to know why such prediction was derived, especially when using deep learning models. It makes the models unreliable in the case of reliability-critical applications. So, it is required to explain how they derived such output. It is a reliability-critical application for self-driving cars because the mistakes made by the computers inside them can lead to critical accidents. So, it is necessary to adopt XAI models in this field. In this paper, we propose an XAI method based on computing and explaining the difference of the output values of the neurons in the last hidden layer of convolutional neural networks. First, we input the original image and some modified images of it. Then we derive output values for each image and compare these values. Then, we introduce the Sensitivity Analysis technique to explain which parts of the original image are needed to distinguish the category. In detail, we divide the image into several parts and fill these parts with shades. First, we compute the influence value on the vector indicating the last hidden layer of the model for each of these parts. Then we draw shades whose darkness is in proportion to the influence values. The experimental results show that our approach for XAI in self-driving cars finds the parts needed to distinguish the category of these images accurately.
可解释人工智能(XAI)是机器学习的一个新趋势。机器学习模型用于预测或决策,它们根据大量数据集得出输出。这里的问题是,很难知道为什么会得出这样的预测,尤其是在使用深度学习模型时。这使得模型在可靠性关键应用中不可靠。因此,需要解释它们是如何得出这样的输出的。对于自动驾驶汽车来说,这是一个可靠性关键的应用,因为它们内部计算机的错误可能导致严重事故。因此,有必要在该领域采用 XAI 模型。在本文中,我们提出了一种基于计算和解释卷积神经网络最后隐藏层神经元输出值差异的 XAI 方法。首先,我们输入原始图像和一些修改后的图像。然后,我们为每张图像导出输出值并比较这些值。然后,我们引入敏感性分析技术来解释区分类别所需的原始图像的哪些部分。具体来说,我们将图像分成几个部分,并为这些部分填充阴影。首先,我们计算每个部分对模型最后隐藏层向量的影响值。然后,我们绘制阴影,其暗度与影响值成正比。实验结果表明,我们在自动驾驶汽车中的 XAI 方法能够准确地找到区分这些图像类别的所需部分。