Zurowietz Martin, Nattkemper Tim W
Biodata Mining Group, Faculty of Technology, Bielefeld University, Bielefeld, Germany.
Front Artif Intell. 2020 Jul 23;3:49. doi: 10.3389/frai.2020.00049. eCollection 2020.
Deep artificial neural networks have become the go-to method for many machine learning tasks. In the field of computer vision, deep convolutional neural networks achieve state-of-the-art performance for tasks such as classification, object detection, or instance segmentation. As deep neural networks become more and more complex, their inner workings become more and more opaque, rendering them a "black box" whose decision making process is no longer comprehensible. In recent years, various methods have been presented that attempt to peek inside the black box and to visualize the inner workings of deep neural networks, with a focus on deep convolutional neural networks for computer vision. These methods can serve as a toolbox to facilitate the design and inspection of neural networks for computer vision and the interpretation of the decision making process of the network. Here, we present the new tool Interactive Feature Localization in Deep neural networks (IFeaLiD) which provides a novel visualization approach to convolutional neural network layers. The tool interprets neural network layers as multivariate feature maps and visualizes the similarity between the feature vectors of individual pixels of an input image in a heat map display. The similarity display can reveal how the input image is perceived by different layers of the network and how the perception of one particular image region compares to the perception of the remaining image. IFeaLiD runs interactively in a web browser and can process even high resolution feature maps in real time by using GPU acceleration with WebGL 2. We present examples from four computer vision datasets with feature maps from different layers of a pre-trained ResNet101. IFeaLiD is open source and available online at https://ifealid.cebitec.uni-bielefeld.de.
深度人工神经网络已成为许多机器学习任务的首选方法。在计算机视觉领域,深度卷积神经网络在分类、目标检测或实例分割等任务中取得了领先的性能。随着深度神经网络变得越来越复杂,其内部工作原理变得越来越不透明,使其成为一个“黑匣子”,其决策过程不再可理解。近年来,已经提出了各种方法来窥探这个黑匣子并可视化深度神经网络的内部工作原理,重点是用于计算机视觉的深度卷积神经网络。这些方法可以作为一个工具箱,以促进用于计算机视觉的神经网络的设计和检查以及网络决策过程的解释。在这里,我们展示了深度神经网络中的交互式特征定位新工具(IFeaLiD),它为卷积神经网络层提供了一种新颖的可视化方法。该工具将神经网络层解释为多变量特征图,并在热图显示中可视化输入图像单个像素的特征向量之间的相似性。相似性显示可以揭示网络的不同层如何感知输入图像,以及一个特定图像区域的感知与其余图像的感知相比如何。IFeaLiD在网络浏览器中交互式运行,并且可以通过使用WebGL 2的GPU加速实时处理甚至高分辨率的特征图。我们展示了来自四个计算机视觉数据集的示例,其中包含预训练的ResNet101不同层的特征图。IFeaLiD是开源的,可在https://ifealid.cebitec.uni-bielefeld.de在线获取。