Graduate School of Engineering, Nagoya University, Nagoya 464-8603, Japan.
Institute of Innovation for Future Society, Nagoya University, Nagoya 464-8601, Japan.
Neural Netw. 2021 Nov;143:42-51. doi: 10.1016/j.neunet.2021.05.007. Epub 2021 May 12.
We investigate the classification performance of neural networks (NNs) from a topological perspective, with the aim of guaranteeing the stability of their inference. An NN that classifies a dataset accurately maps it into a hidden space while disentangling intertwined data. In doing so, NNs sometimes acquire a forcible mapping, and this forcible mapping generates outliers. The mapping around the outliers is unstable because the outputs change drastically there. Hence, we define a stable NN as one that does not generate outliers. To investigate whether outliers may exist, we use persistent homology together with a method for estimating confidence sets for persistence diagrams. Combining the two enables us to test whether the geometry of interest is topologically simple, that is, free of outliers. In this work, we use the MNIST and CIFAR-10 datasets and investigate, for several NNs, the relationship between classification performance and topological characteristics. Results on the MNIST dataset show that the test accuracy of every network is high, exceeding 98%, even though the transformed dataset is not topologically simple. Results on the CIFAR-10 dataset likewise indicate that outliers may exist in the mappings learned by accurate convolutional NNs. Therefore, we conclude that the presented investigation is necessary to guarantee that NNs, in particular deep NNs, do not acquire unstable mappings for forcible classification.
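The idea of testing whether a mapped point cloud is "topologically simple" can be illustrated in a much-reduced form for connected components (0-dimensional homology) only. The sketch below is not the authors' pipeline: the function names `h0_deaths` and `count_components` are hypothetical, and the parameter `band` is an assumed stand-in for the noise width that a confidence-set estimate would supply, which is not reproduced here. It assumes a filtration by growing balls of radius r around each point, so two components merge at half the length of the shortest edge joining them.

```python
import math


def h0_deaths(points):
    """Death radii of 0-dimensional classes, in ascending order.

    Every point is born at radius 0. A component dies when balls of radius r
    around the points first connect it to another component, i.e. at half the
    length of the merging edge (Kruskal's algorithm with union-find).
    The single component that never dies is not listed.
    """
    n = len(points)
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    deaths = []
    for length, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            deaths.append(length / 2.0)
    return deaths


def count_components(points, band):
    """Number of components whose persistence exceeds `band`.

    `band` is an assumed noise width (hypothetical parameter); the +1 accounts
    for the component that persists forever.
    """
    return 1 + sum(1 for d in h0_deaths(points) if d > band)


# Toy data: a 5 x 5 unit grid, with and without one distant point.
grid = [(float(x), float(y)) for x in range(5) for y in range(5)]
print(count_components(grid, band=2.0))                    # 1
print(count_components(grid + [(20.0, 20.0)], band=2.0))   # 2
```

Under these assumptions, a count of 1 corresponds to a cloud with a single significant component, while a larger count signals points that stay isolated well beyond the noise scale. Higher-dimensional features (loops, voids) would need a full persistent homology library rather than this union-find shortcut.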