Som Anirudh, Choi Hongjun, Ramamurthy Karthikeyan Natesan, Buman Matthew P, Turaga Pavan
School of Arts, Media and Engineering, Arizona State University.
School of Electrical, Computer and Energy Engineering, Arizona State University.
Conf Comput Vis Pattern Recognit Workshops. 2020 Jun;2020:3639-3648. doi: 10.1109/cvprw50498.2020.00425. Epub 2020 Jul 28.
Topological features such as persistence diagrams and their functional approximations like persistence images (PIs) have been showing substantial promise for machine learning and computer vision applications. This is greatly attributed to the robustness topological representations provide against different types of physical nuisance variables seen in real-world data, such as view-point, illumination, and more. However, key bottlenecks to their large scale adoption are computational expenditure and difficulty incorporating them in a differentiable architecture. We take an important step in this paper to mitigate these bottlenecks by proposing a novel one-step approach to generate PIs directly from the input data. We design two separate convolutional neural network architectures, one designed to take in multi-variate time series signals as input and another that accepts multi-channel images as input. We call these networks Signal PI-Net and Image PI-Net respectively. To the best of our knowledge, we are the first to propose the use of deep learning for computing topological features directly from data. We explore the use of the proposed PI-Net architectures on two applications: human activity recognition using tri-axial accelerometer sensor data and image classification. We demonstrate the ease of fusion of PIs in supervised deep learning architectures and speed up of several orders of magnitude for extracting PIs from data. Our code is available at https://github.com/anirudhsom/PI-Net.
诸如持久图及其功能近似(如持久图像(PI))等拓扑特征在机器学习和计算机视觉应用中已展现出巨大的潜力。这很大程度上归因于拓扑表示对现实世界数据中所见的不同类型物理干扰变量(如视点、光照等)具有鲁棒性。然而,它们大规模应用的关键瓶颈在于计算成本以及将其纳入可微架构的难度。在本文中,我们迈出了重要一步,通过提出一种新颖的一步法直接从输入数据生成持久图像来缓解这些瓶颈。我们设计了两种独立的卷积神经网络架构,一种设计为以多变量时间序列信号作为输入,另一种接受多通道图像作为输入。我们分别将这些网络称为信号PI网络和图像PI网络。据我们所知,我们是首个提出使用深度学习直接从数据计算拓扑特征的。我们在两个应用中探索了所提出的PI网络架构的使用:使用三轴加速度计传感器数据进行人类活动识别以及图像分类。我们展示了持久图像在监督深度学习架构中融合的简便性以及从数据中提取持久图像时几个数量级的加速。我们的代码可在https://github.com/anirudhsom/PI-Net获取。