Qiao Weidong, Xu Yang, Li Hui
Key Lab of Smart Prevention and Mitigation of Civil Engineering Disasters of the Ministry of Industry and Information Technology, Harbin Institute of Technology, Harbin 150090, China.
Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China.
Neural Netw. 2025 Mar;183:106980. doi: 10.1016/j.neunet.2024.106980. Epub 2024 Nov 28.
The weight-sharing mechanism of convolutional kernels ensures the translation equivariance of convolutional neural networks (CNNs) but not scale and rotation equivariance. This study proposes a SIM(2) Lie group-CNN, which can simultaneously keep scale, rotation, and translation equivariance for image classification tasks. The SIM(2) Lie group-CNN includes a lifting module, a series of group convolution modules, a global pooling layer, and a classification layer. The lifting module transfers the input image from Euclidean space to Lie group space, and the group convolution is parameterized through a fully connected network using the Lie Algebra coefficients of Lie group elements as inputs to achieve scale and rotation equivariance. It is worth noting that the mapping relationship between SIM(2) and its Lie Algebra and the distance measure of SIM(2) are defined explicitly in this paper, thus solving the problem of the metric of features on the space of SIM(2) Lie group, which contrasts with other Lie groups characterized by a single element, such as SO(2). The scale-rotation equivariance of Lie group-CNN is verified, and the best recognition accuracy is achieved on three categories of image datasets. Consequently, the SIM(2) Lie group-CNN can successfully extract geometric features and perform equivariant recognition on images with rotation and scale transformations.
卷积核的权重共享机制确保了卷积神经网络(CNN)的平移不变性,但不具备尺度和旋转不变性。本研究提出了一种SIM(2)李群卷积神经网络,它可以在图像分类任务中同时保持尺度、旋转和平移不变性。SIM(2)李群卷积神经网络包括一个提升模块、一系列群卷积模块、一个全局池化层和一个分类层。提升模块将输入图像从欧几里得空间转移到李群空间,群卷积通过一个全连接网络进行参数化,使用李群元素的李代数系数作为输入,以实现尺度和旋转不变性。值得注意的是,本文明确地定义了SIM(2)与其李代数之间的映射关系以及SIM(2)的距离度量,从而解决了SIM(2)李群空间上特征的度量问题,这与其他以单个元素为特征的李群(如SO(2))形成对比。验证了李群卷积神经网络的尺度旋转不变性,并在三类图像数据集上取得了最佳识别准确率。因此,SIM(2)李群卷积神经网络能够成功地提取几何特征,并对具有旋转和尺度变换的图像进行不变性识别。