School of Electronics and Information Engineering, Soochow University, Suzhou 215006, PR China.
Neural Netw. 2024 Dec;180:106664. doi: 10.1016/j.neunet.2024.106664. Epub 2024 Aug 27.
Complex-valued convolutional neural networks (CVCNNs) have been demonstrated effectiveness in classifying complex signals and synthetic aperture radar (SAR) images. However, due to the introduction of complex-valued parameters, CVCNNs tend to become redundant with heavy floating-point operations. Model sparsity is emerged as an efficient method of removing the redundancy without much loss of performance. Currently, there are few studies on the sparsity problem of CVCNNs. Therefore, a complex-valued soft-log threshold reweighting (CV-SLTR) algorithm is proposed for the design of sparse CVCNN to reduce the number of weight parameters and simplify the structure of CVCNN. On one hand, considering the difference between complex and real numbers, we redefine and derive the complex-valued log-sum threshold method. On the other hand, by considering the distinctive characteristics of complex-valued convolutional (CConv) layers and complex-valued fully connected (CFC) layers of CVCNNs, the complex-valued soft and log-sum threshold methods are respectively developed to prune the weights of different layers during the forward propagation, and the sparsity thresholds are optimized during the backward propagation by inducing a sparsity budget. Furthermore, different optimizers can be integrated with CV-SLTR. When stochastic gradient descent (SGD) is used, the convergence of CV-SLTR is proved if Lipschitzian continuity is satisfied. Experiments on the RadioML 2016.10A and S1SLC-CVDL datasets show that the proposed algorithm is efficient for the sparsity of CVCNNs. It is worth noting that the proposed algorithm has fast sparsity speed while maintaining high classification accuracy. These demonstrate the feasibility and potential of the CV-SLTR algorithm.
复值卷积神经网络 (CVCNNs) 在复杂信号和合成孔径雷达 (SAR) 图像分类方面表现出了有效性。然而,由于引入了复数值参数,CVCNNs 往往由于浮点运算量过大而变得冗余。模型稀疏性作为一种有效的去除冗余的方法而出现,同时性能损失很小。目前,关于 CVCNNs 的稀疏性问题的研究较少。因此,提出了一种用于稀疏 CVCNN 设计的复值软对数阈值重加权 (CV-SLTR) 算法,以减少权参数的数量并简化 CVCNN 的结构。一方面,考虑到复数和实数之间的差异,我们重新定义并推导出复值对数和阈值方法。另一方面,通过考虑 CVCNNs 的复值卷积 (CConv) 层和复值全连接 (CFC) 层的独特特征,分别开发了复值软对数和阈值方法,以在正向传播过程中修剪不同层的权重,并通过引入稀疏性预算来优化反向传播过程中的稀疏性阈值。此外,不同的优化器可以与 CV-SLTR 集成。当使用随机梯度下降 (SGD) 时,如果满足 Lipschitz 连续性,则可以证明 CV-SLTR 的收敛性。在 RadioML 2016.10A 和 S1SLC-CVDL 数据集上的实验表明,所提出的算法对于 CVCNNs 的稀疏性是有效的。值得注意的是,所提出的算法在保持高分类精度的同时,具有较快的稀疏速度。这些证明了 CV-SLTR 算法的可行性和潜力。