Patel Sahaj Anilbhai, Yildirim Abidin
Department of Electrical and Computer, University of Alabama at Birmingham, Birmingham, AL 35205, USA.
J Imaging. 2024 May 15;10(5):121. doi: 10.3390/jimaging10050121.
In graph theory, the weighted Laplacian matrix is the most utilized technique to interpret the local and global properties of a complex graph structure within computer vision applications. However, with increasing graph nodes, the Laplacian matrix's dimensionality also increases accordingly. Therefore, there is always the "curse of dimensionality"; In response to this challenge, this paper introduces a new approach to reducing the dimensionality of the weighted Laplacian matrix by utilizing the Gershgorin circle theorem by transforming the weighted Laplacian matrix into a strictly diagonal domain and then estimating rough eigenvalue inclusion of a matrix. The estimated inclusions are represented as reduced features, termed GC features; The proposed Gershgorin circle feature extraction (GCFE) method was evaluated using three publicly accessible computer vision datasets, varying image patch sizes, and three different graph types. The GCFE method was compared with eight distinct studies. The GCFE demonstrated a notable positive Z-score compared to other feature extraction methods such as I-PCA, kernel PCA, and spectral embedding. Specifically, it achieved an average Z-score of 6.953 with the 2D grid graph type and 4.473 with the pairwise graph type, particularly on the E_Balanced dataset. Furthermore, it was observed that while the accuracy of most major feature extraction methods declined with smaller image patch sizes, the GCFE maintained consistent accuracy across all tested image patch sizes. When the GCFE method was applied to the E_MNSIT dataset using the K-NN graph type, the GCFE method confirmed its consistent accuracy performance, evidenced by a low standard deviation (SD) of 0.305. This performance was notably lower compared to other methods like Isomap, which had an SD of 1.665, and LLE, which had an SD of 1.325; The GCFE outperformed most feature extraction methods in terms of classification accuracy and computational efficiency. The GCFE method also requires fewer training parameters for deep-learning models than the traditional weighted Laplacian method, establishing its potential for more effective and efficient feature extraction in computer vision tasks.
在图论中,加权拉普拉斯矩阵是计算机视觉应用中用于解释复杂图结构的局部和全局属性的最常用技术。然而,随着图节点数量的增加,拉普拉斯矩阵的维度也会相应增加。因此,总是存在“维度诅咒”;为应对这一挑战,本文引入了一种新方法,即利用格什戈林圆盘定理,通过将加权拉普拉斯矩阵变换为严格对角占优矩阵,然后估计矩阵的粗略特征值包含区域,来降低加权拉普拉斯矩阵的维度。估计的包含区域表示为降维特征,称为GC特征;使用三个可公开获取的计算机视觉数据集、不同的图像块大小和三种不同的图类型,对所提出的格什戈林圆盘特征提取(GCFE)方法进行了评估。将GCFE方法与八项不同的研究进行了比较。与其他特征提取方法(如I-PCA、核主成分分析和谱嵌入)相比,GCFE表现出显著的正Z分数。具体而言,对于二维网格图类型,它的平均Z分数为6.953,对于成对图类型,在E_Balanced数据集上的平均Z分数为4.473。此外,观察到虽然大多数主要特征提取方法的准确率随着图像块尺寸变小而下降,但GCFE在所有测试的图像块尺寸上都保持了一致的准确率。当使用K-NN图类型将GCFE方法应用于E_MNSIT数据集时,GCFE方法证实了其一致的准确率性能,其低标准差(SD)为0.305就是证明。与其他方法(如实值映射法,其标准差为1.665,以及局部线性嵌入法,其标准差为1.325)相比,该性能明显更低;在分类准确率和计算效率方面,GCFE优于大多数特征提取方法。与传统的加权拉普拉斯方法相比,GCFE方法对于深度学习模型所需的训练参数也更少,这表明它在计算机视觉任务中具有更有效和高效的特征提取潜力。