Overcoming Dimensionality Constraints: A Gershgorin Circle Theorem-Based Feature Extraction for Weighted Laplacian Matrices in Computer Vision Applications

Authors

Patel Sahaj Anilbhai, Yildirim Abidin

Affiliation

Department of Electrical and Computer Engineering, University of Alabama at Birmingham, Birmingham, AL 35205, USA.

Publication information

J Imaging. 2024 May 15;10(5):121. doi: 10.3390/jimaging10050121.

DOI:10.3390/jimaging10050121

PMID:38786575

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11122193/

Abstract

In graph theory, the weighted Laplacian matrix is the most widely used technique for interpreting the local and global properties of a complex graph structure in computer vision applications. However, as the number of graph nodes increases, the dimensionality of the Laplacian matrix increases accordingly, so the "curse of dimensionality" is always present. In response to this challenge, this paper introduces a new approach to reducing the dimensionality of the weighted Laplacian matrix using the Gershgorin circle theorem: the weighted Laplacian matrix is transformed into a strictly diagonally dominant form, and rough eigenvalue inclusions of the matrix are then estimated. The estimated inclusions are represented as reduced features, termed GC features. The proposed Gershgorin circle feature extraction (GCFE) method was evaluated using three publicly accessible computer vision datasets, varying image patch sizes, and three different graph types. The GCFE method was compared with eight distinct studies. The GCFE demonstrated a notable positive Z-score compared to other feature extraction methods such as I-PCA, kernel PCA, and spectral embedding. Specifically, it achieved an average Z-score of 6.953 with the 2D grid graph type and 4.473 with the pairwise graph type, particularly on the E_Balanced dataset. Furthermore, while the accuracy of most major feature extraction methods declined with smaller image patch sizes, the GCFE maintained consistent accuracy across all tested image patch sizes. When the GCFE method was applied to the E_MNIST dataset using the K-NN graph type, it again showed consistent accuracy, evidenced by a low standard deviation (SD) of 0.305. This SD was notably lower than those of other methods such as Isomap (SD of 1.665) and LLE (SD of 1.325). The GCFE outperformed most feature extraction methods in terms of classification accuracy and computational efficiency. The GCFE method also requires fewer training parameters for deep-learning models than the traditional weighted Laplacian method, establishing its potential for more effective and efficient feature extraction in computer vision tasks.
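The Gershgorin circle theorem states that every eigenvalue of a square matrix A lies in at least one disc centered at a diagonal entry a_ii with radius equal to the sum of |a_ij| over j ≠ i. The sketch below illustrates how such discs could turn an n × n weighted Laplacian into n (center, radius) pairs. It is a minimal illustration under stated assumptions, not the authors' implementation: the Gaussian edge weight, the `sigma` parameter, the function names, and the diagonal shift `eps` are all hypothetical. The shift stands in for the paper's transformation to a strictly diagonally dominant form, which the abstract does not specify; without some such step, each center of a plain Laplacian would equal its radius (both are the node degree). Plain Python lists are used to keep the example dependency-free.

```python
import math


def grid_laplacian(patch, sigma=0.1):
    """Weighted Laplacian L = D - W of a 4-neighbour grid graph over a 2D patch.

    Edge weights use a Gaussian kernel on intensity differences
    (an assumed weighting, chosen only for illustration).
    """
    h, w = len(patch), len(patch[0])
    n = h * w
    W = [[0.0] * n for _ in range(n)]
    for r in range(h):
        for c in range(w):
            i = r * w + c
            # Link each pixel to its right and lower neighbour (symmetric edges).
            for dr, dc in ((0, 1), (1, 0)):
                rr, cc = r + dr, c + dc
                if rr < h and cc < w:
                    j = rr * w + cc
                    diff = patch[r][c] - patch[rr][cc]
                    W[i][j] = W[j][i] = math.exp(-diff * diff / (2 * sigma * sigma))
    # Off-diagonal entries are -w_ij; the diagonal holds the weighted degree.
    L = [[-W[i][j] for j in range(n)] for i in range(n)]
    for i in range(n):
        L[i][i] = sum(W[i])
    return L


def gc_features(L, eps=1e-3):
    """Return one (center, radius) Gershgorin pair per row of L + eps * I.

    The shift eps > 0 is an assumption that makes the matrix strictly
    diagonally dominant, so each center strictly exceeds its radius.
    """
    feats = []
    for i, row in enumerate(L):
        center = row[i] + eps
        radius = sum(abs(v) for j, v in enumerate(row) if j != i)
        feats.append((center, radius))
    return feats


# Usage: a 3 x 3 patch gives a 9 x 9 Laplacian but only 9 x 2 features.
patch = [[0.0, 0.1, 0.2], [0.1, 0.2, 0.3], [0.2, 0.3, 0.4]]
features = gc_features(grid_laplacian(patch))
print(len(features), len(features[0]))
```

The point of the sketch is the size reduction: the feature count grows linearly with the number of nodes rather than quadratically, and no eigendecomposition is needed.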

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd33/11122193/cb367d23990b/jimaging-10-00121-g001.jpg

Similar articles

Gershgorin circle theorem-based feature extraction for biomedical signal analysis.

Front Neuroinform. 2024 May 16;18:1395916. doi: 10.3389/fninf.2024.1395916. eCollection 2024.

A Study on Dimensionality Reduction and Parameters for Hyperspectral Imagery Based on Manifold Learning.

Sensors (Basel). 2024 Mar 25;24(7):2089. doi: 10.3390/s24072089.

Signed Graph Metric Learning via Gershgorin Disc Perfect Alignment.

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7219-7234. doi: 10.1109/TPAMI.2021.3091682. Epub 2022 Sep 15.

Point Cloud Sampling via Graph Balancing and Gershgorin Disc Alignment.

IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):868-886. doi: 10.1109/TPAMI.2022.3143089. Epub 2022 Dec 5.

A Framework for Detecting Thyroid Cancer from Ultrasound and Histopathological Images Using Deep Learning, Meta-Heuristics, and MCDM Algorithms.

J Imaging. 2023 Aug 27;9(9):173. doi: 10.3390/jimaging9090173.

A graph-Laplacian-based feature extraction algorithm for neural spike sorting.

Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:3142-5. doi: 10.1109/IEMBS.2009.5332571.

Graph-Laplacian features for neural waveform classification.

IEEE Trans Biomed Eng. 2011 May;58(5):1365-72. doi: 10.1109/TBME.2010.2090349. Epub 2010 Nov 1.

Graph Convolutional Network Using Adaptive Neighborhood Laplacian Matrix for Hyperspectral Images with Application to Rice Seed Image Classification.

Sensors (Basel). 2023 Mar 27;23(7):3515. doi: 10.3390/s23073515.

Spatiotemporal convolution sleep network based on graph attention mechanism with automatic feature extraction.

Comput Methods Programs Biomed. 2024 Feb;244:107930. doi: 10.1016/j.cmpb.2023.107930. Epub 2023 Nov 14.

References cited in this article

Non-stationary neural signal to image conversion framework for image-based deep learning algorithms.

Front Neuroinform. 2023 Mar 24;17:1081160. doi: 10.3389/fninf.2023.1081160. eCollection 2023.

Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images.

PeerJ. 2018 Apr 16;6:e4568. doi: 10.7717/peerj.4568. eCollection 2018.

Decision tree methods: applications for classification and prediction.

Shanghai Arch Psychiatry. 2015 Apr 25;27(2):130-5. doi: 10.11919/j.issn.1002-0829.215044.

Hessian eigenmaps: locally linear embedding techniques for high-dimensional data.

Proc Natl Acad Sci U S A. 2003 May 13;100(10):5591-6. doi: 10.1073/pnas.1031596100. Epub 2003 Apr 30.

Nonlinear dimensionality reduction by locally linear embedding.

Science. 2000 Dec 22;290(5500):2323-6. doi: 10.1126/science.290.5500.2323.

A global geometric framework for nonlinear dimensionality reduction.

Science. 2000 Dec 22;290(5500):2319-23. doi: 10.1126/science.290.5500.2319.
