基于正交向量直方图的空间信息添加图像分类。

Image classification by addition of spatial information based on histograms of orthogonal vectors.

机构信息

Department of Computer Science, National Textile University, Faisalabad, Pakistan.

Department of Software Engineering, Mirpur University of Science & Technology, Mirpur, Azad-Kashmir, Pakistan.

出版信息

PLoS One. 2018 Jun 8;13(6):e0198175. doi: 10.1371/journal.pone.0198175. eCollection 2018.

DOI:10.1371/journal.pone.0198175

PMID:29883455

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5993303/

Abstract

The Bag-of-Visual-Words (BoVW) model is widely used for image classification, object recognition and image retrieval problems. In BoVW model, the local features are quantized and 2-D image space is represented in the form of order-less histogram of visual words. The image classification performance suffers due to the order-less representation of image. This paper presents a novel image representation that incorporates the spatial information to the inverted index of BoVW model. The spatial information is added by calculating the global relative spatial orientation of visual words in a rotation invariant manner. For this, we computed the geometric relationship between triplets of identical visual words by calculating an orthogonal vector relative to each point in the triplets of identical visual words. The histogram of visual words is calculated on the basis of the magnitude of these orthogonal vectors. This calculation provides the unique information regarding the relative position of visual words when they are collinear. The proposed image representation is evaluated by using four standard image benchmarks. The experimental results and quantitative comparisons demonstrate that the proposed image representation outperforms the existing state-of-the-art in terms of classification accuracy.

摘要

BoVW 模型广泛应用于图像分类、目标识别和图像检索问题。在 BoVW 模型中，局部特征被量化，二维图像空间以无序的视觉词直方图的形式表示。由于图像的无序表示，图像分类性能受到影响。本文提出了一种新的图像表示方法，将空间信息纳入 BoVW 模型的倒排索引中。通过以旋转不变的方式计算视觉词的全局相对空间方向，添加了空间信息。为此，我们通过计算三个相同视觉词之间的每个点的正交向量来计算相同视觉词之间的三元组的几何关系。基于这些正交向量的大小计算视觉词的直方图。当视觉词共线时，此计算提供了有关视觉词相对位置的唯一信息。通过使用四个标准图像基准对所提出的图像表示进行评估。实验结果和定量比较表明，所提出的图像表示在分类准确性方面优于现有最先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c60/5993303/20103f4199cf/pone.0198175.g001.jpg

相似文献

Image classification by addition of spatial information based on histograms of orthogonal vectors.

PLoS One. 2018 Jun 8;13(6):e0198175. doi: 10.1371/journal.pone.0198175. eCollection 2018.

Modeling global geometric spatial information for rotation invariant classification of satellite images.

PLoS One. 2019 Jul 19;14(7):e0219833. doi: 10.1371/journal.pone.0219833. eCollection 2019.

A thousand words in a scene.

IEEE Trans Pattern Anal Mach Intell. 2007 Sep;29(9):1575-89. doi: 10.1109/TPAMI.2007.1155.

Texture-specific bag of visual words model and spatial cone matching-based method for the retrieval of focal liver lesions using multiphase contrast-enhanced CT images.

Int J Comput Assist Radiol Surg. 2018 Jan;13(1):151-164. doi: 10.1007/s11548-017-1671-9. Epub 2017 Nov 5.

A Hybrid Geometric Spatial Image Representation for scene classification.

PLoS One. 2018 Sep 12;13(9):e0203339. doi: 10.1371/journal.pone.0203339. eCollection 2018.

USB: ultrashort binary descriptor for fast visual matching and retrieval.

IEEE Trans Image Process. 2014 Aug;23(8):3671-83. doi: 10.1109/TIP.2014.2330794. Epub 2014 Jun 12.

A Novel Image Retrieval Based on Visual Words Integration of SIFT and SURF.

PLoS One. 2016 Jun 17;11(6):e0157428. doi: 10.1371/journal.pone.0157428. eCollection 2016.

Pooling region learning of visual word for image classification using bag-of-visual-words model.

PLoS One. 2020 Jun 5;15(6):e0234144. doi: 10.1371/journal.pone.0234144. eCollection 2020.

Universal and adapted vocabularies for generic visual categorization.

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1243-56. doi: 10.1109/TPAMI.2007.70755.

Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform.

J Med Syst. 2018 Jan 25;42(3):44. doi: 10.1007/s10916-017-0880-7.

引用本文的文献

How deep is your art: An experimental study on the limits of artistic understanding in a single-task, single-modality neural network.

PLoS One. 2024 Nov 6;19(11):e0305943. doi: 10.1371/journal.pone.0305943. eCollection 2024.

Multi-modal medical image classification using deep residual network and genetic algorithm.

PLoS One. 2023 Jun 29;18(6):e0287786. doi: 10.1371/journal.pone.0287786. eCollection 2023.

Compare the performance of the models in art classification.

PLoS One. 2021 Mar 12;16(3):e0248414. doi: 10.1371/journal.pone.0248414. eCollection 2021.

Pooling region learning of visual word for image classification using bag-of-visual-words model.

PLoS One. 2020 Jun 5;15(6):e0234144. doi: 10.1371/journal.pone.0234144. eCollection 2020.

Visual complexity modelling based on image features fusion of multiple kernels.

PeerJ. 2019 Jul 18;7:e7075. doi: 10.7717/peerj.7075. eCollection 2019.

Modeling global geometric spatial information for rotation invariant classification of satellite images.

PLoS One. 2019 Jul 19;14(7):e0219833. doi: 10.1371/journal.pone.0219833. eCollection 2019.

A Hybrid Geometric Spatial Image Representation for scene classification.

PLoS One. 2018 Sep 12;13(9):e0203339. doi: 10.1371/journal.pone.0203339. eCollection 2018.

本文引用的文献

Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform.

J Med Syst. 2018 Jan 25;42(3):44. doi: 10.1007/s10916-017-0880-7.

A Novel Image Retrieval Based on Visual Words Integration of SIFT and SURF.

PLoS One. 2016 Jun 17;11(6):e0157428. doi: 10.1371/journal.pone.0157428. eCollection 2016.

Considering the Spatial Layout Information of Bag of Features (BoF) Framework for Image Classification.

PLoS One. 2015 Jun 29;10(6):e0131164. doi: 10.1371/journal.pone.0131164. eCollection 2015.

Local coding based matching kernel method for image classification.

PLoS One. 2014 Aug 13;9(8):e103575. doi: 10.1371/journal.pone.0103575. eCollection 2014.

Co.Vi.Wo.: Color Visual Words Based on Non-Predefined Size Codebooks.

IEEE Trans Cybern. 2013 Feb;43(1):192-205. doi: 10.1109/TSMCB.2012.2203300. Epub 2012 Jul 3.

Generating descriptive visual words and visual phrases for large-scale image applications.

IEEE Trans Image Process. 2011 Sep;20(9):2664-77. doi: 10.1109/TIP.2011.2128333. Epub 2011 Mar 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于正交向量直方图的空间信息添加图像分类。

Image classification by addition of spatial information based on histograms of orthogonal vectors.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献