利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

机构信息

School of Computer Science and Technology, Huazhong University of Science & Technology, Wuhan, People's Republic of China.

出版信息

PLoS One. 2014 Jun 3;9(6):e98806. doi: 10.1371/journal.pone.0098806. eCollection 2014.

DOI:10.1371/journal.pone.0098806

PMID:24892288

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4043852/

Abstract

Mobile Visual Location Recognition (MVLR) has attracted a lot of researchers' attention in the past few years. Existing MVLR applications commonly use Query-by-Example (QBE) based image retrieval principle to fulfill the location recognition task. However, the QBE framework is not reliable enough due to the variations in the capture conditions and viewpoint changes between the query image and the database images. To solve the above problem, we make following contributions to the design of a panorama based on-device MVLR system. Firstly, we design a heading (from digital compass) aware BOF (Bag-of-features) model to generate the descriptors of panoramic images. Our approach fully considers the characteristics of the panoramic images and can facilitate the panorama based on-device MVLR to a large degree. Secondly, to search high dimensional visual descriptors directly on mobile devices, we propose an effective bilinear compressed sensing based encoding method. While being fast and accurate enough for on-device implementation, our algorithm can also reduce the memory usage of projection matrix significantly. Thirdly, we also release a panoramas database as well as a set of test panoramic quires which can be used as a new benchmark to facilitate further research in the area. Experimental results prove the effectiveness of the proposed methods for on-device MVLR applications.

摘要

移动视觉定位识别 (MVLR) 在过去几年中引起了许多研究人员的关注。现有的 MVLR 应用程序通常使用基于示例查询 (QBE) 的图像检索原理来完成定位识别任务。然而，由于查询图像和数据库图像之间的捕获条件和视角变化，QBE 框架不够可靠。为了解决上述问题，我们为基于设备的全景 MVLR 系统的设计做出了以下贡献。首先，我们设计了一个基于方向（来自数字指南针）感知的 BOF（特征袋）模型来生成全景图像的描述符。我们的方法充分考虑了全景图像的特点，可以在很大程度上促进基于设备的全景 MVLR。其次，为了在移动设备上直接搜索高维视觉描述符，我们提出了一种有效的基于双线性压缩感知的编码方法。我们的算法既快速又准确，足以在设备上实现，同时还可以显著减少投影矩阵的内存使用。第三，我们还发布了一个全景数据库以及一组测试全景查询，可以作为一个新的基准来促进该领域的进一步研究。实验结果证明了所提出的方法在基于设备的 MVLR 应用中的有效性。

相似文献

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

PLoS One. 2014 Jun 3;9(6):e98806. doi: 10.1371/journal.pone.0098806. eCollection 2014.

A Panoramic Localizer Based on Coarse-to-Fine Descriptors for Navigation Assistance.

Sensors (Basel). 2020 Jul 27;20(15):4177. doi: 10.3390/s20154177.

Optimal query-based relevance feedback in medical image retrieval using score fusion-based classification.

J Digit Imaging. 2015 Apr;28(2):160-78. doi: 10.1007/s10278-014-9730-z.

A unified framework for image retrieval using keyword and visual features.

IEEE Trans Image Process. 2005 Jul;14(7):979-89. doi: 10.1109/tip.2005.847289.

Gradually focused fine-grained sketch-based image retrieval.

PLoS One. 2019 May 28;14(5):e0217168. doi: 10.1371/journal.pone.0217168. eCollection 2019.

Panoramic appearance-based recognition of video contents using matching graphs.

IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):179-99. doi: 10.1109/tsmcb.2003.811770.

Annotating images by mining image search results.

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.

Texture-based medical image retrieval in compressed domain using compressive sensing.

Int J Bioinform Res Appl. 2014;10(2):129-44. doi: 10.1504/IJBRA.2014.059519.

Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform.

J Med Syst. 2018 Jan 25;42(3):44. doi: 10.1007/s10916-017-0880-7.

A new way for multidimensional medical data management: volume of interest (VOI)-based retrieval of medical images with visual and functional features.

IEEE Trans Inf Technol Biomed. 2006 Jul;10(3):598-607. doi: 10.1109/titb.2006.872045.

引用本文的文献

Estimating the position and orientation of a mobile robot with respect to a trajectory using omnidirectional imaging and global appearance.

PLoS One. 2017 May 2;12(5):e0175938. doi: 10.1371/journal.pone.0175938. eCollection 2017.

Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach.

PLoS One. 2015 Sep 14;10(9):e0133648. doi: 10.1371/journal.pone.0133648. eCollection 2015.

A Probabilistic Analysis of Sparse Coded Feature Pooling and Its Application for Image Retrieval.

PLoS One. 2015 Jul 1;10(7):e0131721. doi: 10.1371/journal.pone.0131721. eCollection 2015.

F-formation detection: individuating free-standing conversational groups in images.

PLoS One. 2015 May 21;10(5):e0123783. doi: 10.1371/journal.pone.0123783. eCollection 2015.

Correction method for line extraction in vision measurement.

PLoS One. 2015 May 18;10(5):e0127068. doi: 10.1371/journal.pone.0127068. eCollection 2015.

Ensemble learning for spatial interpolation of soil potassium content based on environmental information.

PLoS One. 2015 Apr 30;10(4):e0124383. doi: 10.1371/journal.pone.0124383. eCollection 2015.

A time-critical adaptive approach for visualizing natural scenes on different devices.

PLoS One. 2015 Feb 27;10(2):e0117586. doi: 10.1371/journal.pone.0117586. eCollection 2015.

Evaluation of the quantitative accuracy of 3D reconstruction of edentulous jaw models with jaw relation based on reference point system alignment.

PLoS One. 2015 Feb 6;10(2):e0117320. doi: 10.1371/journal.pone.0117320. eCollection 2015.

A combined approach to cartographic displacement for buildings based on skeleton and improved elastic beam algorithm.

PLoS One. 2014 Dec 3;9(12):e113953. doi: 10.1371/journal.pone.0113953. eCollection 2014.

本文引用的文献

Mining compact bag-of-patterns for low bit rate mobile visual search.

IEEE Trans Image Process. 2014 Jul;23(7):3099-113. doi: 10.1109/TIP.2014.2324291.

Visual-textual joint relevance learning for tag-based social image search.

IEEE Trans Image Process. 2013 Jan;22(1):363-76. doi: 10.1109/TIP.2012.2202676. Epub 2012 Jun 5.

3-D object retrieval and recognition with hypergraph analysis.

IEEE Trans Image Process. 2012 Sep;21(9):4290-303. doi: 10.1109/TIP.2012.2199502. Epub 2012 May 15.

Approximate nearest neighbor search by residual vector quantization.

Sensors (Basel). 2010;10(12):11259-73. doi: 10.3390/s101211259. Epub 2010 Dec 8.

Task-dependent visual-codebook compression.

IEEE Trans Image Process. 2012 Apr;21(4):2282-93. doi: 10.1109/TIP.2011.2176950. Epub 2011 Nov 22.

Product quantization for nearest neighbor search.

IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):117-28. doi: 10.1109/TPAMI.2010.57.

Optimally sparse representation in general (nonorthogonal) dictionaries via l minimization.

Proc Natl Acad Sci U S A. 2003 Mar 4;100(5):2197-202. doi: 10.1073/pnas.0437847100. Epub 2003 Feb 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

机构信息

School of Computer Science and Technology, Huazhong University of Science & Technology, Wuhan, People's Republic of China.

出版信息

PLoS One. 2014 Jun 3;9(6):e98806. doi: 10.1371/journal.pone.0098806. eCollection 2014.

DOI:10.1371/journal.pone.0098806

PMID:24892288

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4043852/

Abstract

摘要

利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献