• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。

On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.

机构信息

School of Computer Science and Technology, Huazhong University of Science & Technology, Wuhan, People's Republic of China.

出版信息

PLoS One. 2014 Jun 3;9(6):e98806. doi: 10.1371/journal.pone.0098806. eCollection 2014.

DOI:10.1371/journal.pone.0098806
PMID:24892288
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4043852/
Abstract

Mobile Visual Location Recognition (MVLR) has attracted a lot of researchers' attention in the past few years. Existing MVLR applications commonly use Query-by-Example (QBE) based image retrieval principle to fulfill the location recognition task. However, the QBE framework is not reliable enough due to the variations in the capture conditions and viewpoint changes between the query image and the database images. To solve the above problem, we make following contributions to the design of a panorama based on-device MVLR system. Firstly, we design a heading (from digital compass) aware BOF (Bag-of-features) model to generate the descriptors of panoramic images. Our approach fully considers the characteristics of the panoramic images and can facilitate the panorama based on-device MVLR to a large degree. Secondly, to search high dimensional visual descriptors directly on mobile devices, we propose an effective bilinear compressed sensing based encoding method. While being fast and accurate enough for on-device implementation, our algorithm can also reduce the memory usage of projection matrix significantly. Thirdly, we also release a panoramas database as well as a set of test panoramic quires which can be used as a new benchmark to facilitate further research in the area. Experimental results prove the effectiveness of the proposed methods for on-device MVLR applications.

摘要

移动视觉定位识别 (MVLR) 在过去几年中引起了许多研究人员的关注。现有的 MVLR 应用程序通常使用基于示例查询 (QBE) 的图像检索原理来完成定位识别任务。然而,由于查询图像和数据库图像之间的捕获条件和视角变化,QBE 框架不够可靠。为了解决上述问题,我们为基于设备的全景 MVLR 系统的设计做出了以下贡献。首先,我们设计了一个基于方向(来自数字指南针)感知的 BOF(特征袋)模型来生成全景图像的描述符。我们的方法充分考虑了全景图像的特点,可以在很大程度上促进基于设备的全景 MVLR。其次,为了在移动设备上直接搜索高维视觉描述符,我们提出了一种有效的基于双线性压缩感知的编码方法。我们的算法既快速又准确,足以在设备上实现,同时还可以显著减少投影矩阵的内存使用。第三,我们还发布了一个全景数据库以及一组测试全景查询,可以作为一个新的基准来促进该领域的进一步研究。实验结果证明了所提出的方法在基于设备的 MVLR 应用中的有效性。

相似文献

1
On-device mobile visual location recognition by using panoramic images and compressed sensing based visual descriptors.利用全景图像和基于压缩感知的视觉描述符进行设备上的移动视觉位置识别。
PLoS One. 2014 Jun 3;9(6):e98806. doi: 10.1371/journal.pone.0098806. eCollection 2014.
2
A Panoramic Localizer Based on Coarse-to-Fine Descriptors for Navigation Assistance.基于由粗到精描述符的全景定位器,用于导航辅助。
Sensors (Basel). 2020 Jul 27;20(15):4177. doi: 10.3390/s20154177.
3
Optimal query-based relevance feedback in medical image retrieval using score fusion-based classification.基于分数融合分类的医学图像检索中基于查询的最优相关反馈
J Digit Imaging. 2015 Apr;28(2):160-78. doi: 10.1007/s10278-014-9730-z.
4
A unified framework for image retrieval using keyword and visual features.一种使用关键词和视觉特征进行图像检索的统一框架。
IEEE Trans Image Process. 2005 Jul;14(7):979-89. doi: 10.1109/tip.2005.847289.
5
Gradually focused fine-grained sketch-based image retrieval.逐渐聚焦的细粒度基于草图的图像检索。
PLoS One. 2019 May 28;14(5):e0217168. doi: 10.1371/journal.pone.0217168. eCollection 2019.
6
Panoramic appearance-based recognition of video contents using matching graphs.使用匹配图基于全景外观对视频内容进行识别。
IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):179-99. doi: 10.1109/tsmcb.2003.811770.
7
Annotating images by mining image search results.通过挖掘图像搜索结果来标注图像。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.
8
Texture-based medical image retrieval in compressed domain using compressive sensing.基于纹理的医学图像在压缩域中利用压缩感知进行检索。
Int J Bioinform Res Appl. 2014;10(2):129-44. doi: 10.1504/IJBRA.2014.059519.
9
Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform.基于颜色描述符和离散小波变换的图像检索。
J Med Syst. 2018 Jan 25;42(3):44. doi: 10.1007/s10916-017-0880-7.
10
A new way for multidimensional medical data management: volume of interest (VOI)-based retrieval of medical images with visual and functional features.多维医学数据管理的新方法:基于感兴趣体积(VOI)的具有视觉和功能特征的医学图像检索。
IEEE Trans Inf Technol Biomed. 2006 Jul;10(3):598-607. doi: 10.1109/titb.2006.872045.

引用本文的文献

1
Estimating the position and orientation of a mobile robot with respect to a trajectory using omnidirectional imaging and global appearance.使用全向成像和全局外观估计移动机器人相对于轨迹的位置和方向。
PLoS One. 2017 May 2;12(5):e0175938. doi: 10.1371/journal.pone.0175938. eCollection 2017.
2
Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach.使用尺度、旋转和位置不变方法对普什图文草体进行稳健光学识别。
PLoS One. 2015 Sep 14;10(9):e0133648. doi: 10.1371/journal.pone.0133648. eCollection 2015.
3
A Probabilistic Analysis of Sparse Coded Feature Pooling and Its Application for Image Retrieval.

本文引用的文献

1
Mining compact bag-of-patterns for low bit rate mobile visual search.挖掘紧凑的模式袋以用于低比特率移动视觉搜索。
IEEE Trans Image Process. 2014 Jul;23(7):3099-113. doi: 10.1109/TIP.2014.2324291.
2
Visual-textual joint relevance learning for tag-based social image search.基于标签的社会图像搜索中的视觉-文本联合相关性学习。
IEEE Trans Image Process. 2013 Jan;22(1):363-76. doi: 10.1109/TIP.2012.2202676. Epub 2012 Jun 5.
3
3-D object retrieval and recognition with hypergraph analysis.基于超图分析的三维目标检索与识别。
稀疏编码特征池化的概率分析及其在图像检索中的应用
PLoS One. 2015 Jul 1;10(7):e0131721. doi: 10.1371/journal.pone.0131721. eCollection 2015.
4
F-formation detection: individuating free-standing conversational groups in images.F形编队检测:在图像中识别独立的对话群组。
PLoS One. 2015 May 21;10(5):e0123783. doi: 10.1371/journal.pone.0123783. eCollection 2015.
5
Correction method for line extraction in vision measurement.视觉测量中直线提取的校正方法。
PLoS One. 2015 May 18;10(5):e0127068. doi: 10.1371/journal.pone.0127068. eCollection 2015.
6
Ensemble learning for spatial interpolation of soil potassium content based on environmental information.基于环境信息的土壤钾含量空间插值集成学习
PLoS One. 2015 Apr 30;10(4):e0124383. doi: 10.1371/journal.pone.0124383. eCollection 2015.
7
A time-critical adaptive approach for visualizing natural scenes on different devices.一种用于在不同设备上可视化自然场景的时间关键型自适应方法。
PLoS One. 2015 Feb 27;10(2):e0117586. doi: 10.1371/journal.pone.0117586. eCollection 2015.
8
Evaluation of the quantitative accuracy of 3D reconstruction of edentulous jaw models with jaw relation based on reference point system alignment.基于参考点系统对齐的无牙颌模型与颌关系三维重建定量准确性评估。
PLoS One. 2015 Feb 6;10(2):e0117320. doi: 10.1371/journal.pone.0117320. eCollection 2015.
9
A combined approach to cartographic displacement for buildings based on skeleton and improved elastic beam algorithm.一种基于骨架和改进弹性梁算法的建筑物地图位移组合方法。
PLoS One. 2014 Dec 3;9(12):e113953. doi: 10.1371/journal.pone.0113953. eCollection 2014.
IEEE Trans Image Process. 2012 Sep;21(9):4290-303. doi: 10.1109/TIP.2012.2199502. Epub 2012 May 15.
4
Approximate nearest neighbor search by residual vector quantization.基于残差向量量化的近似最近邻搜索。
Sensors (Basel). 2010;10(12):11259-73. doi: 10.3390/s101211259. Epub 2010 Dec 8.
5
Task-dependent visual-codebook compression.任务相关的视觉码本压缩。
IEEE Trans Image Process. 2012 Apr;21(4):2282-93. doi: 10.1109/TIP.2011.2176950. Epub 2011 Nov 22.
6
Product quantization for nearest neighbor search.基于乘积量化的最近邻搜索。
IEEE Trans Pattern Anal Mach Intell. 2011 Jan;33(1):117-28. doi: 10.1109/TPAMI.2010.57.
7
Optimally sparse representation in general (nonorthogonal) dictionaries via l minimization.通过 l 最小化实现一般(非正交)字典中的最优稀疏表示。
Proc Natl Acad Sci U S A. 2003 Mar 4;100(5):2197-202. doi: 10.1073/pnas.0437847100. Epub 2003 Feb 21.