• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将局部图像描述符聚合到紧凑代码中。

Aggregating local image descriptors into compact codes.

机构信息

INRIA, Campus de Beaulieu, 35042 Rennes, France.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1704-16. doi: 10.1109/TPAMI.2011.235.

DOI:10.1109/TPAMI.2011.235
PMID:22156101
Abstract

This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image data set takes about 250 ms on one processor core.

摘要

本文解决了大规模图像搜索问题。需要考虑三个约束条件:搜索精度、效率和内存使用。我们首先提出并评估了将局部图像描述符聚合到向量中的不同方法,并表明在任何给定的向量维度下,Fisher 核的性能都优于参考的视觉词汇袋方法。然后,我们联合优化降维和索引,以获得精确的向量比较和紧凑的表示。评估结果表明,在保持高精度的同时,图像表示可以减少到几十个字节。在一个处理器核上搜索一个包含 1 亿张图像的数据集大约需要 250 毫秒。

相似文献

1
Aggregating local image descriptors into compact codes.将局部图像描述符聚合到紧凑代码中。
IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1704-16. doi: 10.1109/TPAMI.2011.235.
2
Mixture of Subspaces Image Representation and Compact Coding for Large-Scale Image Retrieval.子空间图像表示与紧凑编码的混合方法在大规模图像检索中的应用。
IEEE Trans Pattern Anal Mach Intell. 2015 Jul;37(7):1469-79. doi: 10.1109/TPAMI.2014.2382092.
3
Mining compact bag-of-patterns for low bit rate mobile visual search.挖掘紧凑的模式袋以用于低比特率移动视觉搜索。
IEEE Trans Image Process. 2014 Jul;23(7):3099-113. doi: 10.1109/TIP.2014.2324291.
4
Generating descriptive visual words and visual phrases for large-scale image applications.生成描述性视觉词汇和视觉短语,用于大规模图像应用。
IEEE Trans Image Process. 2011 Sep;20(9):2664-77. doi: 10.1109/TIP.2011.2128333. Epub 2011 Mar 17.
5
Accurate image search using the contextual dissimilarity measure.基于上下文差异测度的精确图像搜索。
IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):2-11. doi: 10.1109/TPAMI.2008.285.
6
Cross-indexing of binary SIFT codes for large-scale image search.二进制 SIFT 代码的交叉索引在大规模图像搜索中的应用。
IEEE Trans Image Process. 2014 May;23(5):2047-57. doi: 10.1109/TIP.2014.2312283.
7
Improving Large-Scale Image Retrieval Through Robust Aggregation of Local Descriptors.通过稳健的局部描述符聚合来改进大规模图像检索。
IEEE Trans Pattern Anal Mach Intell. 2017 Sep;39(9):1783-1796. doi: 10.1109/TPAMI.2016.2613873. Epub 2016 Sep 27.
8
Coding of amino acids by texture descriptors.基于纹理特征的氨基酸编码。
Artif Intell Med. 2010 Jan;48(1):43-50. doi: 10.1016/j.artmed.2009.10.001. Epub 2009 Nov 4.
9
Modeling semantic aspects for cross-media image indexing.跨媒体图像索引的语义方面建模
IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1802-17. doi: 10.1109/TPAMI.2007.1097.
10
Interferences in Match Kernels.匹配核中的干扰。
IEEE Trans Pattern Anal Mach Intell. 2017 Sep;39(9):1797-1810. doi: 10.1109/TPAMI.2016.2615621. Epub 2016 Oct 6.

引用本文的文献

1
DINO-Mix enhancing visual place recognition with foundational vision model and feature mixing.DINO-Mix通过基础视觉模型和特征混合增强视觉场所识别。
Sci Rep. 2024 Sep 27;14(1):22100. doi: 10.1038/s41598-024-73853-3.
2
Res2Net-based multi-scale and multi-attention model for traffic scene image classification.基于 Res2Net 的交通场景图像分类的多尺度和多注意力模型。
PLoS One. 2024 May 20;19(5):e0300017. doi: 10.1371/journal.pone.0300017. eCollection 2024.
3
Duplex-Hierarchy Representation Learning for Remote Sensing Image Classification.
用于遥感图像分类的双层次表示学习
Sensors (Basel). 2024 Feb 9;24(4):1130. doi: 10.3390/s24041130.
4
Contextual Patch-NetVLAD: Context-Aware Patch Feature Descriptor and Patch Matching Mechanism for Visual Place Recognition.上下文补丁网络局部聚合描述符:用于视觉场所识别的上下文感知补丁特征描述符和补丁匹配机制
Sensors (Basel). 2024 Jan 28;24(3):855. doi: 10.3390/s24030855.
5
Loop Closure Detection Method Based on Similarity Differences between Image Blocks.基于图像块相似性差异的闭环检测方法
Sensors (Basel). 2023 Oct 22;23(20):8632. doi: 10.3390/s23208632.
6
Fusion of Appearance and Motion Features for Daily Activity Recognition from Egocentric Perspective.从自我中心视角融合外观与运动特征进行日常活动识别
Sensors (Basel). 2023 Jul 30;23(15):6804. doi: 10.3390/s23156804.
7
Artificial intelligence and digital biomarker in precision pathology guiding immune therapy selection and precision oncology.人工智能和数字生物标志物在精准病理学指导免疫治疗选择和精准肿瘤学中的应用。
Cancer Rep (Hoboken). 2023 Jul;6(7):e1796. doi: 10.1002/cnr2.1796. Epub 2023 Feb 22.
8
Maturity Grading and Identification of Fruit Based on Unsupervised Image Clustering.基于无监督图像聚类的水果成熟度分级与识别
Foods. 2022 Nov 25;11(23):3800. doi: 10.3390/foods11233800.
9
SIFT-CNN: When Convolutional Neural Networks Meet Dense SIFT Descriptors for Image and Sequence Classification.SIFT-CNN:当卷积神经网络与密集SIFT描述符相遇用于图像和序列分类时。
J Imaging. 2022 Sep 21;8(10):256. doi: 10.3390/jimaging8100256.
10
Loop Closure Detection Based on Residual Network and Capsule Network for Mobile Robot.基于残差网络和胶囊网络的移动机器人闭环检测。
Sensors (Basel). 2022 Sep 21;22(19):7137. doi: 10.3390/s22197137.