A statistical framework for image category search from a mental picture.

Ferecatu Marin, Geman Donald

TSI Department, Institut Telecom, Telecom Paristech, 46, rue Barrault, 75634 Paris, France.

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1087-101. doi: 10.1109/TPAMI.2008.259.

Starting from a member of an image database designated the "query image," traditional image retrieval techniques, for example, search by visual similarity, allow one to locate additional instances of a target category residing in the database. However, in many cases, the query image or, more generally, the target category, resides only in the mind of the user as a set of subjective visual patterns, psychological impressions, or "mental pictures." Consequently, since image databases available today are often unstructured and lack reliable semantic annotations, it is often not obvious how to initiate a search session; this is the "page zero problem." We propose a new statistical framework based on relevance feedback to locate an instance of a semantic category in an unstructured image database with no semantic annotation. A search session is initiated from a random sample of images. At each retrieval round, the user is asked to select one image from among a set of displayed images: the one that is, in the user's opinion, closest to the target class. The matching is then "mental." Performance is measured by the number of iterations necessary to display an image that satisfies the user, at which point standard techniques can be employed to display other instances. Our core contribution is a Bayesian formulation that scales to large databases. The two key components are a response model that accounts for the user's subjective perception of similarity and a display algorithm that seeks to maximize the flow of information. Experiments with real users and two databases of 20,000 and 60,000 images demonstrate the efficiency of the search process.
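
The relevance-feedback loop described in the abstract (display a set of images, let the user pick the closest one, update a belief over possible targets) can be pictured with the minimal sketch below. It is not the authors' implementation. The softmax response model, the Euclidean distance on toy feature vectors, the `sigma` parameter, the posterior-weighted sampling used to choose the display (a simple stand-in for the paper's information-maximizing display algorithm), and the simulated user are all assumptions made for illustration.

```python
import math
import random


def distance(a, b):
    """Euclidean distance between two feature vectors (assumed similarity measure)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def response_prob(choice, display, target, feats, sigma):
    """Assumed response model: P(user picks `choice` from `display` | target).

    A softmax over negative distances to the target; a smaller `sigma`
    models a user whose choices track distance more consistently.
    """
    weights = [math.exp(-distance(feats[i], feats[target]) / sigma) for i in display]
    return weights[display.index(choice)] / sum(weights)


def update_posterior(posterior, choice, display, feats, sigma=1.0):
    """Bayes rule: weight each candidate target by the likelihood of the
    observed choice, then renormalize so the posterior sums to one."""
    unnorm = [p * response_prob(choice, display, t, feats, sigma)
              for t, p in enumerate(posterior)]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def pick_display(posterior, k, rng):
    """Hypothetical display rule: draw k distinct images with probability
    proportional to their current posterior mass."""
    candidates = list(range(len(posterior)))
    weights = list(posterior)
    chosen = []
    for _ in range(k):
        idx = rng.choices(range(len(candidates)), weights=weights)[0]
        chosen.append(candidates.pop(idx))
        weights.pop(idx)
    return chosen


def run_session(feats, target, k=4, rounds=5, sigma=0.2, seed=0):
    """Simulate a few feedback rounds with an idealized user who always
    picks the displayed image nearest to the hidden target."""
    rng = random.Random(seed)
    posterior = [1.0 / len(feats)] * len(feats)  # uniform prior over targets
    for _ in range(rounds):
        display = pick_display(posterior, k, rng)
        choice = min(display, key=lambda i: distance(feats[i], feats[target]))
        posterior = update_posterior(posterior, choice, display, feats, sigma)
    return posterior
```

Calling `run_session(feats, target)` on a list of toy feature vectors returns the posterior over candidate targets after the simulated rounds. Keeping the response model and the display rule as separate functions mirrors the two components named in the abstract, so either one can be swapped out independently.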


Similar articles

A statistical framework for image category search from a mental picture.

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1087-101. doi: 10.1109/TPAMI.2008.259.

Annotating images by mining image search results.

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.

Integrating relevance feedback techniques for image retrieval using reinforcement learning.

IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1536-51. doi: 10.1109/TPAMI.2005.201.

Attention-based dynamic visual search using inner-scene similarity: algorithms and bounds.

IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):251-64. doi: 10.1109/TPAMI.2006.28.

Effective proximity retrieval by ordering permutations.

IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1647-58. doi: 10.1109/TPAMI.2007.70815.

Content based image retrieval using unclean positive examples.

IEEE Trans Image Process. 2009 Oct;18(10):2370-5. doi: 10.1109/TIP.2009.2026669. Epub 2009 Jul 6.

VisualRank: applying PageRank to large-scale image search.

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1877-90. doi: 10.1109/TPAMI.2008.121.

An efficient Earth Mover's Distance algorithm for robust histogram comparison.

IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):840-53. doi: 10.1109/TPAMI.2007.1058.

Toward objective evaluation of image segmentation algorithms.

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):929-44. doi: 10.1109/TPAMI.2007.1046.

Medical image retrieval with probabilistic multi-class support vector machine classifiers and adaptive similarity fusion.

Comput Med Imaging Graph. 2008 Mar;32(2):95-108. doi: 10.1016/j.compmedimag.2007.10.001. Epub 2007 Nov 26.

Cited by

Obtaining psychological embeddings through joint kernel and metric learning.

Behav Res Methods. 2019 Oct;51(5):2180-2193. doi: 10.3758/s13428-019-01285-3.
