• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过挖掘图像搜索结果来标注图像。

Annotating images by mining image search results.

作者信息

Wang Xin-Jing, Zhang Lei, Li Xirong, Ma Wei-Ying

机构信息

Microsoft Research Asia, 4F Sigma Center, 49 Zhichun Road, Haidan District, Beijing 100190, PR China.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.

DOI:10.1109/TPAMI.2008.127
PMID:18787241
Abstract

Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

摘要

尽管计算机视觉和机器学习领域已经对图像标注进行了多年研究,但它仍远未达到实用阶段。在本文中,我们提出了一种无模型图像标注的全新尝试,这是一种通过挖掘图像搜索结果来对图像进行标注的数据驱动方法。我们从一些照片论坛收集了约240万张带有周边文本的图像来支持这种方法。整个过程被构建在一个分治框架中,其中在提供未加标题的图像时会给出一个查询关键词,以提高有效性和效率。当收集的数据集并非处处密集时,这很有帮助。从这个意义上讲,我们的方法包含三个步骤:1)搜索过程,以发现视觉和语义上相似的搜索结果;2)挖掘过程,从搜索结果的文本描述中识别显著术语;3)标注筛选过程,以过滤掉步骤2产生的噪声术语。为确保实时标注,我们利用了两项关键技术——一是将高维图像视觉特征映射为哈希码,另一是将其实现为分布式系统,其中搜索和挖掘过程作为网络服务提供。作为一个典型结果,整个过程在不到1秒内完成。由于无需训练数据集,我们的方法能够使用无限词汇进行标注,并且具有高度可扩展性且对异常值具有鲁棒性。在真实网络图像和基准图像数据集上的实验结果表明了所提算法的有效性和效率。还值得注意的是,尽管整个方法是在分治框架内阐述的,但查询关键词对我们当前的实现并非至关重要。我们提供了实验结果来证明这一点。

相似文献

1
Annotating images by mining image search results.通过挖掘图像搜索结果来标注图像。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.
2
Automatic semantic annotation of real-world web images.真实世界网络图像的自动语义标注
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1933-44. doi: 10.1109/TPAMI.2008.125.
3
VisualRank: applying PageRank to large-scale image search.视觉排名:将网页排名应用于大规模图像搜索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1877-90. doi: 10.1109/TPAMI.2008.121.
4
Real-time computerized annotation of pictures.图片的实时计算机化标注。
IEEE Trans Pattern Anal Mach Intell. 2008 Jun;30(6):985-1002. doi: 10.1109/TPAMI.2007.70847.
5
Document image retrieval through word shape coding.通过单词形状编码进行文档图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1913-8. doi: 10.1109/TPAMI.2008.89.
6
Localized content-based image retrieval.基于内容的局部图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1902-12. doi: 10.1109/TPAMI.2008.112.
7
A statistical framework for image category search from a mental picture.一种基于心理图像进行图像类别搜索的统计框架。
IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1087-101. doi: 10.1109/TPAMI.2008.259.
8
Geometry-based image retrieval in binary image databases.二值图像数据库中基于几何的图像检索
IEEE Trans Pattern Anal Mach Intell. 2008 Jun;30(6):1003-13. doi: 10.1109/TPAMI.2008.37.
9
Supervised learning of semantic classes for image annotation and retrieval.用于图像标注和检索的语义类别的监督学习。
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):394-410. doi: 10.1109/TPAMI.2007.61.
10
Content based image retrieval using unclean positive examples.使用不纯净正例的基于内容的图像检索。
IEEE Trans Image Process. 2009 Oct;18(10):2370-5. doi: 10.1109/TIP.2009.2026669. Epub 2009 Jul 6.