• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模网络数据支持的个人照片文本查询。

Textual query of personal photos facilitated by large-scale web data.

机构信息

School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, Blk N4, Singapore 639798.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):1022-36. doi: 10.1109/TPAMI.2010.142.

DOI:10.1109/TPAMI.2010.142
PMID:20714015
Abstract

The rapid popularization of digital cameras and mobile phone cameras has led to an explosive growth of personal photo collections by consumers. In this paper, we present a real-time textual query-based personal photo retrieval system by leveraging millions of Web images and their associated rich textual descriptions (captions, categories, etc.). After a user provides a textual query (e.g., “water”), our system exploits the inverted file to automatically find the positive Web images that are related to the textual query “water” as well as the negative Web images that are irrelevant to the textual query. Based on these automatically retrieved relevant and irrelevant Web images, we employ three simple but effective classification methods, k-Nearest Neighbor (kNN), decision stumps, and linear SVM, to rank personal photos. To further improve the photo retrieval performance, we propose two relevance feedback methods via cross-domain learning, which effectively utilize both the Web images and personal images. In particular, our proposed crossdomain learning methods can learn robust classifiers with only a very limited amount of labeled personal photos from the user by leveraging the prelearned linear SVM classifiers in real time. We further propose an incremental cross-domain learning method in order to significantly accelerate the relevance feedback process on large consumer photo databases. Extensive experiments on two consumer photo data sets demonstrate the effectiveness and efficiency of our system, which is also inherently not limited by any predefined lexicon.

摘要

数码相机和手机相机的迅速普及,导致消费者个人照片收藏呈爆炸式增长。在本文中,我们提出了一个实时基于文本查询的个人照片检索系统,该系统利用了数以百万计的网络图像及其相关的丰富文本描述(标题、类别等)。用户提供文本查询(例如“water”)后,我们的系统利用倒排文件自动找到与文本查询“water”相关的正例网络图像以及与文本查询不相关的负例网络图像。基于这些自动检索到的相关和不相关的网络图像,我们采用了三种简单而有效的分类方法,k-最近邻(kNN)、决策树桩和线性 SVM,对个人照片进行排序。为了进一步提高照片检索性能,我们通过跨域学习提出了两种相关反馈方法,有效地利用了网络图像和个人图像。特别是,我们提出的跨域学习方法可以通过实时利用预先学习的线性 SVM 分类器,仅使用用户提供的非常有限数量的带标注个人照片,学习到鲁棒的分类器。我们进一步提出了一种增量跨域学习方法,以便在大型消费者照片数据库上显著加速相关反馈过程。在两个消费者照片数据集上的广泛实验表明了我们系统的有效性和效率,而且它不受任何预定义词汇的限制。

相似文献

1
Textual query of personal photos facilitated by large-scale web data.大规模网络数据支持的个人照片文本查询。
IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):1022-36. doi: 10.1109/TPAMI.2010.142.
2
Improving Web image search by bag-based reranking.基于包的重新排序改进网络图像搜索。
IEEE Trans Image Process. 2011 Nov;20(11):3280-90. doi: 10.1109/TIP.2011.2159227. Epub 2011 Jun 9.
3
A unified relevance feedback framework for web image retrieval.一种用于网络图像检索的统一相关反馈框架。
IEEE Trans Image Process. 2009 Jun;18(6):1350-7. doi: 10.1109/TIP.2009.2017128. Epub 2009 Apr 7.
4
Visual event recognition in videos by learning from Web data.从网络数据中学习的视频中视觉事件识别。
IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1667-80. doi: 10.1109/TPAMI.2011.265.
5
A framework for querying a database for structural information on 3D images of macromolecules: A web-based query-by-content prototype on the BioImage macromolecular server.用于在数据库中查询大分子三维图像结构信息的框架:基于网络的BioImage大分子服务器内容查询原型。
J Struct Biol. 1999 Apr-May;125(2-3):112-22. doi: 10.1006/jsbi.1999.4102.
6
Annotating images by mining image search results.通过挖掘图像搜索结果来标注图像。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.
7
Modeling semantic aspects for cross-media image indexing.跨媒体图像索引的语义方面建模
IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1802-17. doi: 10.1109/TPAMI.2007.1097.
8
A boosting framework for visuality-preserving distance metric learning and its application to medical image retrieval.一种保持视觉保真度的距离度量学习的提升框架及其在医学图像检索中的应用。
IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):30-44. doi: 10.1109/TPAMI.2008.273.
9
Face photo-sketch synthesis and recognition.面部照片-素描合成与识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):1955-67. doi: 10.1109/TPAMI.2008.222.
10
Domain transfer multiple kernel learning.域迁移多核学习。
IEEE Trans Pattern Anal Mach Intell. 2012 Mar;34(3):465-79. doi: 10.1109/TPAMI.2011.114.

引用本文的文献

1
Personalized Image Classification by Semantic Embedding and Active Learning.基于语义嵌入和主动学习的个性化图像分类
Entropy (Basel). 2020 Nov 18;22(11):1314. doi: 10.3390/e22111314.