• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用组稀疏性的自动图像标注与检索

Automatic image annotation and retrieval using group sparsity.

作者信息

Zhang Shaoting, Huang Junzhou, Li Hongsheng, Metaxas Dimitris N

机构信息

Department of Computer Science, Rutgers University, Piscataway, NJ 08854-8019, USA.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):838-49. doi: 10.1109/TSMCB.2011.2179533. Epub 2012 Jan 10.

DOI:10.1109/TSMCB.2011.2179533
PMID:22249744
Abstract

Automatically assigning relevant text keywords to images is an important problem. Many algorithms have been proposed in the past decade and achieved good performance. Efforts have focused upon model representations of keywords, whereas properties of features have not been well investigated. In most cases, a group of features is preselected, yet important feature properties are not well used to select features. In this paper, we introduce a regularization-based feature selection algorithm to leverage both the sparsity and clustering properties of features, and incorporate it into the image annotation task. Using this group-sparsity-based method, the whole group of features [e.g., red green blue (RGB) or hue, saturation, and value (HSV)] is either selected or removed. Thus, we do not need to extract this group of features when new data comes. A novel approach is also proposed to iteratively obtain similar and dissimilar pairs from both the keyword similarity and the relevance feedback. Thus, keyword similarity is modeled in the annotation framework. We also show that our framework can be employed in image retrieval tasks by selecting different image pairs. Extensive experiments are designed to compare the performance between features, feature combinations, and regularization-based feature selection methods applied on the image annotation task, which gives insight into the properties of features in the image annotation task. The experimental results demonstrate that the group-sparsity-based method is more accurate and stable than others.

摘要

自动为图像分配相关文本关键词是一个重要问题。在过去十年中已经提出了许多算法并取得了良好性能。以往的工作主要集中在关键词的模型表示上,而特征的属性尚未得到充分研究。在大多数情况下,会预先选择一组特征,但重要的特征属性并未被很好地用于特征选择。在本文中,我们引入一种基于正则化的特征选择算法,以利用特征的稀疏性和聚类属性,并将其纳入图像标注任务。使用这种基于组稀疏性的方法,整个特征组(例如红绿蓝(RGB)或色调、饱和度和明度(HSV))要么被选中,要么被移除。因此,当新数据到来时,我们无需提取这组特征。还提出了一种新颖的方法,从关键词相似度和相关反馈中迭代地获取相似和不相似的图像对。这样,关键词相似度就在标注框架中得到了建模。我们还表明,通过选择不同的图像对,我们的框架可用于图像检索任务。我们设计了大量实验,以比较在图像标注任务中应用的特征、特征组合以及基于正则化的特征选择方法之间的性能,这有助于深入了解图像标注任务中特征的属性。实验结果表明,基于组稀疏性的方法比其他方法更准确、更稳定。

相似文献

1
Automatic image annotation and retrieval using group sparsity.使用组稀疏性的自动图像标注与检索
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):838-49. doi: 10.1109/TSMCB.2011.2179533. Epub 2012 Jan 10.
2
Localized content-based image retrieval.基于内容的局部图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1902-12. doi: 10.1109/TPAMI.2008.112.
3
Annotating images by mining image search results.通过挖掘图像搜索结果来标注图像。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1919-32. doi: 10.1109/TPAMI.2008.127.
4
Contextual kernel and spectral methods for learning the semantics of images.基于上下文核和谱方法的图像语义学习。
IEEE Trans Image Process. 2011 Jun;20(6):1739-50. doi: 10.1109/TIP.2010.2103082. Epub 2010 Dec 30.
5
Supervised learning of semantic classes for image annotation and retrieval.用于图像标注和检索的语义类别的监督学习。
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):394-410. doi: 10.1109/TPAMI.2007.61.
6
A discriminative kernel-based approach to rank images from text queries.一种基于判别核的方法,用于根据文本查询对图像进行排序。
IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1371-84. doi: 10.1109/TPAMI.2007.70791.
7
Design of multimodal dissimilarity spaces for retrieval of video documents.用于视频文档检索的多模态差异空间设计
IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1520-33. doi: 10.1109/TPAMI.2007.70801.
8
Integrating an automatic classification method into the medical image retrieval process.将一种自动分类方法集成到医学图像检索过程中。
AMIA Annu Symp Proc. 2008 Nov 6;2008:747-51.
9
Semantic-gap-oriented active learning for multilabel image annotation.面向语义鸿沟的多标签图像标注主动学习
IEEE Trans Image Process. 2012 Apr;21(4):2354-60. doi: 10.1109/TIP.2011.2180916. Epub 2011 Dec 21.
10
Universal and adapted vocabularies for generic visual categorization.用于通用视觉分类的通用和适应性词汇表。
IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1243-56. doi: 10.1109/TPAMI.2007.70755.

引用本文的文献

1
Joint Patch and Multi-label Learning for Facial Action Unit Detection.用于面部动作单元检测的联合补丁与多标签学习
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2015 Jun;2015:2207-2216. doi: 10.1109/CVPR.2015.7298833.
2
Multimodal entity coreference for cervical dysplasia diagnosis.多模态实体共指用于宫颈发育不良诊断。
IEEE Trans Med Imaging. 2015 Jan;34(1):229-45. doi: 10.1109/TMI.2014.2352311. Epub 2014 Aug 27.
3
Improving low-dose blood-brain barrier permeability quantification using sparse high-dose induced prior for Patlak model.
利用Patlak模型的稀疏高剂量诱导先验改进低剂量血脑屏障通透性定量分析
Med Image Anal. 2014 Aug;18(6):866-80. doi: 10.1016/j.media.2013.09.008. Epub 2013 Oct 17.
4
Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval.基于短语改写的已归档问题检索问题重述。
PLoS One. 2013 Jun 21;8(6):e64601. doi: 10.1371/journal.pone.0064601. Print 2013.