Song Mofei
School of Computer Science and Engineering, Southeast University, Nanjing 211189, China.
Entropy (Basel). 2020 Nov 18;22(11):1314. doi: 10.3390/e22111314.
Currently, deep learning shows state-of-the-art performance in image classification with a pre-defined taxonomy. However, in more realistic scenarios, different users usually have different classification intents for the same image collection. To satisfy such personalized requirements, we propose an interactive image classification system with an offline representation learning stage and an online classification stage. During the offline stage, we learn a deep model that extracts features with higher flexibility and scalability across different users' preferences. Instead of training the model with inter-class discrimination alone, we also encode the similarity between the semantic-embedding vectors of the category labels into the model. This makes the extracted features adapt to multiple taxonomies of different granularities. During the online stage, an annotation task iteratively alternates with a high-throughput verification task. When performing the verification task, users only need to indicate incorrect predictions without giving the exact category labels. In each iteration, our system chooses the images to be annotated or verified by optimizing interaction efficiency. To maintain a high interaction rate, a unified active learning algorithm searches for the optimal annotation and verification sets by minimizing the expected time cost. After interactive annotation and verification, the newly classified images are used to train a customized classifier online, which reflects the user's adaptive categorization intent. The learned classifier is then used for subsequent annotation and verification tasks. Experimental results on several public image datasets show that our method outperforms existing methods.
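The idea of encoding label similarity into training can be illustrated with a minimal sketch. The paper does not give the exact formulation, so the following is an assumption: cosine similarities between category-label embeddings (e.g. word vectors) are turned into soft target distributions, and the classifier is trained against those soft targets instead of one-hot labels. The functions `soft_targets` and `semantic_ce_loss` and the temperature parameter are hypothetical names introduced here for illustration.

```python
import numpy as np

def soft_targets(label_embeddings, temperature=0.1):
    """Hypothetical sketch: convert cosine similarity between category
    label embeddings into per-class soft target distributions, so that
    semantically close categories share probability mass."""
    # Row-normalize embeddings, then compute the cosine-similarity matrix.
    e = label_embeddings / np.linalg.norm(label_embeddings, axis=1, keepdims=True)
    sim = e @ e.T
    # Temperature-scaled softmax over each row (numerically stabilized).
    z = sim / temperature
    z -= z.max(axis=1, keepdims=True)
    p = np.exp(z)
    return p / p.sum(axis=1, keepdims=True)

def semantic_ce_loss(logits, labels, targets):
    """Cross-entropy of the predicted distribution against the soft
    targets of each sample's true label."""
    z = logits - logits.max(axis=1, keepdims=True)
    logp = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -np.mean(np.sum(targets[labels] * logp, axis=1))
```

With a low temperature the targets stay close to one-hot (pure inter-class discrimination); raising it lets label semantics shape the feature space, which is what allows the same features to serve taxonomies of different granularities.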
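The trade-off behind the expected-time-cost minimization can also be sketched. The paper's unified active learning algorithm is not specified here, so this is a simplified per-image assumption: verifying a prediction is cheap but, when the prediction is wrong, a full annotation is still needed afterwards; the system picks whichever action has the lower expected time. The function name and the cost parameters `t_annotate` and `t_verify` are illustrative, not from the paper.

```python
def select_interaction(probs, t_annotate=5.0, t_verify=1.0):
    """Hypothetical per-image rule: compare the fixed cost of a full
    annotation with the expected cost of verifying the top-1 prediction,
    where a failed verification (probability 1 - p_max) is assumed to be
    followed by an annotation. probs is a list of per-class probability
    vectors; returns 'verify' or 'annotate' for each image."""
    actions = []
    for p in probs:
        p_max = max(p)
        # Expected cost of verify-first: one verification, plus an
        # annotation whenever the prediction turns out to be wrong.
        c_verify = t_verify + (1.0 - p_max) * t_annotate
        actions.append('verify' if c_verify < t_annotate else 'annotate')
    return actions
```

Confident predictions are routed to the high-throughput verification task, while uncertain ones go straight to annotation; minimizing this expected cost over the whole batch yields the annotation and verification sets for the next iteration.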