Fan Jianping, Gao Yuli, Luo Hangzai
Department of Computer Science, University of North Carolina, Charlotte, NC 28223, USA.
IEEE Trans Image Process. 2008 Mar;17(3):407-26. doi: 10.1109/TIP.2008.916999.
In this paper, we have developed a new scheme for achieving multilevel annotations of large-scale images automatically. To achieve more sufficient representation of various visual properties of the images, both the global visual features and the local visual features are extracted for image content representation. To tackle the problem of huge intraconcept visual diversity, multiple types of kernels are integrated to characterize the diverse visual similarity relationships between the images more precisely, and a multiple kernel learning algorithm is developed for SVM image classifier training. To address the problem of huge interconcept visual similarity, a novel multitask learning algorithm is developed to learn the correlated classifiers for the sibling image concepts under the same parent concept and enhance their discrimination and adaptation power significantly. To tackle the problem of huge intraconcept visual diversity for the image concepts at the higher levels of the concept ontology, a novel hierarchical boosting algorithm is developed to learn their ensemble classifiers hierarchically. In order to assist users on selecting more effective hypotheses for image classifier training, we have developed a novel hyperbolic framework for large-scale image visualization and interactive hypotheses assessment. Our experiments on large-scale image collections have also obtained very positive results.
在本文中,我们开发了一种新的方案来自动实现大规模图像的多级标注。为了更充分地表示图像的各种视觉属性,提取了全局视觉特征和局部视觉特征用于图像内容表示。为了解决概念内视觉多样性巨大的问题,集成了多种类型的核以更精确地表征图像之间多样的视觉相似关系,并开发了一种多核学习算法用于支持向量机(SVM)图像分类器训练。为了解决概念间视觉相似性巨大的问题,开发了一种新颖的多任务学习算法来学习同一父概念下兄弟图像概念的相关分类器,并显著增强它们的辨别力和适应能力。为了解决概念本体较高层次的图像概念的概念内视觉多样性巨大的问题,开发了一种新颖的分层提升算法来分层学习它们的集成分类器。为了帮助用户为图像分类器训练选择更有效的假设,我们开发了一种用于大规模图像可视化和交互式假设评估的新颖双曲框架。我们在大规模图像集上的实验也取得了非常积极的成果。