Computer Science Department, Xiamen University, Xiamen 361005, China.
IEEE Trans Image Process. 2012 Nov;21(11):4636-48. doi: 10.1109/TIP.2012.2207395. Epub 2012 Jul 10.
In image processing, cartoon character classification, retrieval, and synthesis are critical, so that cartoonists can effectively and efficiently make cartoons by reusing existing cartoon data. To successfully achieve these tasks, it is essential to extract visual features that comprehensively represent cartoon characters and to construct an accurate distance metric to precisely measure the dissimilarities between cartoon characters. In this paper, we introduce three visual features, color histogram, shape context, and skeleton, to characterize the color, shape, and action, respectively, of a cartoon character. These three features are complementary to each other, and each feature set is regarded as a single view. However, it is improper to concatenate these three features into a long vector, because they have different physical properties, and simply concatenating them into a high-dimensional feature vector will suffer from the so-called curse of dimensionality. Hence, we propose a semisupervised multiview distance metric learning (SSM-DML). SSM-DML learns the multiview distance metrics from multiple feature sets and from the labels of unlabeled cartoon characters simultaneously, under the umbrella of graph-based semisupervised learning. SSM-DML discovers complementary characteristics of different feature sets through an alternating optimization-based iterative algorithm. Therefore, SSM-DML can simultaneously accomplish cartoon character classification and dissimilarity measurement. On the basis of SSM-DML, we develop a novel system that composes the modules of multiview cartoon character classification, multiview graph-based cartoon synthesis, and multiview retrieval-based cartoon synthesis. Experimental evaluations based on the three modules suggest the effectiveness of SSM-DML in cartoon applications.
在图像处理中,卡通人物的分类、检索和合成至关重要,这样漫画家就可以通过重用现有的卡通数据来有效地、高效率地制作卡通。为了成功地完成这些任务,提取能够全面表示卡通人物的视觉特征,并构建精确的距离度量来准确地测量卡通人物之间的差异是至关重要的。在本文中,我们引入了三个视觉特征,即颜色直方图、形状上下文和骨架,分别用于描述卡通人物的颜色、形状和动作。这三个特征是互补的,每个特征集都被视为单个视图。然而,将这三个特征直接串联成一个长向量是不合适的,因为它们具有不同的物理属性,简单地将它们串联成一个高维特征向量会受到所谓的维度诅咒的影响。因此,我们提出了一种半监督多视图距离度量学习(SSM-DML)方法。SSM-DML 基于基于图的半监督学习,同时从多个特征集和未标记卡通人物的标签中学习多视图距离度量。SSM-DML 通过基于交替优化的迭代算法发现不同特征集的互补特征。因此,SSM-DML 可以同时完成卡通人物分类和相似度测量。在 SSM-DML 的基础上,我们开发了一个新的系统,该系统包含了多视图卡通人物分类、基于多视图的卡通人物合成以及基于多视图检索的卡通人物合成三个模块。基于这三个模块的实验评估表明了 SSM-DML 在卡通应用中的有效性。