Yu Jun, Liu Dongquan, Tao Dacheng, Seah Hock Soon
Department of Computer Science, Xiamen University, Xiamen 361005, China.
IEEE Trans Syst Man Cybern B Cybern. 2012 Oct;42(5):1413-27. doi: 10.1109/TSMCB.2012.2192108. Epub 2012 Apr 25.
How do we retrieve cartoon characters accurately? Or how to synthesize new cartoon clips smoothly and efficiently from the cartoon library? Both questions are important for animators and cartoon enthusiasts to design and create new cartoons by utilizing existing cartoon materials. The first key issue to answer those questions is to find a proper representation that describes the cartoon character effectively. In this paper, we consider multiple features from different views, i.e., color histogram, Hausdorff edge feature, and skeleton feature, to represent cartoon characters with different colors, shapes, and gestures. Each visual feature reflects a unique characteristic of a cartoon character, and they are complementary to each other for retrieval and synthesis. However, how to combine the three visual features is the second key issue of our application. By simply concatenating them into a long vector, it will end up with the so-called "curse of dimensionality," let alone their heterogeneity embedded in different visual feature spaces. Here, we introduce a semisupervised multiview subspace learning (semi-MSL) algorithm, to encode different features in a unified space. Specifically, under the patch alignment framework, semi-MSL uses the discriminative information from labeled cartoon characters in the construction of local patches where the manifold structure revealed by unlabeled cartoon characters is utilized to capture the geometric distribution. The experimental evaluations based on both cartoon character retrieval and clip synthesis demonstrate the effectiveness of the proposed method for cartoon application. Moreover, additional results of content-based image retrieval on benchmark data suggest the generality of semi-MSL for other applications.
我们如何准确检索卡通角色?或者如何从卡通库中流畅且高效地合成新的卡通片段?这两个问题对于动画师和卡通爱好者利用现有的卡通素材来设计和创作新卡通来说都很重要。回答这些问题的第一个关键问题是找到一种能有效描述卡通角色的合适表示方法。在本文中,我们从不同视角考虑多种特征,即颜色直方图、豪斯多夫边缘特征和骨架特征,以表示具有不同颜色、形状和姿态的卡通角色。每个视觉特征都反映了卡通角色的一个独特特性,并且它们在检索和合成方面相互补充。然而,如何将这三个视觉特征结合起来是我们应用的第二个关键问题。通过简单地将它们连接成一个长向量,最终会导致所谓的“维数灾难”,更不用说它们嵌入在不同视觉特征空间中的异质性了。在此,我们引入一种半监督多视角子空间学习(semi - MSL)算法,以便在统一空间中对不同特征进行编码。具体而言,在补丁对齐框架下,semi - MSL在构建局部补丁时利用来自有标签卡通角色的判别信息,同时利用无标签卡通角色揭示的流形结构来捕获几何分布。基于卡通角色检索和片段合成的实验评估证明了所提方法在卡通应用中的有效性。此外,在基准数据上基于内容的图像检索的额外结果表明semi - MSL在其他应用中的通用性。