Zheng Yixiao, Pang Kaiyue, Das Ayan, Chang Dongliang, Song Yi-Zhe, Ma Zhanyu
IEEE Trans Image Process. 2024;33:2266-2278. doi: 10.1109/TIP.2024.3374196. Epub 2024 Mar 21.
The problem of sketch semantic segmentation is far from being solved. Despite existing methods exhibiting near-saturating performances on simple sketches with high recognisability, they suffer serious setbacks when the target sketches are products of an imaginative process with high degree of creativity. We hypothesise that human creativity, being highly individualistic, induces a significant shift in distribution of sketches, leading to poor model generalisation. Such hypothesis, backed by empirical evidences, opens the door for a solution that explicitly disentangles creativity while learning sketch representations. We materialise this by crafting a learnable creativity estimator that assigns a scalar score of creativity to each sketch. It follows that we introduce CreativeSeg, a learning-to-learn framework that leverages the estimator in order to learn creativity-agnostic representation, and eventually the downstream semantic segmentation task. We empirically verify the superiority of CreativeSeg on the recent "Creative Birds" and "Creative Creatures" creative sketch datasets. Through a human study, we further strengthen the case that the learned creativity score does indeed have a positive correlation with the subjective creativity of human. Codes are available at https://github.com/PRIS-CV/Sketch-CS.
草图语义分割问题远未得到解决。尽管现有方法在具有高可识别性的简单草图上表现出接近饱和的性能,但当目标草图是具有高度创造性的想象过程的产物时,它们会遭受严重挫折。我们假设,高度个性化的人类创造力会导致草图分布发生显著变化,从而导致模型泛化能力较差。这一假设得到了实证证据的支持,为一种在学习草图表示时明确区分创造力的解决方案打开了大门。我们通过构建一个可学习的创造力估计器来实现这一点,该估计器为每个草图分配一个创造力标量分数。在此基础上,我们引入了CreativeSeg,这是一个学习学习框架,它利用该估计器来学习与创造力无关的表示,并最终完成下游语义分割任务。我们通过实验验证了CreativeSeg在最近的“创意鸟类”和“创意生物”创意草图数据集上的优越性。通过一项人类研究,我们进一步证明了所学习的创造力分数确实与人类的主观创造力呈正相关。代码可在https://github.com/PRIS-CV/Sketch-CS获取。