Qiao Xiaotian, Cao Ying, Lau Rynson W H
IEEE Trans Vis Comput Graph. 2023 Sep;29(9):3922-3936. doi: 10.1109/TVCG.2022.3171839. Epub 2023 Aug 1.
With the goal of making contents easy to understand, memorize and share, a clear and easy-to-follow layout is important for visual notes. Unfortunately, since visual notes are often taken by the designers in real time while watching a video or listening to a presentation, the contents are usually not carefully structured, resulting in layouts that may be difficult for others to follow. In this article, we address this problem by proposing a novel approach to automatically optimize the layouts of visual notes. Our approach predicts the design order of a visual note and then warps the contents along the predicted design order such that the visual note can be easier to follow and understand. At the core of our approach is a learning-based framework to reason about the element-wise design orders of visual notes. In particular, we first propose a hierarchical LSTM-based architecture to predict a grid-based design order of the visual note, based on the graphical and textual information. We then derive the element-wise order from the grid-based prediction. Such an idea allows our network to be weakly-supervised, i.e., making it possible to predict dense grid-based orders from visual notes with only coarse annotations. We evaluate the effectiveness of our approach on visual notes with diverse content densities and layouts. The results show that our network can predict plausible design orders for various types of visual notes and our approach can effectively optimize their layouts in order for them to be easier to follow.
为了使内容易于理解、记忆和分享,清晰且易于遵循的布局对于视觉笔记很重要。不幸的是,由于视觉笔记通常是设计师在观看视频或听演讲时实时记录的,内容通常没有经过精心组织,导致布局可能让其他人难以理解。在本文中,我们通过提出一种新颖的方法来自动优化视觉笔记的布局来解决这个问题。我们的方法预测视觉笔记的设计顺序,然后按照预测的设计顺序对内容进行变形,以使视觉笔记更易于理解和遵循。我们方法的核心是一个基于学习的框架,用于推断视觉笔记的逐元素设计顺序。具体来说,我们首先提出一种基于分层长短期记忆网络(LSTM)的架构,根据图形和文本信息预测视觉笔记基于网格的设计顺序。然后,我们从基于网格的预测中得出逐元素顺序。这种想法使我们的网络能够进行弱监督,即仅通过粗略注释就能从视觉笔记中预测密集的基于网格的顺序。我们在具有不同内容密度和布局的视觉笔记上评估了我们方法的有效性。结果表明,我们的网络可以为各种类型的视觉笔记预测合理的设计顺序,并且我们的方法可以有效地优化它们的布局,使其更易于理解。