Wang Yuxin, Xie Hongtao, Wang Zixiao, Qu Yadong, Zhang Yongdong
IEEE Trans Image Process. 2023;32:4567-4580. doi: 10.1109/TIP.2023.3290517. Epub 2023 Aug 17.
As a crucial application in privacy protection, scene text removal (STR) has received amounts of attention in recent years. However, existing approaches coarsely erasing texts from images ignore two important properties: the background texture integrity (BI) and the text erasure exhaustivity (EE). These two properties directly determine the erasure performance, and how to maintain them in a single network is the core problem for STR task. In this paper, we attribute the lack of BI and EE properties to the implicit erasure guidance and imbalanced multi-stage erasure respectively. To improve these two properties, we propose a new ProgrEssively Region-based scene Text eraser (PERT). There are three key contributions in our study. First, a novel explicit erasure guidance is proposed to enhance the BI property. Different from implicit erasure guidance modifying all the pixels in the entire image, our explicit one accurately performs stroke-level modification with only bounding-box level annotations. Second, a new balanced multi-stage erasure is constructed to improve the EE property. By balancing the learning difficulty and network structure among progressive stages, each stage takes an equal step towards the text-erased image to ensure the erasure exhaustivity. Third, we propose two new evaluation metrics called BI-metric and EE-metric, which make up the shortcomings of current evaluation tools in analyzing BI and EE properties. Compared with previous methods, PERT outperforms them by a large margin in both BI-metric ( ↑ 6.13 %) and EE-metric ( ↑ 1.9 %), obtaining SOTA results with high speed (71 FPS) and at least 25% lower parameter complexity. Code will be available at https://github.com/wangyuxin87/PERT.
作为隐私保护中的一项关键应用,场景文本去除(STR)近年来受到了大量关注。然而,现有的从图像中粗略擦除文本的方法忽略了两个重要特性:背景纹理完整性(BI)和文本擦除彻底性(EE)。这两个特性直接决定了擦除性能,而如何在单个网络中保持它们是STR任务的核心问题。在本文中,我们分别将BI和EE特性的缺失归因于隐式擦除引导和不平衡的多阶段擦除。为了改善这两个特性,我们提出了一种新的基于区域的渐进式场景文本擦除器(PERT)。我们的研究有三个关键贡献。首先,提出了一种新颖的显式擦除引导来增强BI特性。与修改整个图像中所有像素的隐式擦除引导不同,我们的显式擦除引导仅使用边界框级别的注释就能准确地进行笔画级别的修改。其次,构建了一种新的平衡多阶段擦除来改善EE特性。通过平衡渐进阶段之间的学习难度和网络结构,每个阶段朝着文本擦除后的图像迈出相等的步伐,以确保擦除彻底性。第三,我们提出了两个新的评估指标,称为BI指标和EE指标,弥补了当前评估工具在分析BI和EE特性方面的不足。与先前的方法相比,PERT在BI指标(提高6.13%)和EE指标(提高1.9%)方面都大幅优于它们,以高速(71帧每秒)获得了SOTA结果,并且参数复杂度至少降低了25%。代码将在https://github.com/wangyuxin87/PERT上提供。