• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

去除场景文本的真正需求是什么?探索背景完整性和擦除彻底性属性。

What is the Real Need for Scene Text Removal? Exploring the Background Integrity and Erasure Exhaustivity Properties.

作者信息

Wang Yuxin, Xie Hongtao, Wang Zixiao, Qu Yadong, Zhang Yongdong

出版信息

IEEE Trans Image Process. 2023;32:4567-4580. doi: 10.1109/TIP.2023.3290517. Epub 2023 Aug 17.

DOI:10.1109/TIP.2023.3290517
PMID:37556339
Abstract

As a crucial application in privacy protection, scene text removal (STR) has received amounts of attention in recent years. However, existing approaches coarsely erasing texts from images ignore two important properties: the background texture integrity (BI) and the text erasure exhaustivity (EE). These two properties directly determine the erasure performance, and how to maintain them in a single network is the core problem for STR task. In this paper, we attribute the lack of BI and EE properties to the implicit erasure guidance and imbalanced multi-stage erasure respectively. To improve these two properties, we propose a new ProgrEssively Region-based scene Text eraser (PERT). There are three key contributions in our study. First, a novel explicit erasure guidance is proposed to enhance the BI property. Different from implicit erasure guidance modifying all the pixels in the entire image, our explicit one accurately performs stroke-level modification with only bounding-box level annotations. Second, a new balanced multi-stage erasure is constructed to improve the EE property. By balancing the learning difficulty and network structure among progressive stages, each stage takes an equal step towards the text-erased image to ensure the erasure exhaustivity. Third, we propose two new evaluation metrics called BI-metric and EE-metric, which make up the shortcomings of current evaluation tools in analyzing BI and EE properties. Compared with previous methods, PERT outperforms them by a large margin in both BI-metric ( ↑ 6.13 %) and EE-metric ( ↑ 1.9 %), obtaining SOTA results with high speed (71 FPS) and at least 25% lower parameter complexity. Code will be available at https://github.com/wangyuxin87/PERT.

摘要

作为隐私保护中的一项关键应用,场景文本去除(STR)近年来受到了大量关注。然而,现有的从图像中粗略擦除文本的方法忽略了两个重要特性:背景纹理完整性(BI)和文本擦除彻底性(EE)。这两个特性直接决定了擦除性能,而如何在单个网络中保持它们是STR任务的核心问题。在本文中,我们分别将BI和EE特性的缺失归因于隐式擦除引导和不平衡的多阶段擦除。为了改善这两个特性,我们提出了一种新的基于区域的渐进式场景文本擦除器(PERT)。我们的研究有三个关键贡献。首先,提出了一种新颖的显式擦除引导来增强BI特性。与修改整个图像中所有像素的隐式擦除引导不同,我们的显式擦除引导仅使用边界框级别的注释就能准确地进行笔画级别的修改。其次,构建了一种新的平衡多阶段擦除来改善EE特性。通过平衡渐进阶段之间的学习难度和网络结构,每个阶段朝着文本擦除后的图像迈出相等的步伐,以确保擦除彻底性。第三,我们提出了两个新的评估指标,称为BI指标和EE指标,弥补了当前评估工具在分析BI和EE特性方面的不足。与先前的方法相比,PERT在BI指标(提高6.13%)和EE指标(提高1.9%)方面都大幅优于它们,以高速(71帧每秒)获得了SOTA结果,并且参数复杂度至少降低了25%。代码将在https://github.com/wangyuxin87/PERT上提供。

相似文献

1
What is the Real Need for Scene Text Removal? Exploring the Background Integrity and Erasure Exhaustivity Properties.去除场景文本的真正需求是什么?探索背景完整性和擦除彻底性属性。
IEEE Trans Image Process. 2023;32:4567-4580. doi: 10.1109/TIP.2023.3290517. Epub 2023 Aug 17.
2
EraseNet: End-to-End Text Removal in the Wild.擦除网络:野外端到端文本擦除
IEEE Trans Image Process. 2020 Aug 28;PP. doi: 10.1109/TIP.2020.3018859.
3
Stroke-Based Scene Text Erasing Using Synthetic Data for Training.基于笔画的场景文本擦除:使用合成数据进行训练
IEEE Trans Image Process. 2021;30:9306-9320. doi: 10.1109/TIP.2021.3125260. Epub 2021 Nov 12.
4
A Scene-Text Synthesis Engine Achieved Through Learning From Decomposed Real-World Data.一种通过从分解的现实世界数据中学习实现的场景文本合成引擎。
IEEE Trans Image Process. 2023;32:5837-5851. doi: 10.1109/TIP.2023.3326685. Epub 2023 Nov 1.
5
Arbitrary Shape Text Detection via Segmentation With Probability Maps.基于概率图分割的任意形状文本检测。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2736-2750. doi: 10.1109/TPAMI.2022.3176122. Epub 2023 Feb 3.
6
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Composed Image Retrieval via Explicit Erasure and Replenishment With Semantic Alignment.通过显式擦除和语义对齐补充实现合成图像检索
IEEE Trans Image Process. 2022;31:5976-5988. doi: 10.1109/TIP.2022.3204213. Epub 2022 Sep 15.
9
Semi-Supervised Pixel-Level Scene Text Segmentation by Mutually Guided Network.基于相互引导网络的半监督像素级场景文本分割
IEEE Trans Image Process. 2021;30:8212-8221. doi: 10.1109/TIP.2021.3113157. Epub 2021 Sep 30.
10
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.ABINet++:面向场景文本定位的自主、双向和迭代语言建模。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7123-7141. doi: 10.1109/TPAMI.2022.3223908. Epub 2023 May 5.