• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于任意形状场景文本检测的模糊语义

Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection.

作者信息

Wang Fangfang, Xu Xiaogang, Chen Yifeng, Li Xi

出版信息

IEEE Trans Image Process. 2023;32:1-12. doi: 10.1109/TIP.2022.3201467. Epub 2022 Dec 19.

DOI:10.1109/TIP.2022.3201467
PMID:36040943
Abstract

To robustly detect arbitrary-shaped scene texts, bottom-up methods are widely explored for their flexibility. Due to the highly homogeneous texture and cluttered distribution of scene texts, it is nontrivial for segmentation-based methods to discover the separatrixes between adjacent instances. To effectively separate nearby texts, many methods adopt the seed expansion strategy that segments shrunken text regions as seed areas, and then iteratively expands the seed areas into intact text regions. In seek of a more straightforward way that does not rely on seed area segmentation and avoid possible error accumulation brought by iterative processing, we propose a redundancy removal strategy. In this work, we directly explore two types of fuzzy semantics-text and separatrix-that do not possess specific boundaries, and separate cluttered instances by excluding the separatrix pixels from text regions. To deal with the fuzzy semantic boundaries, we also conduct reliability analysis in both optimization and inference stage to suppress false positive pixels at ambiguous locations. Experiments on benchmark datasets demonstrate the effectiveness of our method.

摘要

为了稳健地检测任意形状的场景文本,自底向上的方法因其灵活性而被广泛探索。由于场景文本具有高度均匀的纹理和杂乱的分布,基于分割的方法要发现相邻实例之间的分隔线并非易事。为了有效分离附近的文本,许多方法采用种子扩展策略,即将缩小的文本区域分割为种子区域,然后将种子区域迭代扩展为完整的文本区域。为了寻找一种更直接的方法,该方法不依赖种子区域分割且避免迭代处理带来的可能误差积累,我们提出了一种冗余去除策略。在这项工作中,我们直接探索两种不具有特定边界的模糊语义——文本和分隔线,并通过从文本区域中排除分隔线像素来分离杂乱的实例。为了处理模糊的语义边界,我们还在优化和推理阶段进行可靠性分析,以抑制模糊位置的误报像素。在基准数据集上的实验证明了我们方法的有效性。

相似文献

1
Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection.用于任意形状场景文本检测的模糊语义
IEEE Trans Image Process. 2023;32:1-12. doi: 10.1109/TIP.2022.3201467. Epub 2022 Dec 19.
2
A Robust Method: Arbitrary Shape Text Detection Combining Semantic and Position Information.一种鲁棒方法:结合语义和位置信息的任意形状文本检测。
Sensors (Basel). 2022 Dec 18;22(24):9982. doi: 10.3390/s22249982.
3
Arbitrary Shape Text Detection via Segmentation With Probability Maps.基于概率图分割的任意形状文本检测。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2736-2750. doi: 10.1109/TPAMI.2022.3176122. Epub 2023 Feb 3.
4
Kernel Proposal Network for Arbitrary Shape Text Detection.用于任意形状文本检测的内核提议网络。
IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):8731-8742. doi: 10.1109/TNNLS.2022.3152596. Epub 2023 Oct 27.
5
Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting.边界文本检测:迈向任意形状场景文本检测
IEEE Trans Image Process. 2022;31:6200-6212. doi: 10.1109/TIP.2022.3206615. Epub 2022 Sep 28.
6
Unambiguous Text Localization, Retrieval, and Recognition for Cluttered Scenes.用于混杂场景的无歧义文本定位、检索和识别。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1638-1652. doi: 10.1109/TPAMI.2020.3018491. Epub 2022 Feb 3.
7
DenseTextPVT: Pyramid Vision Transformer with Deep Multi-Scale Feature Refinement Network for Dense Text Detection.DenseTextPVT:基于深度多尺度特征细化网络的金字塔视觉 Transformer 用于密集文本检测。
Sensors (Basel). 2023 Jun 25;23(13):5889. doi: 10.3390/s23135889.
8
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
9
HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection.HGR-Net:用于任意形状场景文本检测的分层图推理网络。
IEEE Trans Image Process. 2023;32:4142-4155. doi: 10.1109/TIP.2023.3294822. Epub 2023 Jul 20.
10
Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images.基于多光谱融合的视频图像中任意方向场景文本检测方法。
IEEE Trans Image Process. 2015 Nov;24(11):4488-501. doi: 10.1109/TIP.2015.2465169. Epub 2015 Aug 5.

引用本文的文献

1
DPNet: Scene text detection based on dual perspective CNN-transformer.DPNet:基于双视角 CNN-Transformer 的场景文本检测。
PLoS One. 2024 Oct 21;19(10):e0309286. doi: 10.1371/journal.pone.0309286. eCollection 2024.