• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HGR-Net:用于任意形状场景文本检测的分层图推理网络。

HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection.

作者信息

Bi Hengyue, Xu Canhui, Shi Cao, Liu Guozhu, Zhang Honghong, Li Yuteng, Dong Junyu

出版信息

IEEE Trans Image Process. 2023;32:4142-4155. doi: 10.1109/TIP.2023.3294822. Epub 2023 Jul 20.

DOI:10.1109/TIP.2023.3294822
PMID:37459262
Abstract

As a prerequisite step of scene text reading, scene text detection is known as a challenging task due to natural scene text diversity and variability. Most existing methods either adopt bottom-up sub-text component extraction or focus on top-down text contour regression. From a hybrid perspective, we explore hierarchical text instance-level and component-level representation for arbitrarily-shaped scene text detection. In this work, we propose a novel Hierarchical Graph Reasoning Network (HGR-Net), which consists of a Text Feature Extraction Network (TFEN) and a Text Relation Learner Network (TRLN). TFEN adaptively learns multi-grained text candidates based on shared convolutional feature maps, including instance-level text contours and component-level quadrangles. In TRLN, an inter-text graph is constructed to explore global contextual information with position-awareness between text instances, and an intra-text graph is designed to estimate geometric attributes for establishing component-level linkages. Next, we bridge the cross-feed interaction between instance-level and component-level, and it further achieves hierarchical relational reasoning by learning complementary graph embeddings across levels. Experiments conducted on three publicly available benchmarks SCUT-CTW1500, Total-Text, and ICDAR15 have demonstrated that HGR-Net achieves state-of-the-art performance on arbitrary orientation and arbitrary shape scene text detection.

摘要

作为场景文本阅读的前提步骤,由于自然场景文本的多样性和变异性,场景文本检测是一项具有挑战性的任务。大多数现有方法要么采用自下而上的子文本组件提取,要么专注于自上而下的文本轮廓回归。从混合的角度出发,我们探索用于任意形状场景文本检测的层次化文本实例级和组件级表示。在这项工作中,我们提出了一种新颖的层次图推理网络(HGR-Net),它由文本特征提取网络(TFEN)和文本关系学习网络(TRLN)组成。TFEN基于共享卷积特征图自适应地学习多粒度文本候选,包括实例级文本轮廓和组件级四边形。在TRLN中,构建一个文本间图以探索文本实例之间具有位置感知的全局上下文信息,并设计一个文本内图来估计几何属性以建立组件级联系。接下来,我们在实例级和组件级之间建立交叉馈送交互,并通过跨级别学习互补图嵌入进一步实现层次关系推理。在三个公开基准数据集SCUT-CTW1500、Total-Text和ICDAR15上进行的实验表明,HGR-Net在任意方向和任意形状的场景文本检测中都取得了领先的性能。

相似文献

1
HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection.HGR-Net:用于任意形状场景文本检测的分层图推理网络。
IEEE Trans Image Process. 2023;32:4142-4155. doi: 10.1109/TIP.2023.3294822. Epub 2023 Jul 20.
2
Irregular Scene Text Detection Based on a Graph Convolutional Network.基于图卷积网络的不规则场景文本检测。
Sensors (Basel). 2023 Jan 17;23(3):1070. doi: 10.3390/s23031070.
3
CM-Net: Concentric Mask Based Arbitrary-Shaped Text Detection.CM-Net:基于同心掩码的任意形状文本检测
IEEE Trans Image Process. 2022;31:2864-2877. doi: 10.1109/TIP.2022.3141844. Epub 2022 Apr 8.
4
Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting.边界文本检测:迈向任意形状场景文本检测
IEEE Trans Image Process. 2022;31:6200-6212. doi: 10.1109/TIP.2022.3206615. Epub 2022 Sep 28.
5
Arbitrary Shape Text Detection via Segmentation With Probability Maps.基于概率图分割的任意形状文本检测。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2736-2750. doi: 10.1109/TPAMI.2022.3176122. Epub 2023 Feb 3.
6
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm.基于期望最大化算法的混合监督场景文本检测
IEEE Trans Image Process. 2022;31:5513-5528. doi: 10.1109/TIP.2022.3197987. Epub 2022 Aug 22.
7
A Robust Method: Arbitrary Shape Text Detection Combining Semantic and Position Information.一种鲁棒方法:结合语义和位置信息的任意形状文本检测。
Sensors (Basel). 2022 Dec 18;22(24):9982. doi: 10.3390/s22249982.
8
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
9
SLOAN: Scale-Adaptive Orientation Attention Network for Scene Text Recognition.斯隆:用于场景文本识别的尺度自适应方向注意网络。
IEEE Trans Image Process. 2021;30:1687-1701. doi: 10.1109/TIP.2020.3045602. Epub 2021 Jan 14.
10
Attention-Based Scene Text Detection on Dual Feature Fusion.基于注意力的双特征融合场景文本检测。
Sensors (Basel). 2022 Nov 23;22(23):9072. doi: 10.3390/s22239072.