• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用掩码紧密度文本检测器的任意形状场景文本检测

Arbitrarily Shaped Scene Text Detection with a Mask Tightness Text Detector.

作者信息

Liu Yuliang, Jin Lianwen, Fang Chuanming

出版信息

IEEE Trans Image Process. 2019 Nov 26. doi: 10.1109/TIP.2019.2954218.

DOI:10.1109/TIP.2019.2954218
PMID:31794397
Abstract

Scene text in the environment is complicated. It can exist in arbitrary text fonts, sizes or shapes. Although scene text detection has witnessed considerable progress in recent years, the detection of text with complex shapes, especially curved text, remains challenging. Datasets with adequate samples to overcome the problem presented by curved text (or other irregularly shaped text) have been introduced only recently; however, the performance of the reported methods on these datasets is unsatisfactory. Therefore, detecting arbitrarily shaped text remains a challenging. This motivated us to propose the Mask Tightness Text Detector (Mask TTD) to improve text detection performance. Mask TTD uses a tightness prior and text frontier learning to enhance pixel-wise mask prediction. In addition, it achieves mutual promotion by integrating a branch for the polygonal boundary of each text region, which significantly improves the detection performance of arbitrarily shaped text. Experiments demonstrate that Mask TTD can achieve state-ofthe-art performance on existing curved text datasets (CTW1500, Total-text, and CUTE80) and three common benchmark datasets (RCTW-17, MSRA-TD500, and ICDAR 2015). It is worth mentioning that on CTW1500, our method can outperform previous methods, especially at higher intersection over union (IoU) thresholds (16% higher than the next-best method with an IoU threshold of 0.8), which demonstrates its potential for tight text detection. Moreover, on the largest Chinese-based dataset RCTW-17, Mask TTD outperforms other methods by a large margin in terms of both the Average Precision and F-measure, showing its powerful generalization ability.

摘要

环境中的场景文本复杂多样。它可以呈现为任意的字体、大小或形状。尽管近年来场景文本检测取得了显著进展,但复杂形状文本(尤其是弯曲文本)的检测仍然具有挑战性。直到最近才引入了具有足够样本的数据集来克服弯曲文本(或其他不规则形状文本)带来的问题;然而,已报道方法在这些数据集上的性能并不理想。因此,检测任意形状的文本仍然是一项具有挑战性的任务。这促使我们提出掩码紧密度文本检测器(Mask TTD)以提高文本检测性能。Mask TTD利用紧密度先验和文本边界学习来增强逐像素掩码预测。此外,它通过整合每个文本区域多边形边界的分支实现相互促进,显著提高了任意形状文本的检测性能。实验表明,Mask TTD在现有的弯曲文本数据集(CTW1500、Total-text和CUTE80)以及三个常见基准数据集(RCTW-17、MSRA-TD500和ICDAR 2015)上能够达到领先的性能。值得一提的是,在CTW1500数据集上,我们的方法优于先前的方法,特别是在更高的交并比(IoU)阈值下(在IoU阈值为0.8时比次优方法高16%),这证明了其在紧密文本检测方面的潜力。此外,在最大的中文数据集RCTW-17上,Mask TTD在平均精度和F值方面均大幅优于其他方法,显示出其强大的泛化能力。

相似文献

1
Arbitrarily Shaped Scene Text Detection with a Mask Tightness Text Detector.使用掩码紧密度文本检测器的任意形状场景文本检测
IEEE Trans Image Process. 2019 Nov 26. doi: 10.1109/TIP.2019.2954218.
2
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
3
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm.基于期望最大化算法的混合监督场景文本检测
IEEE Trans Image Process. 2022;31:5513-5528. doi: 10.1109/TIP.2022.3197987. Epub 2022 Aug 22.
4
Arbitrary Shape Text Detection via Segmentation With Probability Maps.基于概率图分割的任意形状文本检测。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2736-2750. doi: 10.1109/TPAMI.2022.3176122. Epub 2023 Feb 3.
5
Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting.边界文本检测:迈向任意形状场景文本检测
IEEE Trans Image Process. 2022;31:6200-6212. doi: 10.1109/TIP.2022.3206615. Epub 2022 Sep 28.
6
A Robust Method: Arbitrary Shape Text Detection Combining Semantic and Position Information.一种鲁棒方法:结合语义和位置信息的任意形状文本检测。
Sensors (Basel). 2022 Dec 18;22(24):9982. doi: 10.3390/s22249982.
7
CM-Net: Concentric Mask Based Arbitrary-Shaped Text Detection.CM-Net:基于同心掩码的任意形状文本检测
IEEE Trans Image Process. 2022;31:2864-2877. doi: 10.1109/TIP.2022.3141844. Epub 2022 Apr 8.
8
R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation.R-YOLO:一种用于任意旋转自然场景的实时文本检测器。
Sensors (Basel). 2021 Jan 28;21(3):888. doi: 10.3390/s21030888.
9
A real-time arbitrary-shape text detector.实时任意形状文本检测器。
PLoS One. 2024 Apr 16;19(4):e0302234. doi: 10.1371/journal.pone.0302234. eCollection 2024.
10
Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images.基于多光谱融合的视频图像中任意方向场景文本检测方法。
IEEE Trans Image Process. 2015 Nov;24(11):4488-501. doi: 10.1109/TIP.2015.2465169. Epub 2015 Aug 5.

引用本文的文献

1
DPNet: Scene text detection based on dual perspective CNN-transformer.DPNet:基于双视角 CNN-Transformer 的场景文本检测。
PLoS One. 2024 Oct 21;19(10):e0309286. doi: 10.1371/journal.pone.0309286. eCollection 2024.
2
HubNet: An E2E Model for Wheel Hub Text Detection and Recognition Using Global and Local Features.HubNet:一种利用全局和局部特征进行轮毂文本检测与识别的端到端模型。
Sensors (Basel). 2024 Sep 24;24(19):6183. doi: 10.3390/s24196183.
3
A Straightforward and Efficient Instance-Aware Curved Text Detector.一种直接有效的实例感知的弯曲文本检测器。
Sensors (Basel). 2021 Mar 10;21(6):1945. doi: 10.3390/s21061945.