Contour Restoration of Text Components for Recognition in Video/Scene Images.

Publication information

IEEE Trans Image Process. 2016 Dec;25(12):5622-5634. doi: 10.1109/TIP.2016.2607426. Epub 2016 Sep 8.

DOI: 10.1109/TIP.2016.2607426
PMID: 27623587

Abstract

Text recognition in video/natural scene images has gained significant attention in the field of image processing in many computer vision applications, which is much more challenging than recognition in plain background images. In this paper, we aim to restore complete character contours in video/scene images from gray values, in contrast to the conventional techniques that consider edge images/binary information as inputs for text detection and recognition. We explore and utilize the strengths of zero crossing points given by the Laplacian to identify stroke candidate pixels (SPC). For each SPC pair, we propose new symmetry features based on gradient magnitude and Fourier phase angles to identify probable stroke candidate pairs (PSCP). The same symmetry properties are proposed at the PSCP level to choose seed stroke candidate pairs (SSCP). Finally, an iterative algorithm is proposed for SSCP to restore complete character contours. Experimental results on benchmark databases, namely, the ICDAR family of video and natural scenes, Street View Data, and MSRA data sets, show that the proposed technique outperforms the existing techniques in terms of both quality measures and recognition rate. We also show that character contour restoration is effective for text detection in video and natural scene images.
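
The first stage described in the abstract, locating zero crossings of the Laplacian on gray values, can be illustrated with a minimal sketch. This is not the authors' implementation: the 4-neighbour kernel, the edge-replicated border, the function names `laplacian` and `zero_crossings`, and the `min_swing` threshold are all assumptions made for illustration, and the symmetry tests based on gradient magnitude and Fourier phase angles are not reproduced here.

```python
import numpy as np


def laplacian(gray):
    """4-neighbour discrete Laplacian; borders are edge-replicated (an assumption)."""
    g = np.pad(np.asarray(gray, dtype=float), 1, mode="edge")
    return (
        g[:-2, 1:-1] + g[2:, 1:-1]      # upper and lower neighbours
        + g[1:-1, :-2] + g[1:-1, 2:]    # left and right neighbours
        - 4.0 * g[1:-1, 1:-1]           # centre pixel
    )


def zero_crossings(lap, min_swing=0.0):
    """Mark pixels whose Laplacian changes sign against the right or lower neighbour.

    min_swing is a hypothetical noise threshold on the size of the sign change.
    """
    zc = np.zeros(lap.shape, dtype=bool)
    horizontal = (lap[:, :-1] * lap[:, 1:] < 0) & (
        np.abs(lap[:, :-1] - lap[:, 1:]) > min_swing
    )
    vertical = (lap[:-1, :] * lap[1:, :] < 0) & (
        np.abs(lap[:-1, :] - lap[1:, :]) > min_swing
    )
    zc[:, :-1] |= horizontal
    zc[:-1, :] |= vertical
    return zc


# Synthetic gray image: a bright vertical bar (a "stroke") on a dark background.
image = np.zeros((7, 9))
image[:, 3:6] = 100.0

candidates = zero_crossings(laplacian(image))
print(np.flatnonzero(candidates[0]))  # -> [2 5]
```

On this synthetic bar, every row yields one candidate on each side of the stroke. Two-sided pairs of this kind are presumably what the SPC pairs in the abstract are built from, although the pairing rule itself is not given there.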

Similar articles

1
Contour Restoration of Text Components for Recognition in Video/Scene Images.
IEEE Trans Image Process. 2016 Dec;25(12):5622-5634. doi: 10.1109/TIP.2016.2607426. Epub 2016 Sep 8.
2
Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images.
IEEE Trans Image Process. 2015 Nov;24(11):4488-501. doi: 10.1109/TIP.2015.2465169. Epub 2015 Aug 5.
3
Scene text recognition in mobile applications by character descriptor and structure configuration.
IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.
4
Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.
Data Brief. 2020 May 21;31:105749. doi: 10.1016/j.dib.2020.105749. eCollection 2020 Aug.
5
Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification.
IEEE Trans Image Process. 2012 Sep;21(9):4256-68. doi: 10.1109/TIP.2012.2199327. Epub 2012 May 15.
6
Robust Text Detection in Natural Scene Images.
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):970-83. doi: 10.1109/TPAMI.2013.182.
7
Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming.
IEEE Trans Image Process. 2017 Jul;26(7):3235-3248. doi: 10.1109/TIP.2017.2695104. Epub 2017 Apr 18.
8
An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition.
IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2298-2304. doi: 10.1109/TPAMI.2016.2646371. Epub 2016 Dec 29.
9
A new approach for overlay text detection and extraction from complex video scene.
IEEE Trans Image Process. 2009 Feb;18(2):401-11. doi: 10.1109/TIP.2008.2008225. Epub 2008 Dec 16.
10
A hybrid approach to detect and localize texts in natural scene images.
IEEE Trans Image Process. 2011 Mar;20(3):800-13. doi: 10.1109/TIP.2010.2070803. Epub 2010 Sep 2.