• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过字符外观和结构建模从场景图像中提取文本

Text Extraction from Scene Images by Character Appearance and Structure Modeling.

作者信息

Yi Chucai, Tian Yingli

机构信息

The Graduate Center and the City College of New York, City University of New York, New York, NY 10016 USA.

出版信息

Comput Vis Image Underst. 2013 Feb 1;117(2):182-194. doi: 10.1016/j.cviu.2012.11.002.

DOI:10.1016/j.cviu.2012.11.002
PMID:23316111
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3539806/
Abstract

In this paper, we propose a novel algorithm to detect text information from natural scene images. Scene text classification and detection are still open research topics. Our proposed algorithm is able to model both character appearance and structure to generate representative and discriminative text descriptors. The contributions of this paper include three aspects: 1) a new character appearance model by a structure correlation algorithm which extracts discriminative appearance features from detected interest points of character samples; 2) a new text descriptor based on structons and correlatons, which model character structure by structure differences among character samples and structure component co-occurrence; and 3) a new text region localization method by combining color decomposition, character contour refinement, and string line alignment to localize character candidates and refine detected text regions. We perform three groups of experiments to evaluate the effectiveness of our proposed algorithm, including text classification, text detection, and character identification. The evaluation results on benchmark datasets demonstrate that our algorithm achieves the state-of-the-art performance on scene text classification and detection, and significantly outperforms the existing algorithms for character identification.

摘要

在本文中,我们提出了一种从自然场景图像中检测文本信息的新颖算法。场景文本分类和检测仍是开放的研究课题。我们提出的算法能够对字符外观和结构进行建模,以生成具有代表性和区分性的文本描述符。本文的贡献包括三个方面:1)一种通过结构相关算法得到的新字符外观模型,该算法从检测到的字符样本兴趣点中提取区分性外观特征;2)一种基于结构元和相关性的新文本描述符,它通过字符样本之间的结构差异和结构组件共现来对字符结构进行建模;3)一种通过结合颜色分解、字符轮廓细化和字符串线对齐来定位字符候选并细化检测到的文本区域的新文本区域定位方法。我们进行了三组实验来评估我们提出算法的有效性,包括文本分类、文本检测和字符识别。在基准数据集上的评估结果表明,我们的算法在场景文本分类和检测方面达到了当前的最优性能,并且在字符识别方面显著优于现有算法。

相似文献

1
Text Extraction from Scene Images by Character Appearance and Structure Modeling.通过字符外观和结构建模从场景图像中提取文本
Comput Vis Image Underst. 2013 Feb 1;117(2):182-194. doi: 10.1016/j.cviu.2012.11.002.
2
Scene text recognition in mobile applications by character descriptor and structure configuration.移动端应用中的场景文字识别:基于字符描述符和结构配置
IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.
3
Text string detection from natural scenes by structure-based partition and grouping.基于结构划分和分组的自然场景文本字符串检测。
IEEE Trans Image Process. 2011 Sep;20(9):2594-605. doi: 10.1109/TIP.2011.2126586. Epub 2011 Mar 14.
4
Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.用于精确场景文本识别的图像到字符再到单词的变换器
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12908-12921. doi: 10.1109/TPAMI.2022.3230962. Epub 2023 Oct 3.
5
Scene text detection via extremal region based double threshold convolutional network classification.基于极值区域的双阈值卷积网络分类的场景文本检测
PLoS One. 2017 Aug 18;12(8):e0182227. doi: 10.1371/journal.pone.0182227. eCollection 2017.
6
Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification.通过边界聚类、笔画分割和字符串片段分类实现场景图像中的文本本地化。
IEEE Trans Image Process. 2012 Sep;21(9):4256-68. doi: 10.1109/TIP.2012.2199327. Epub 2012 May 15.
7
Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.连笔文本:用于自然场景图像中乌尔都语文本端到端识别的综合数据集。
Data Brief. 2020 May 21;31:105749. doi: 10.1016/j.dib.2020.105749. eCollection 2020 Aug.
8
Irregular Scene Text Detection Based on a Graph Convolutional Network.基于图卷积网络的不规则场景文本检测。
Sensors (Basel). 2023 Jan 17;23(3):1070. doi: 10.3390/s23031070.
9
Robust Text Detection in Natural Scene Images.自然场景图像中的鲁棒文本检测。
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):970-83. doi: 10.1109/TPAMI.2013.182.
10
Real-Time Lexicon-Free Scene Text Localization and Recognition.实时无词典场景文本定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep;38(9):1872-85. doi: 10.1109/TPAMI.2015.2496234. Epub 2015 Oct 30.

引用本文的文献

1
Scene text detection via extremal region based double threshold convolutional network classification.基于极值区域的双阈值卷积网络分类的场景文本检测
PLoS One. 2017 Aug 18;12(8):e0182227. doi: 10.1371/journal.pone.0182227. eCollection 2017.

本文引用的文献

1
Text Detection in Natural Scene Images by Stroke Gabor Words.基于笔画Gabor词的自然场景图像文本检测
Proc Int Conf Doc Anal Recognit. 2011;2011:177-181. doi: 10.1109/ICDAR.2011.44.
2
Text string detection from natural scenes by structure-based partition and grouping.基于结构划分和分组的自然场景文本字符串检测。
IEEE Trans Image Process. 2011 Sep;20(9):2594-605. doi: 10.1109/TIP.2011.2126586. Epub 2011 Mar 14.
3
Text from corners: a novel approach to detect text and caption in videos.从角落提取文本:一种新颖的视频中检测文本和标题的方法。
IEEE Trans Image Process. 2011 Mar;20(3):790-9. doi: 10.1109/TIP.2010.2068553. Epub 2010 Aug 19.
4
Scene text recognition using similarity and a lexicon with sparse belief propagation.使用相似性和带有稀疏信念传播的词典进行场景文本识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1733-46. doi: 10.1109/TPAMI.2009.38.
5
Text extraction and document image segmentation using matched wavelets and MRF model.使用匹配小波和马尔可夫随机场模型进行文本提取与文档图像分割。
IEEE Trans Image Process. 2007 Aug;16(8):2117-28. doi: 10.1109/tip.2007.900098.
6
Automatic detection and recognition of signs from natural scenes.自然场景中标志的自动检测与识别。
IEEE Trans Image Process. 2004 Jan;13(1):87-99. doi: 10.1109/tip.2003.819223.