Suppr超能文献

通过字符外观和结构建模从场景图像中提取文本

Text Extraction from Scene Images by Character Appearance and Structure Modeling.

作者信息

Yi Chucai, Tian Yingli

机构信息

The Graduate Center and the City College of New York, City University of New York, New York, NY 10016 USA.

出版信息

Comput Vis Image Underst. 2013 Feb 1;117(2):182-194. doi: 10.1016/j.cviu.2012.11.002.

Abstract

In this paper, we propose a novel algorithm to detect text information from natural scene images. Scene text classification and detection are still open research topics. Our proposed algorithm is able to model both character appearance and structure to generate representative and discriminative text descriptors. The contributions of this paper include three aspects: 1) a new character appearance model by a structure correlation algorithm which extracts discriminative appearance features from detected interest points of character samples; 2) a new text descriptor based on structons and correlatons, which model character structure by structure differences among character samples and structure component co-occurrence; and 3) a new text region localization method by combining color decomposition, character contour refinement, and string line alignment to localize character candidates and refine detected text regions. We perform three groups of experiments to evaluate the effectiveness of our proposed algorithm, including text classification, text detection, and character identification. The evaluation results on benchmark datasets demonstrate that our algorithm achieves the state-of-the-art performance on scene text classification and detection, and significantly outperforms the existing algorithms for character identification.

摘要

在本文中,我们提出了一种从自然场景图像中检测文本信息的新颖算法。场景文本分类和检测仍是开放的研究课题。我们提出的算法能够对字符外观和结构进行建模,以生成具有代表性和区分性的文本描述符。本文的贡献包括三个方面:1)一种通过结构相关算法得到的新字符外观模型,该算法从检测到的字符样本兴趣点中提取区分性外观特征;2)一种基于结构元和相关性的新文本描述符,它通过字符样本之间的结构差异和结构组件共现来对字符结构进行建模;3)一种通过结合颜色分解、字符轮廓细化和字符串线对齐来定位字符候选并细化检测到的文本区域的新文本区域定位方法。我们进行了三组实验来评估我们提出算法的有效性,包括文本分类、文本检测和字符识别。在基准数据集上的评估结果表明,我们的算法在场景文本分类和检测方面达到了当前的最优性能,并且在字符识别方面显著优于现有算法。

相似文献

3
4
Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.用于精确场景文本识别的图像到字符再到单词的变换器
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12908-12921. doi: 10.1109/TPAMI.2022.3230962. Epub 2023 Oct 3.
9
Robust Text Detection in Natural Scene Images.自然场景图像中的鲁棒文本检测。
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):970-83. doi: 10.1109/TPAMI.2013.182.
10
Real-Time Lexicon-Free Scene Text Localization and Recognition.实时无词典场景文本定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep;38(9):1872-85. doi: 10.1109/TPAMI.2015.2496234. Epub 2015 Oct 30.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验