• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

移动端应用中的场景文字识别:基于字符描述符和结构配置

Scene text recognition in mobile applications by character descriptor and structure configuration.

出版信息

IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.

DOI:10.1109/TIP.2014.2317980
PMID:24759989
Abstract

Text characters and strings in natural scene can provide valuable information for many applications. Extracting text directly from natural scene images or videos is a challenging task because of diverse text patterns and variant background interferences. This paper proposes a method of scene text recognition from detected text regions. In text detection, our previously proposed algorithms are applied to obtain text regions from scene image. First, we design a discriminative character descriptor by combining several state-of-the-art feature detectors and descriptors. Second, we model character structure at each character class by designing stroke configuration maps. Our algorithm design is compatible with the application of scene text extraction in smart mobile devices. An Android-based demo system is developed to show the effectiveness of our proposed method on scene text information extraction from nearby objects. The demo system also provides us some insight into algorithm design and performance improvement of scene text extraction. The evaluation results on benchmark data sets demonstrate that our proposed scheme of text recognition is comparable with the best existing methods.

摘要

自然场景中的文本字符和字符串可为许多应用提供有价值的信息。由于文本模式多样且背景干扰多样,直接从自然场景图像或视频中提取文本是一项具有挑战性的任务。本文提出了一种从检测到的文本区域中识别场景文本的方法。在文本检测中,我们应用之前提出的算法从场景图像中获取文本区域。首先,我们设计了一种判别字符描述符,通过结合几种最先进的特征检测器和描述符来实现。其次,我们通过设计笔画配置图来对每个字符类的字符结构进行建模。我们的算法设计与智能移动设备上的场景文本提取应用兼容。我们开发了一个基于 Android 的演示系统,以展示我们提出的从附近物体中提取场景文本信息的方法的有效性。该演示系统还为我们提供了一些关于场景文本提取的算法设计和性能改进的见解。在基准数据集上的评估结果表明,我们提出的文本识别方案可与现有最佳方法相媲美。

相似文献

1
Scene text recognition in mobile applications by character descriptor and structure configuration.移动端应用中的场景文字识别:基于字符描述符和结构配置
IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.
2
Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification.通过边界聚类、笔画分割和字符串片段分类实现场景图像中的文本本地化。
IEEE Trans Image Process. 2012 Sep;21(9):4256-68. doi: 10.1109/TIP.2012.2199327. Epub 2012 May 15.
3
Text Extraction from Scene Images by Character Appearance and Structure Modeling.通过字符外观和结构建模从场景图像中提取文本
Comput Vis Image Underst. 2013 Feb 1;117(2):182-194. doi: 10.1016/j.cviu.2012.11.002.
4
A new approach for overlay text detection and extraction from complex video scene.一种从复杂视频场景中检测和提取叠加文本的新方法。
IEEE Trans Image Process. 2009 Feb;18(2):401-11. doi: 10.1109/TIP.2008.2008225. Epub 2008 Dec 16.
5
Toward integrated scene text reading.面向集成场景文本阅读。
IEEE Trans Pattern Anal Mach Intell. 2014 Feb;36(2):375-87. doi: 10.1109/TPAMI.2013.126.
6
A unified framework for multioriented text detection and recognition.多方向文本检测与识别的统一框架。
IEEE Trans Image Process. 2014 Nov;23(11):4737-49. doi: 10.1109/TIP.2014.2353813. Epub 2014 Sep 4.
7
Character independent font recognition on a single Chinese character.基于单个汉字的字符独立字体识别。
IEEE Trans Pattern Anal Mach Intell. 2007 Feb;29(2):195-204. doi: 10.1109/TPAMI.2007.26.
8
Simple and efficient method for region of interest value extraction from picture archiving and communication system viewer with optical character recognition software and macro program.利用光学字符识别软件和宏程序从图像存档与通信系统查看器中提取感兴趣区域值的简单高效方法。
Acad Radiol. 2015 Jan;22(1):113-6. doi: 10.1016/j.acra.2014.07.003. Epub 2014 Aug 12.
9
Monocular visual scene understanding: understanding multi-object traffic scenes.单目视觉场景理解:理解多目标交通场景。
IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):882-97. doi: 10.1109/TPAMI.2012.174.
10
A thousand words in a scene.一个场景中有一千个单词。
IEEE Trans Pattern Anal Mach Intell. 2007 Sep;29(9):1575-89. doi: 10.1109/TPAMI.2007.1155.

引用本文的文献

1
A real-time arbitrary-shape text detector.实时任意形状文本检测器。
PLoS One. 2024 Apr 16;19(4):e0302234. doi: 10.1371/journal.pone.0302234. eCollection 2024.
2
Attention Guided Feature Encoding for Scene Text Recognition.用于场景文本识别的注意力引导特征编码
J Imaging. 2022 Oct 8;8(10):276. doi: 10.3390/jimaging8100276.