• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然图像中多朝向文本检测的旋转不变特征。

Rotation-invariant features for multi-oriented text detection in natural images.

机构信息

Department of Electronics and Information Engineering, Huazhong University of Science and Technology, Wuhan, China.

出版信息

PLoS One. 2013 Aug 5;8(8):e70173. doi: 10.1371/journal.pone.0070173. Print 2013.

DOI:10.1371/journal.pone.0070173
PMID:23940544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3734103/
Abstract

Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes.

摘要

自然场景中的文本包含丰富的语义信息,可用于辅助各种应用,例如对象识别、图像/视频检索、制图/导航和人机交互。然而,现有的大多数系统都是为检测和识别水平(或接近水平)文本而设计的。由于移动计算设备和应用的日益普及,在较少受控条件下从自然图像中检测不同方向的文本已成为一项重要但具有挑战性的任务。在本文中,我们提出了一种新的算法来检测不同方向的文本。我们的算法基于两级分类方案和两组专门设计的特征,用于捕获文本的内在特征。为了更好地评估所提出的方法并将其与竞争算法进行比较,我们生成了一个包含各种类型文本的综合数据集,这些文本来自不同的真实场景。我们还提出了一种新的评估协议,该协议更适合用于基准测试不同方向文本检测算法。在基准数据集上的实验表明,当处理水平文本时,我们的系统优于最先进的算法,并且在复杂自然场景中的变体文本方面性能显著提高。

相似文献

1
Rotation-invariant features for multi-oriented text detection in natural images.自然图像中多朝向文本检测的旋转不变特征。
PLoS One. 2013 Aug 5;8(8):e70173. doi: 10.1371/journal.pone.0070173. Print 2013.
2
A unified framework for multioriented text detection and recognition.多方向文本检测与识别的统一框架。
IEEE Trans Image Process. 2014 Nov;23(11):4737-49. doi: 10.1109/TIP.2014.2353813. Epub 2014 Sep 4.
3
R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation.R-YOLO:一种用于任意旋转自然场景的实时文本检测器。
Sensors (Basel). 2021 Jan 28;21(3):888. doi: 10.3390/s21030888.
4
Recognition of pornographic web pages by classifying texts and images.通过对文本和图像进行分类来识别色情网页。
IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1019-34. doi: 10.1109/TPAMI.2007.1133.
5
Scene text recognition in mobile applications by character descriptor and structure configuration.移动端应用中的场景文字识别:基于字符描述符和结构配置
IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.
6
Conjunctive patches subspace learning with side information for collaborative image retrieval.基于连接补丁子空间学习的带有辅助信息的协同图像检索。
IEEE Trans Image Process. 2012 Aug;21(8):3707-20. doi: 10.1109/TIP.2012.2195014. Epub 2012 Apr 17.
7
DenseTextPVT: Pyramid Vision Transformer with Deep Multi-Scale Feature Refinement Network for Dense Text Detection.DenseTextPVT:基于深度多尺度特征细化网络的金字塔视觉 Transformer 用于密集文本检测。
Sensors (Basel). 2023 Jun 25;23(13):5889. doi: 10.3390/s23135889.
8
Scale-invariant features for 3-D mesh models.用于 3D 网格模型的尺度不变特征。
IEEE Trans Image Process. 2012 May;21(5):2758-69. doi: 10.1109/TIP.2012.2183142. Epub 2012 Jan 9.
9
Data classification with radial basis function networks based on a novel kernel density estimation algorithm.基于一种新型核密度估计算法的径向基函数网络数据分类
IEEE Trans Neural Netw. 2005 Jan;16(1):225-36. doi: 10.1109/TNN.2004.836229.
10
Illumination normalization with time-dependent intrinsic images for video surveillance.用于视频监控的基于时间相关固有图像的光照归一化
IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1336-47. doi: 10.1109/TPAMI.2004.86.

引用本文的文献

1
Morphological background detection and illumination normalization of text image with poor lighting.光照条件较差的文本图像的形态学背景检测与光照归一化
PLoS One. 2014 Nov 26;9(11):e110991. doi: 10.1371/journal.pone.0110991. eCollection 2014.

本文引用的文献

1
Real-Time Lexicon-Free Scene Text Localization and Recognition.实时无词典场景文本定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep;38(9):1872-85. doi: 10.1109/TPAMI.2015.2496234. Epub 2015 Oct 30.
2
Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification.通过边界聚类、笔画分割和字符串片段分类实现场景图像中的文本本地化。
IEEE Trans Image Process. 2012 Sep;21(9):4256-68. doi: 10.1109/TIP.2012.2199327. Epub 2012 May 15.
3
RASL: robust alignment by sparse and low-rank decomposition for linearly correlated images.
RASL:基于稀疏和低秩分解的线性相关图像鲁棒配准。
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2233-46. doi: 10.1109/TPAMI.2011.282.
4
A computational approach to edge detection.一种基于计算的边缘检测方法。
IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.
5
Text string detection from natural scenes by structure-based partition and grouping.基于结构划分和分组的自然场景文本字符串检测。
IEEE Trans Image Process. 2011 Sep;20(9):2594-605. doi: 10.1109/TIP.2011.2126586. Epub 2011 Mar 14.
6
A hybrid approach to detect and localize texts in natural scene images.一种用于检测和定位自然场景图像中文本的混合方法。
IEEE Trans Image Process. 2011 Mar;20(3):800-13. doi: 10.1109/TIP.2010.2070803. Epub 2010 Sep 2.
7
A Laplacian approach to multi-oriented text detection in video.基于拉普拉斯方法的视频多方向文本检测
IEEE Trans Pattern Anal Mach Intell. 2011 Feb;33(2):412-9. doi: 10.1109/TPAMI.2010.166.
8
Text from corners: a novel approach to detect text and caption in videos.从角落提取文本:一种新颖的视频中检测文本和标题的方法。
IEEE Trans Image Process. 2011 Mar;20(3):790-9. doi: 10.1109/TIP.2010.2068553. Epub 2010 Aug 19.
9
Robust face recognition via sparse representation.基于稀疏表示的鲁棒人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):210-27. doi: 10.1109/TPAMI.2008.79.
10
Morphological text extraction from images.从图像中提取形态学文本。
IEEE Trans Image Process. 2000;9(11):1978-83. doi: 10.1109/83.877220.