A hybrid approach to detect and localize texts in natural scene images.

Affiliation

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing 100190, China.

Publication information

IEEE Trans Image Process. 2011 Mar;20(3):800-13. doi: 10.1109/TIP.2010.2070803. Epub 2010 Sep 2.

DOI: 10.1109/TIP.2010.2070803
PMID: 20813645

Abstract

Text detection and localization in natural scene images is important for content-based image analysis. This problem is challenging due to complex backgrounds, non-uniform illumination, and variations in text font, size, and line orientation. In this paper, we present a hybrid approach to robustly detect and localize texts in natural scene images. A text region detector is designed to estimate the text existence confidence and scale information in an image pyramid, which helps segment candidate text components by local binarization. To efficiently filter out the non-text components, a conditional random field (CRF) model that considers unary component properties and binary contextual component relationships, with supervised parameter learning, is proposed. Finally, text components are grouped into text lines/words with a learning-based energy minimization method. Since all three stages are learning-based, very few parameters require manual tuning. Experimental results on the ICDAR 2005 competition dataset show that our approach yields higher precision and recall than state-of-the-art methods. We also evaluated our approach on a multilingual image dataset with promising results.
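
The abstract says candidate text components are segmented by local binarization, but it does not state which thresholding rule is used. The following is a minimal sketch that assumes a Niblack-style rule (threshold = local mean + k × local standard deviation). That rule is a swapped-in illustration, not necessarily the authors' choice. The function name `niblack_binarize` and the parameters `radius` and `k` are hypothetical. In the described pipeline the window size would presumably be informed by the detector's scale estimate; here it is a fixed value.

```python
from statistics import mean, pstdev


def niblack_binarize(image, radius=1, k=-0.2):
    """Mark a pixel as foreground (1) when it is darker than its local threshold.

    `image` is a list of rows of grayscale intensities. `radius` and `k` are
    illustrative assumptions, not values taken from the paper.
    """
    height, width = len(image), len(image[0])
    result = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Collect the neighbourhood, clipped at the image borders.
            window = [
                image[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            # Niblack-style threshold: local mean shifted by k local std devs.
            threshold = mean(window) + k * pstdev(window)
            result[y][x] = 1 if image[y][x] < threshold else 0
    return result


if __name__ == "__main__":
    # A single dark pixel on a bright background.
    sample = [
        [200, 200, 200],
        [200, 50, 200],
        [200, 200, 200],
    ]
    for row in niblack_binarize(sample):
        print(row)
```

Because the threshold is computed per neighbourhood rather than once for the whole image, a rule of this kind can tolerate the non-uniform illumination the abstract lists as a difficulty. A uniform region has zero deviation and produces no foreground pixels.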

Similar articles

1. Robust Text Detection in Natural Scene Images.
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):970-83. doi: 10.1109/TPAMI.2013.182.
2. Scene text detection via connected component clustering and nontext filtering.
IEEE Trans Image Process. 2013 Jun;22(6):2296-305. doi: 10.1109/TIP.2013.2249082.
3. Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images.
IEEE Trans Image Process. 2015 Nov;24(11):4488-501. doi: 10.1109/TIP.2015.2465169. Epub 2015 Aug 5.
4. A new approach for overlay text detection and extraction from complex video scene.
IEEE Trans Image Process. 2009 Feb;18(2):401-11. doi: 10.1109/TIP.2008.2008225. Epub 2008 Dec 16.
5. Scene text detection via extremal region based double threshold convolutional network classification.
PLoS One. 2017 Aug 18;12(8):e0182227. doi: 10.1371/journal.pone.0182227. eCollection 2017.
6. A discriminative kernel-based approach to rank images from text queries.
IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1371-84. doi: 10.1109/TPAMI.2007.70791.
7. Text-Attentional Convolutional Neural Network for Scene Text Detection.
IEEE Trans Image Process. 2016 Jun;25(6):2529-41. doi: 10.1109/TIP.2016.2547588.
8. Multi-Orientation Scene Text Detection with Adaptive Clustering.
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1930-7. doi: 10.1109/TPAMI.2014.2388210.
9. Scene text deblurring using text-specific multiscale dictionaries.
IEEE Trans Image Process. 2015 Apr;24(4):1302-14. doi: 10.1109/TIP.2015.2400217.

Cited by

1. An Intelligent System to Sense Textual Cues for Location Assistance in Autonomous Vehicles.
Sensors (Basel). 2023 May 6;23(9):4537. doi: 10.3390/s23094537.
2. R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation.
Sensors (Basel). 2021 Jan 28;21(3):888. doi: 10.3390/s21030888.
3. DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.
PLoS One. 2015 May 7;10(5):e0126200. doi: 10.1371/journal.pone.0126200. eCollection 2015.
4. Morphological background detection and illumination normalization of text image with poor lighting.
PLoS One. 2014 Nov 26;9(11):e110991. doi: 10.1371/journal.pone.0110991. eCollection 2014.
5. Rotation-invariant features for multi-oriented text detection in natural images.
PLoS One. 2013 Aug 5;8(8):e70173. doi: 10.1371/journal.pone.0070173. Print 2013.