• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于级联卷积神经网络的场景文本检测与分割

Scene Text Detection and Segmentation based on Cascaded Convolution Neural Networks.

作者信息

Tang Youbao, Wu Xiangqian

出版信息

IEEE Trans Image Process. 2017 Mar;26(3):1509-1520. doi: 10.1109/TIP.2017.2656474. Epub 2017 Jan 20.

DOI:10.1109/TIP.2017.2656474
PMID:28113342
Abstract

Scene text detection and segmentation are two important and challenging research problems in the field of computer vision. This paper proposes a novel method for scene text detection and segmentation based on cascaded convolution neural networks (CNNs). In this method, a CNN based text-aware candidate text region (CTR) extraction model (named detection network, DNet) is designed and trained using both the edges and the whole regions of text, with which coarse CTRs are detected. A CNN based CTR refinement model (named segmentation network, SNet) is then constructed to precisely segment the coarse CTRs into text to get the refined CTRs. With DNet and SNet, much fewer CTRs are extracted than with traditional approaches while more true text regions are kept. The refined CTRs are finally classified using a CNN based CTR classification model (named classification network, CNet) to get the final text regions. All of these CNN based models are modified from VGGNet-16. Extensive experiments on three benchmark datasets demonstrate that the proposed method achieves state-of-the-art performance and greatly outperforms other scene text detection and segmentation approaches.

摘要

场景文本检测与分割是计算机视觉领域中两个重要且具有挑战性的研究问题。本文提出了一种基于级联卷积神经网络(CNN)的场景文本检测与分割新方法。在该方法中,设计并训练了一种基于CNN的文本感知候选文本区域(CTR)提取模型(称为检测网络,DNet),它利用文本的边缘和整个区域进行训练,通过该模型检测出粗略的CTR。然后构建一个基于CNN的CTR细化模型(称为分割网络,SNet),将粗略的CTR精确分割成文本以获得细化的CTR。借助DNet和SNet,与传统方法相比,提取的CTR数量更少,同时保留了更多真实文本区域。最后,使用基于CNN的CTR分类模型(称为分类网络,CNet)对细化的CTR进行分类,以获得最终的文本区域。所有这些基于CNN的模型均是从VGGNet - 16修改而来。在三个基准数据集上进行的大量实验表明,所提出的方法取得了领先的性能,并且大大优于其他场景文本检测与分割方法。

相似文献

1
Scene Text Detection and Segmentation based on Cascaded Convolution Neural Networks.基于级联卷积神经网络的场景文本检测与分割
IEEE Trans Image Process. 2017 Mar;26(3):1509-1520. doi: 10.1109/TIP.2017.2656474. Epub 2017 Jan 20.
2
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
3
Scene text detection via extremal region based double threshold convolutional network classification.基于极值区域的双阈值卷积网络分类的场景文本检测
PLoS One. 2017 Aug 18;12(8):e0182227. doi: 10.1371/journal.pone.0182227. eCollection 2017.
4
Segmentation of organs-at-risks in head and neck CT images using convolutional neural networks.使用卷积神经网络对头颈部CT图像中的危险器官进行分割。
Med Phys. 2017 Feb;44(2):547-557. doi: 10.1002/mp.12045.
5
Neural Network-Oriented Big Data Model for Yoga Movement Recognition.基于神经网络的瑜伽动作识别大数据模型。
Comput Intell Neurosci. 2021 Oct 30;2021:4334024. doi: 10.1155/2021/4334024. eCollection 2021.
6
Robust Vehicle Detection in Aerial Images Based on Cascaded Convolutional Neural Networks.基于级联卷积神经网络的航空图像中稳健车辆检测
Sensors (Basel). 2017 Nov 24;17(12):2720. doi: 10.3390/s17122720.
7
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
8
Automatic bladder segmentation from CT images using deep CNN and 3D fully connected CRF-RNN.利用深度卷积神经网络和 3D 全连接条件随机场循环神经网络自动进行 CT 图像的膀胱分割。
Int J Comput Assist Radiol Surg. 2018 Jul;13(7):967-975. doi: 10.1007/s11548-018-1733-7. Epub 2018 Mar 19.
9
Embedding topological features into convolutional neural network salient object detection.将拓扑特征嵌入卷积神经网络显著目标检测中。
Neural Netw. 2020 Jan;121:308-318. doi: 10.1016/j.neunet.2019.09.009. Epub 2019 Sep 25.
10
Convolution neural networks for real-time needle detection and localization in 2D ultrasound.卷积神经网络在 2D 超声中实时针检测与定位。
Int J Comput Assist Radiol Surg. 2018 May;13(5):647-657. doi: 10.1007/s11548-018-1721-y. Epub 2018 Mar 6.

引用本文的文献

1
CleanPage: Fast and Clean Document and Whiteboard Capture.CleanPage:快速且清晰的文档和白板捕捉工具。
J Imaging. 2020 Oct 1;6(10):102. doi: 10.3390/jimaging6100102.
2
Text Detection Using Multi-Stage Region Proposal Network Sensitive to Text Scale.基于多阶段区域建议网络的文本检测方法,该方法对文本尺度敏感。
Sensors (Basel). 2021 Feb 9;21(4):1232. doi: 10.3390/s21041232.
3
SSD vs. YOLO for Detection of Outdoor Urban Advertising Panels under Multiple Variabilities.基于多种变化的户外城市广告看板检测的 SSD 与 YOLO 比较
Sensors (Basel). 2020 Aug 15;20(16):4587. doi: 10.3390/s20164587.
4
Automated Counting of Rice Panicle by Applying Deep Learning Model to Images from Unmanned Aerial Vehicle Platform.通过将深度学习模型应用于无人机平台图像实现水稻穗自动计数
Sensors (Basel). 2019 Jul 13;19(14):3106. doi: 10.3390/s19143106.