• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于细粒度特征表示的维吾尔语场景文本检测。

Scene Uyghur Text Detection Based on Fine-Grained Feature Representation.

机构信息

School of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.

Xinjiang Key Laboratory of Multilingual Information Technology, Xinjiang University, Urumqi 830046, China.

出版信息

Sensors (Basel). 2022 Jun 9;22(12):4372. doi: 10.3390/s22124372.

DOI:10.3390/s22124372
PMID:35746154
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9229707/
Abstract

Scene text detection task aims to precisely localize text in natural environments. At present, the application scenarios of text detection topics have gradually shifted from plain document text to more complex natural scenarios. Objects with similar texture and text morphology in the complex background noise of natural scene images are prone to false recall and difficult to detect multi-scale texts, a multi-directional scene Uyghur text detection model based on fine-grained feature representation and spatial feature fusion is proposed, and feature extraction and feature fusion are improved to enhance the network's ability to represent multi-scale features. In this method, the multiple groups of 3 × 3 convolutional feature groups that are connected like the hierarchical residual to build a residual network for feature extraction, which captures the feature details and increases the receptive field of the network to adapt to multi-scale text and long glued dimensional font detection and suppress false positives of text-like objects. Secondly, an adaptive multi-level feature map fusion strategy is adopted to overcome the inconsistency of information in multi-scale feature map fusion. The proposed model achieves 93.94% and 84.92% F-measure on the self-built Uyghur dataset and the ICDAR2015 dataset, respectively, which improves the accuracy of Uyghur text detection and suppresses false positives.

摘要

场景文本检测任务旨在精确定位自然环境中的文本。目前,文本检测主题的应用场景已经逐渐从普通文档文本转移到更复杂的自然场景。自然场景图像复杂背景噪声中具有相似纹理和文本形态的对象容易产生误召回,难以检测多尺度文本,提出了一种基于细粒度特征表示和空间特征融合的多方向场景维吾尔文文本检测模型,改进了特征提取和特征融合,增强了网络表示多尺度特征的能力。在该方法中,通过连接类似于分层残差的多组 3×3 卷积特征组来构建用于特征提取的残差网络,该网络捕获特征细节并增加网络的感受野,以适应多尺度文本和长粘连维字体检测,并抑制文本类对象的假阳性。其次,采用自适应多级特征图融合策略来克服多尺度特征图融合中的信息不一致性。所提出的模型在自建的维吾尔语数据集和 ICDAR2015 数据集上分别实现了 93.94%和 84.92%的 F-measure,提高了维吾尔语文本检测的准确性并抑制了假阳性。

相似文献

1
Scene Uyghur Text Detection Based on Fine-Grained Feature Representation.基于细粒度特征表示的维吾尔语场景文本检测。
Sensors (Basel). 2022 Jun 9;22(12):4372. doi: 10.3390/s22124372.
2
A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion.一种基于注意力特征提取和级联特征融合的多尺度自然场景文本检测方法
Sensors (Basel). 2024 Jun 9;24(12):3758. doi: 10.3390/s24123758.
3
Attention-Based Scene Text Detection on Dual Feature Fusion.基于注意力的双特征融合场景文本检测。
Sensors (Basel). 2022 Nov 23;22(23):9072. doi: 10.3390/s22239072.
4
Scene Text Detection Based on Two-Branch Feature Extraction.基于双分支特征提取的场景文本检测。
Sensors (Basel). 2022 Aug 20;22(16):6262. doi: 10.3390/s22166262.
5
SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection.SEMPA 网络:一种具有挤压激励的改进路径聚合网络,用于场景文本检测。
Sensors (Basel). 2021 Apr 9;21(8):2657. doi: 10.3390/s21082657.
6
Scene Uyghur Recognition Based on Visual Prediction Enhancement.基于视觉预测增强的场景维吾尔语识别
Sensors (Basel). 2023 Oct 20;23(20):8610. doi: 10.3390/s23208610.
7
MR-FPN: Multi-Level Residual Feature Pyramid Text Detection Network Based on Self-Attention Environment.MR-FPN:基于自注意力环境的多尺度残差特征金字塔文本检测网络。
Sensors (Basel). 2022 Apr 27;22(9):3337. doi: 10.3390/s22093337.
8
A Fine-Grained Image Classification and Detection Method Based on Convolutional Neural Network Fused with Attention Mechanism.基于卷积神经网络融合注意力机制的细粒度图像分类与检测方法。
Comput Intell Neurosci. 2022 Sep 14;2022:2974960. doi: 10.1155/2022/2974960. eCollection 2022.
9
AAF-Net: Scene text detection based on attention aggregation features.AAF-Net:基于注意力聚合特征的场景文本检测。
PLoS One. 2022 Aug 5;17(8):e0272322. doi: 10.1371/journal.pone.0272322. eCollection 2022.
10
Aerial images object detection method based on cross-scale multi-feature fusion.基于跨尺度多特征融合的航空图像目标检测方法
Math Biosci Eng. 2023 Aug 9;20(9):16148-16168. doi: 10.3934/mbe.2023721.

引用本文的文献

1
Multi-Scale Feature Fusion of Covariance Pooling Networks for Fine-Grained Visual Recognition.协方差池化网络的多尺度特征融合用于细粒度视觉识别。
Sensors (Basel). 2023 Apr 13;23(8):3970. doi: 10.3390/s23083970.
2
Irregular Scene Text Detection Based on a Graph Convolutional Network.基于图卷积网络的不规则场景文本检测。
Sensors (Basel). 2023 Jan 17;23(3):1070. doi: 10.3390/s23031070.

本文引用的文献

1
Multi-Directional Scene Text Detection Based on Improved YOLOv3.基于改进的 YOLOv3 的多方向场景文本检测
Sensors (Basel). 2021 Jul 16;21(14):4870. doi: 10.3390/s21144870.
2
SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection.SEMPA 网络:一种具有挤压激励的改进路径聚合网络,用于场景文本检测。
Sensors (Basel). 2021 Apr 9;21(8):2657. doi: 10.3390/s21082657.
3
A Straightforward and Efficient Instance-Aware Curved Text Detector.一种直接有效的实例感知的弯曲文本检测器。
Sensors (Basel). 2021 Mar 10;21(6):1945. doi: 10.3390/s21061945.
4
Res2Net: A New Multi-Scale Backbone Architecture.Res2Net:一种新的多尺度骨干网络架构。
IEEE Trans Pattern Anal Mach Intell. 2021 Feb;43(2):652-662. doi: 10.1109/TPAMI.2019.2938758. Epub 2021 Jan 8.
5
TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.文本字段:学习用于不规则场景文本检测的深度方向场。
IEEE Trans Image Process. 2019 Nov;28(11):5566-5579. doi: 10.1109/TIP.2019.2900589. Epub 2019 Feb 21.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.