• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于局部特征学习的结构化聚类检测用于文本区域提取

Structured Cluster Detection from Local Feature Learning for Text Region Extraction.

作者信息

Lin Huei-Yung, Hsu Chin-Yu

机构信息

Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106, Taiwan.

Department of Electrical Engineering, National Chung Cheng University, Chiayi 621, Taiwan.

出版信息

Entropy (Basel). 2023 Apr 14;25(4):658. doi: 10.3390/e25040658.

DOI:10.3390/e25040658
PMID:37190448
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10137775/
Abstract

The detection of regions of interest is commonly considered as an early stage of information extraction from images. It is used to provide the contents meaningful to human perception for machine vision applications. In this work, a new technique for structured region detection based on the distillation of local image features with clustering analysis is proposed. Different from the existing methods, our approach takes the application-specific reference images for feature learning and extraction. It is able to identify text clusters under the sparsity of feature points derived from the characters. For the localization of structured regions, the cluster with high feature density is calculated and serves as a candidate for region expansion. An iterative adjustment is then performed to enlarge the ROI for complete text coverage. The experiments carried out for text region detection of invoice and banknote demonstrate the effectiveness of the proposed technique.

摘要

感兴趣区域的检测通常被视为从图像中提取信息的早期阶段。它用于为机器视觉应用提供对人类感知有意义的内容。在这项工作中,提出了一种基于局部图像特征聚类分析的结构化区域检测新技术。与现有方法不同,我们的方法采用特定应用的参考图像进行特征学习和提取。它能够在字符衍生的特征点稀疏的情况下识别文本聚类。对于结构化区域的定位,计算具有高特征密度的聚类并将其用作区域扩展的候选。然后进行迭代调整以扩大感兴趣区域以实现完整的文本覆盖。针对发票和钞票的文本区域检测进行的实验证明了所提出技术的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/4cf0b94311df/entropy-25-00658-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/f1bd8885f3a6/entropy-25-00658-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/e563824c7c0b/entropy-25-00658-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/88c9ad71564e/entropy-25-00658-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/86316004bd2f/entropy-25-00658-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/11d9cdb40382/entropy-25-00658-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/966bd87fa7f6/entropy-25-00658-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/d187bc998502/entropy-25-00658-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/8f808563a795/entropy-25-00658-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/e8173b546aec/entropy-25-00658-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/f820766e4d19/entropy-25-00658-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/ab5726310fb9/entropy-25-00658-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/6846d80c2400/entropy-25-00658-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/4cf0b94311df/entropy-25-00658-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/f1bd8885f3a6/entropy-25-00658-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/e563824c7c0b/entropy-25-00658-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/88c9ad71564e/entropy-25-00658-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/86316004bd2f/entropy-25-00658-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/11d9cdb40382/entropy-25-00658-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/966bd87fa7f6/entropy-25-00658-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/d187bc998502/entropy-25-00658-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/8f808563a795/entropy-25-00658-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/e8173b546aec/entropy-25-00658-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/f820766e4d19/entropy-25-00658-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/ab5726310fb9/entropy-25-00658-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/6846d80c2400/entropy-25-00658-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a04e/10137775/4cf0b94311df/entropy-25-00658-g013.jpg

相似文献

1
Structured Cluster Detection from Local Feature Learning for Text Region Extraction.基于局部特征学习的结构化聚类检测用于文本区域提取
Entropy (Basel). 2023 Apr 14;25(4):658. doi: 10.3390/e25040658.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Machine Learning-Based Fast Banknote Serial Number Recognition Using Knowledge Distillation and Bayesian Optimization.基于机器学习的纸币序列号快速识别方法,结合知识蒸馏和贝叶斯优化。
Sensors (Basel). 2019 Sep 28;19(19):4218. doi: 10.3390/s19194218.
4
A CCD based machine vision system for real-time text detection.一种基于电荷耦合器件(CCD)的用于实时文本检测的机器视觉系统。
Front Optoelectron. 2020 Dec;13(4):418-424. doi: 10.1007/s12200-019-0854-0. Epub 2019 Aug 5.
5
Breast microcalcifications detection based on fusing features with DTCWT.基于 DTCWT 融合特征的乳腺微钙化检测。
J Xray Sci Technol. 2020;28(2):197-218. doi: 10.3233/XST-190583.
6
Scene Text Detection Based on Two-Branch Feature Extraction.基于双分支特征提取的场景文本检测。
Sensors (Basel). 2022 Aug 20;22(16):6262. doi: 10.3390/s22166262.
7
An Enhancement of Computer Aided Approach for Colon Cancer Detection in WCE Images Using ROI Based Color Histogram and SVM2.基于 ROI 的颜色直方图和 SVM2 的 WCE 图像中结肠癌检测的计算机辅助方法的增强
J Med Syst. 2019 Jan 5;43(2):29. doi: 10.1007/s10916-018-1153-9.
8
Text Extraction from Scene Images by Character Appearance and Structure Modeling.通过字符外观和结构建模从场景图像中提取文本
Comput Vis Image Underst. 2013 Feb 1;117(2):182-194. doi: 10.1016/j.cviu.2012.11.002.
9
Scene text recognition in mobile applications by character descriptor and structure configuration.移动端应用中的场景文字识别:基于字符描述符和结构配置
IEEE Trans Image Process. 2014 Jul;23(7):2972-82. doi: 10.1109/TIP.2014.2317980.
10
Text-based multi-dimensional medical images retrieval according to the features-usage correlation.基于特征-用法相关性的文本型多维医学图像检索。
Med Biol Eng Comput. 2021 Oct;59(10):1993-2017. doi: 10.1007/s11517-021-02392-0. Epub 2021 Aug 20.

引用本文的文献

1
Feature-aware unsupervised lesion segmentation for brain tumor images using fast data density functional transform.基于快速数据密度泛函变换的脑肿瘤图像特征感知无监督病变分割。
Sci Rep. 2023 Aug 21;13(1):13582. doi: 10.1038/s41598-023-40848-5.

本文引用的文献

1
Consistency-Induced Multiview Subspace Clustering.一致性诱导的多视图子空间聚类
IEEE Trans Cybern. 2023 Feb;53(2):832-844. doi: 10.1109/TCYB.2022.3165550. Epub 2023 Jan 13.
2
Soft Subspace Based Ensemble Clustering for Multivariate Time Series Data.基于软子空间的多元时间序列数据集成聚类
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):7761-7774. doi: 10.1109/TNNLS.2022.3146136. Epub 2023 Oct 5.
3
Machine Learning-Based Fast Banknote Serial Number Recognition Using Knowledge Distillation and Bayesian Optimization.
基于机器学习的纸币序列号快速识别方法,结合知识蒸馏和贝叶斯优化。
Sensors (Basel). 2019 Sep 28;19(19):4218. doi: 10.3390/s19194218.
4
Scene text detection via extremal region based double threshold convolutional network classification.基于极值区域的双阈值卷积网络分类的场景文本检测
PLoS One. 2017 Aug 18;12(8):e0182227. doi: 10.1371/journal.pone.0182227. eCollection 2017.
5
Robust Text Detection in Natural Scene Images.自然场景图像中的鲁棒文本检测。
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):970-83. doi: 10.1109/TPAMI.2013.182.
6
A unified framework for multioriented text detection and recognition.多方向文本检测与识别的统一框架。
IEEE Trans Image Process. 2014 Nov;23(11):4737-49. doi: 10.1109/TIP.2014.2353813. Epub 2014 Sep 4.
7
Characterness: an indicator of text in the wild.特性格:野外文本的指示器。
IEEE Trans Image Process. 2014 Apr;23(4):1666-77. doi: 10.1109/TIP.2014.2302896.