Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes.

Authors

Michalak Hubert, Okarma Krzysztof

Affiliation

Faculty of Electrical Engineering, West Pomeranian University of Technology, Szczecin, 70-313 Szczecin, Poland.

Publication

Entropy (Basel). 2019 Jun 4;21(6):562. doi: 10.3390/e21060562.

DOI:10.3390/e21060562
PMID:33267276
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7515051/
Abstract

Automatic text recognition from the natural images acquired in uncontrolled lighting conditions is a challenging task due to the presence of shadows hindering the shape analysis and classification of individual characters. Since the optical character recognition methods require prior image binarization, the application of classical global thresholding methods in such case makes it impossible to preserve the visibility of all characters. Nevertheless, the use of adaptive binarization does not always lead to satisfactory results for heavily unevenly illuminated document images. In this paper, the image preprocessing methodology with the use of local image entropy filtering is proposed, allowing for the improvement of various commonly used image thresholding methods, which can be useful also for text recognition purposes. The proposed approach was verified using a dataset of 140 differently illuminated document images subjected to further text recognition. Experimental results, expressed as Levenshtein distances and F-Measure values for obtained text strings, are promising and confirm the usefulness of the proposed approach.
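
To make the two technical ingredients named in the abstract more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a grayscale image stored as a list of rows, a square window clipped at the image border, and Shannon entropy in bits; the function names `local_entropy` and `levenshtein`, the `radius` parameter, and the toy image are all illustrative assumptions. The first function computes a local entropy map, the second the Levenshtein (edit) distance that the abstract uses to compare recognized text with the expected text. How the paper actually combines the entropy map with thresholding is not stated in the abstract and is not reproduced here.

```python
import math
from collections import Counter


def local_entropy(image, radius=1):
    """Shannon entropy (in bits) of the gray levels inside a
    (2*radius+1) x (2*radius+1) window centred on each pixel.
    Windows are clipped at the image border (an assumption of this sketch)."""
    height, width = len(image), len(image[0])
    result = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            window = [
                image[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            n = len(window)
            # H = sum over gray levels of p * log2(1 / p), with p = count / n
            result[y][x] = sum(
                (c / n) * math.log2(n / c) for c in Counter(window).values()
            )
    return result


def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        previous = current
    return previous[-1]


# Toy image: bright paper on the left, shadowed paper on the right,
# with one dark "ink" pixel in each half.
image = [
    [200] * 5 + [90] * 5,
    [200, 40, 200, 200, 200, 90, 90, 90, 10, 90],
    [200] * 5 + [90] * 5,
]
entropy_map = local_entropy(image)
```

In this toy image a single global threshold between 90 and 200 would classify the whole shadowed half as ink. The entropy map, by contrast, is zero over flat paper in both halves and takes the same positive value around each ink pixel, because entropy depends on the local distribution of gray levels and not on their absolute brightness. Note that the shadow boundary itself also produces nonzero entropy, so an entropy map alone is not a binarization.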

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/5afe06e72324/entropy-21-00562-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/2568d0e15b49/entropy-21-00562-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/dcd081a68ea6/entropy-21-00562-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/00e4557807d9/entropy-21-00562-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/44d1837fd3a6/entropy-21-00562-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/03830d48edc7/entropy-21-00562-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/1e45965a3d49/entropy-21-00562-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/752a4472df09/entropy-21-00562-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/7515051/4de3c2ee01f7/entropy-21-00562-g009.jpg

Similar articles

1
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes.
Entropy (Basel). 2019 Jun 4;21(6):562. doi: 10.3390/e21060562.
2
Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition.
Sensors (Basel). 2020 May 21;20(10):2914. doi: 10.3390/s20102914.
3
Robust table recognition for printed document images.
Math Biosci Eng. 2020 Apr 23;17(4):3203-3223. doi: 10.3934/mbe.2020182.
4
Adaptive, quadratic preprocessing of document images for binarization.
IEEE Trans Image Process. 1998;7(7):992-9. doi: 10.1109/83.701155.
5
Robust document image binarization technique for degraded document images.
IEEE Trans Image Process. 2013 Apr;22(4):1408-17. doi: 10.1109/TIP.2012.2231089. Epub 2012 Dec 3.
6
Structure similarity-guided image binarization for automatic segmentation of epidermis surface microstructure images.
J Microsc. 2017 May;266(2):153-165. doi: 10.1111/jmi.12525. Epub 2017 Jan 24.
7
Influence of Color-to-Gray Conversion on the Performance of Document Image Binarization: Toward a Novel Optimization Problem.
IEEE Trans Image Process. 2015 Nov;24(11):3637-51. doi: 10.1109/TIP.2015.2442923. Epub 2015 Jun 9.
8
Binarization of color document images via luminance and saturation color features.
IEEE Trans Image Process. 2002;11(4):434-51. doi: 10.1109/TIP.2002.999677.
9
Binarization of ESPI fringe patterns based on local entropy.
Opt Express. 2019 Oct 28;27(22):32378-32391. doi: 10.1364/OE.27.032378.
10
Effective and fast binarization method for combined degradation on ancient documents.
Heliyon. 2019 Oct 22;5(10):e02613. doi: 10.1016/j.heliyon.2019.e02613. eCollection 2019 Oct.

Cited by

1
A Comprehensive Review on Document Image Binarization.
J Imaging. 2025 Apr 26;11(5):133. doi: 10.3390/jimaging11050133.
2
A computer vision model for the identification and scoring of calcium in aortic valve stenosis: a single-center experience.
Cardiovasc Diagn Ther. 2024 Dec 31;14(6):1029-1037. doi: 10.21037/cdt-24-179. Epub 2024 Dec 16.
3
A Quality, Size and Time Assessment of the Binarization of Documents Photographed by Smartphones.
J Imaging. 2023 Feb 13;9(2):41. doi: 10.3390/jimaging9020041.
4
Using Paper Texture for Choosing a Suitable Algorithm for Scanned Document Image Binarization.
J Imaging. 2022 Oct 5;8(10):272. doi: 10.3390/jimaging8100272.
5
CleanPage: Fast and Clean Document and Whiteboard Capture.
J Imaging. 2020 Oct 1;6(10):102. doi: 10.3390/jimaging6100102.
6
Brain Asymmetry Detection and Machine Learning Classification for Diagnosis of Early Dementia.
Sensors (Basel). 2021 Jan 24;21(3):778. doi: 10.3390/s21030778.
7
Entropy in Image Analysis II.
Entropy (Basel). 2020 Aug 15;22(8):898. doi: 10.3390/e22080898.
8
DICOM segmentation and STL creation for 3D printing: a process and software package comparison for osseous anatomy.
3D Print Med. 2020 Jul 31;6(1):17. doi: 10.1186/s41205-020-00069-2.
9
Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition.
Sensors (Basel). 2020 May 21;20(10):2914. doi: 10.3390/s20102914.

References

1
Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions.
J Imaging. 2019 Apr 12;5(4):48. doi: 10.3390/jimaging5040048.
2
Robust document image binarization technique for degraded document images.
IEEE Trans Image Process. 2013 Apr;22(4):1408-17. doi: 10.1109/TIP.2012.2231089. Epub 2012 Dec 3.
3
Performance evaluation methodology for historical document image binarization.
IEEE Trans Image Process. 2013 Feb;22(2):595-609. doi: 10.1109/TIP.2012.2219550. Epub 2012 Sep 18.