• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

退化历史文档二值化:关于问题、挑战、技术及未来方向的综述

Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions.

作者信息

Sulaiman Alaa, Omar Khairuddin, Nasrudin Mohammad F

机构信息

Pattern Recognition Research Group, Centre for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia.

出版信息

J Imaging. 2019 Apr 12;5(4):48. doi: 10.3390/jimaging5040048.

DOI:10.3390/jimaging5040048
PMID:34460486
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8320943/
Abstract

In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal records, medical reports, music scores, palm leaf, and reports on security-related issues. In particular, ancient and historical documents are hard to read due to their degradation in terms of low contrast and existence of corrupted artefacts. In recent times, degraded document binarization has been studied widely and several approaches were developed to deal with issues and challenges in document binarization. In this paper, a comprehensive review is conducted on the issues and challenges faced during the image binarization process, followed by insights on various methods used for image binarization. This paper also discusses the advanced methods used for the enhancement of degraded documents that improves the quality of documents during the binarization process. Further discussions are made on the effectiveness and robustness of existing methods, and there is still a scope to develop a hybrid approach that can deal with degraded document binarization more effectively.

摘要

在这个数字化时代,大多数纸质文档正在被转换成数字格式。在转换过程中,大量文档通过电子扫描进行存储和保存。这些文档来源广泛,如古代文献、旧法律记录、医疗报告、乐谱、棕榈叶以及与安全相关问题的报告等。特别是古代和历史文档,由于对比度低和存在损坏的伪像而难以阅读。近年来,退化文档二值化受到了广泛研究,并开发了几种方法来处理文档二值化中的问题和挑战。本文对图像二值化过程中面临的问题和挑战进行了全面综述,随后对用于图像二值化的各种方法进行了深入分析。本文还讨论了用于增强退化文档的先进方法,这些方法在二值化过程中提高了文档质量。进一步讨论了现有方法的有效性和鲁棒性,并且仍有开发一种能更有效处理退化文档二值化的混合方法的空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/2f52a42eeea3/jimaging-05-00048-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/8467560e23c1/jimaging-05-00048-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/b1c57ee9d0da/jimaging-05-00048-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/7385f4481277/jimaging-05-00048-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/919b992e80f9/jimaging-05-00048-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/e8160404c82b/jimaging-05-00048-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/28ce59bb7198/jimaging-05-00048-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/324c232238c9/jimaging-05-00048-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/cc351b99675e/jimaging-05-00048-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/6e900b57cf01/jimaging-05-00048-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/f26b80fdb057/jimaging-05-00048-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/9e2d1101fa77/jimaging-05-00048-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/899c6640d924/jimaging-05-00048-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/17b9dc9dbf28/jimaging-05-00048-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/62c8bfaed2b8/jimaging-05-00048-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/e76ef1241b1b/jimaging-05-00048-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/2f52a42eeea3/jimaging-05-00048-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/8467560e23c1/jimaging-05-00048-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/b1c57ee9d0da/jimaging-05-00048-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/7385f4481277/jimaging-05-00048-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/919b992e80f9/jimaging-05-00048-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/e8160404c82b/jimaging-05-00048-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/28ce59bb7198/jimaging-05-00048-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/324c232238c9/jimaging-05-00048-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/cc351b99675e/jimaging-05-00048-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/6e900b57cf01/jimaging-05-00048-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/f26b80fdb057/jimaging-05-00048-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/9e2d1101fa77/jimaging-05-00048-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/899c6640d924/jimaging-05-00048-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/17b9dc9dbf28/jimaging-05-00048-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/62c8bfaed2b8/jimaging-05-00048-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/e76ef1241b1b/jimaging-05-00048-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c8e/8320943/2f52a42eeea3/jimaging-05-00048-g016.jpg

相似文献

1
Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions.退化历史文档二值化:关于问题、挑战、技术及未来方向的综述
J Imaging. 2019 Apr 12;5(4):48. doi: 10.3390/jimaging5040048.
2
Using Paper Texture for Choosing a Suitable Algorithm for Scanned Document Image Binarization.利用纸张纹理选择适用于扫描文档图像二值化的算法。
J Imaging. 2022 Oct 5;8(10):272. doi: 10.3390/jimaging8100272.
3
Effective and fast binarization method for combined degradation on ancient documents.针对古代文献综合降解的有效快速二值化方法。
Heliyon. 2019 Oct 22;5(10):e02613. doi: 10.1016/j.heliyon.2019.e02613. eCollection 2019 Oct.
4
Phase-based binarization of ancient document images: model and applications.基于相位的古文献图像二值化:模型与应用。
IEEE Trans Image Process. 2014 Jul;23(7):2916-30. doi: 10.1109/TIP.2014.2322451. Epub 2014 May 7.
5
DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement.DE-GAN:一种用于文档增强的条件生成对抗网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1180-1191. doi: 10.1109/TPAMI.2020.3022406. Epub 2022 Feb 3.
6
Binarization of color document images via luminance and saturation color features.基于亮度和饱和度颜色特征的彩色文档图像二值化
IEEE Trans Image Process. 2002;11(4):434-51. doi: 10.1109/TIP.2002.999677.
7
HMPLMD: Handwritten Malayalam palm leaf manuscript dataset.HMPLMD:马拉雅拉姆语手写棕榈叶手稿数据集。
Data Brief. 2023 Feb 8;47:108960. doi: 10.1016/j.dib.2023.108960. eCollection 2023 Apr.
8
Robust document image binarization technique for degraded document images.用于退化文档图像的健壮文档图像二值化技术。
IEEE Trans Image Process. 2013 Apr;22(4):1408-17. doi: 10.1109/TIP.2012.2231089. Epub 2012 Dec 3.
9
Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition.用于数字字符识别的非均匀光照文档图像的鲁棒联合二值化方法。
Sensors (Basel). 2020 May 21;20(10):2914. doi: 10.3390/s20102914.
10
A Quality, Size and Time Assessment of the Binarization of Documents Photographed by Smartphones.智能手机拍摄文档二值化的质量、尺寸和时间评估
J Imaging. 2023 Feb 13;9(2):41. doi: 10.3390/jimaging9020041.

引用本文的文献

1
A Comprehensive Review on Document Image Binarization.文档图像二值化的全面综述
J Imaging. 2025 Apr 26;11(5):133. doi: 10.3390/jimaging11050133.
2
Historical insights at scale: A corpus-wide machine learning analysis of early modern astronomic tables.大规模历史洞察:对早期现代天文表的全语料库机器学习分析
Sci Adv. 2024 Oct 25;10(43):eadj1719. doi: 10.1126/sciadv.adj1719. Epub 2024 Oct 23.
3
DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents.DCNet:用于古代文献退化分类的抗噪声卷积神经网络。

本文引用的文献

1
Retrospective illumination correction of retinal images.视网膜图像的回顾性光照校正。
Int J Biomed Imaging. 2010;2010:780262. doi: 10.1155/2010/780262. Epub 2010 Jul 4.
2
Improving image quality in poor visibility conditions using a physical model for contrast degradation.利用对比降质的物理模型改善低能见度条件下的图像质量。
IEEE Trans Image Process. 1998;7(2):167-79. doi: 10.1109/83.660994.
J Imaging. 2021 Jul 12;7(7):114. doi: 10.3390/jimaging7070114.
4
A Two-Stage Automatic Color Thresholding Technique.一种两阶段自动颜色阈值技术。
Sensors (Basel). 2023 Mar 22;23(6):3361. doi: 10.3390/s23063361.
5
What Binarization Method Is the Best for Amplitude Inline Fresnel Holograms Synthesized for Divergent Beams Using the Direct Search with Random Trajectory Technique?对于使用随机轨迹技术直接搜索为发散光束合成的振幅内联菲涅耳全息图,哪种二值化方法是最佳的?
J Imaging. 2023 Jan 27;9(2):28. doi: 10.3390/jimaging9020028.
6
Adaptive Digital Hologram Binarization Method Based on Local Thresholding, Block Division and Error Diffusion.基于局部阈值处理、分块划分和误差扩散的自适应数字全息图二值化方法
J Imaging. 2022 Jan 18;8(2):15. doi: 10.3390/jimaging8020015.
7
Identification of QR Code Perspective Distortion Based on Edge Directions and Edge Projections Analysis.基于边缘方向和边缘投影分析的二维码透视失真识别
J Imaging. 2020 Jul 10;6(7):67. doi: 10.3390/jimaging6070067.
8
CleanPage: Fast and Clean Document and Whiteboard Capture.CleanPage:快速且清晰的文档和白板捕捉工具。
J Imaging. 2020 Oct 1;6(10):102. doi: 10.3390/jimaging6100102.
9
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes.使用局部熵滤波图像预处理改进图像二值化方法用于字母数字字符识别目的。
Entropy (Basel). 2019 Jun 4;21(6):562. doi: 10.3390/e21060562.
10
Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition.用于数字字符识别的非均匀光照文档图像的鲁棒联合二值化方法。
Sensors (Basel). 2020 May 21;20(10):2914. doi: 10.3390/s20102914.