Bataineh Bilal, Tounsi Mohamed, Zamzami Nuha, Janbi Jehan, Abu-Ain Waleed Abdel Karim, AbuAin Tarik, Elnazer Shaima
Software Engineering Department, Faculty of Science and Information Technology, Irbid National University, Irbid 21110, Jordan.
Software Engineering Department, College of Computing, Umm Al-Qura University, Mecca 21955, Saudi Arabia.
J Imaging. 2025 Apr 26;11(5):133. doi: 10.3390/jimaging11050133.
In today's digital age, the conversion of hardcopy documents into digital formats is widespread. This process involves electronically scanning and storing large volumes of documents. These documents come from various sources, including records and reports, camera-captured text and screen snapshots, official documents, newspapers, medical reports, music scores, and more. In the domain of document analysis techniques, an essential step is document image binarization. Its goal is to eliminate unnecessary data from images and preserve only the text. Despite the existence of multiple techniques for binarization, the presence of degradation in document images can hinder their efficacy. The objective of this work is to provide an extensive review and analysis of the document binarization field, emphasizing its importance and addressing the challenges encountered during the image binarization process. Additionally, it provides insights into techniques and methods employed for image binarization. The current paper also introduces benchmark datasets for evaluating binarization accuracy, model training, evaluation metrics, and the effectiveness of recent methods.
在当今数字时代,将硬拷贝文档转换为数字格式的做法十分普遍。这一过程涉及对大量文档进行电子扫描和存储。这些文档来源各异,包括记录与报告、相机捕捉的文本和屏幕截图、官方文件、报纸、医学报告、乐谱等等。在文档分析技术领域,文档图像二值化是关键步骤。其目标是从图像中去除不必要的数据,仅保留文本。尽管存在多种二值化技术,但文档图像的质量下降会影响其效果。本文的目的是对文档二值化领域进行全面综述与分析,强调其重要性,并探讨图像二值化过程中遇到的挑战。此外,还介绍了用于图像二值化的技术和方法。本文还引入了用于评估二值化准确性、模型训练、评估指标以及近期方法有效性的基准数据集。