文档图像二值化的全面综述 - Suppr | 超能文献

文档图像二值化的全面综述

A Comprehensive Review on Document Image Binarization.

作者信息

Bataineh Bilal, Tounsi Mohamed, Zamzami Nuha, Janbi Jehan, Abu-Ain Waleed Abdel Karim, AbuAin Tarik, Elnazer Shaima

机构信息

Software Engineering Department, Faculty of Science and Information Technology, Irbid National University, Irbid 21110, Jordan.

Software Engineering Department, College of Computing, Umm Al-Qura University, Mecca 21955, Saudi Arabia.

出版信息

J Imaging. 2025 Apr 26;11(5):133. doi: 10.3390/jimaging11050133.

DOI:10.3390/jimaging11050133

PMID:40422990

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12112497/

Abstract

In today's digital age, the conversion of hardcopy documents into digital formats is widespread. This process involves electronically scanning and storing large volumes of documents. These documents come from various sources, including records and reports, camera-captured text and screen snapshots, official documents, newspapers, medical reports, music scores, and more. In the domain of document analysis techniques, an essential step is document image binarization. Its goal is to eliminate unnecessary data from images and preserve only the text. Despite the existence of multiple techniques for binarization, the presence of degradation in document images can hinder their efficacy. The objective of this work is to provide an extensive review and analysis of the document binarization field, emphasizing its importance and addressing the challenges encountered during the image binarization process. Additionally, it provides insights into techniques and methods employed for image binarization. The current paper also introduces benchmark datasets for evaluating binarization accuracy, model training, evaluation metrics, and the effectiveness of recent methods.

摘要

在当今数字时代，将硬拷贝文档转换为数字格式的做法十分普遍。这一过程涉及对大量文档进行电子扫描和存储。这些文档来源各异，包括记录与报告、相机捕捉的文本和屏幕截图、官方文件、报纸、医学报告、乐谱等等。在文档分析技术领域，文档图像二值化是关键步骤。其目标是从图像中去除不必要的数据，仅保留文本。尽管存在多种二值化技术，但文档图像的质量下降会影响其效果。本文的目的是对文档二值化领域进行全面综述与分析，强调其重要性，并探讨图像二值化过程中遇到的挑战。此外，还介绍了用于图像二值化的技术和方法。本文还引入了用于评估二值化准确性、模型训练、评估指标以及近期方法有效性的基准数据集。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

文档图像二值化的全面综述

A Comprehensive Review on Document Image Binarization.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文档图像二值化的全面综述

A Comprehensive Review on Document Image Binarization.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献