• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

历史文档图像二值化的性能评估方法。

Performance evaluation methodology for historical document image binarization.

机构信息

Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece.

出版信息

IEEE Trans Image Process. 2013 Feb;22(2):595-609. doi: 10.1109/TIP.2012.2219550. Epub 2012 Sep 18.

DOI:10.1109/TIP.2012.2219550
PMID:23008259
Abstract

Document image binarization is of great importance in the document image analysis and recognition pipeline since it affects further stages of the recognition process. The evaluation of a binarization method aids in studying its algorithmic behavior, as well as verifying its effectiveness, by providing qualitative and quantitative indication of its performance. This paper addresses a pixel-based binarization evaluation methodology for historical handwritten/machine-printed document images. In the proposed evaluation scheme, the recall and precision evaluation measures are properly modified using a weighting scheme that diminishes any potential evaluation bias. Additional performance metrics of the proposed evaluation scheme consist of the percentage rates of broken and missed text, false alarms, background noise, character enlargement, and merging. Several experiments conducted in comparison with other pixel-based evaluation measures demonstrate the validity of the proposed evaluation scheme.

摘要

文档图像二值化在文档图像分析和识别管道中非常重要,因为它会影响识别过程的后续阶段。二值化方法的评估通过提供其性能的定性和定量指示,有助于研究其算法行为以及验证其有效性。本文提出了一种基于像素的历史手写/机器印刷文档图像二值化评估方法。在提出的评估方案中,使用加权方案适当修改了召回率和精度评估措施,从而减少了任何潜在的评估偏差。所提出的评估方案的其他性能指标包括断字和漏字、误报、背景噪声、字符放大和合并的百分比。与其他基于像素的评估方法进行的几次实验证明了所提出的评估方案的有效性。

相似文献

1
Performance evaluation methodology for historical document image binarization.历史文档图像二值化的性能评估方法。
IEEE Trans Image Process. 2013 Feb;22(2):595-609. doi: 10.1109/TIP.2012.2219550. Epub 2012 Sep 18.
2
Script-independent text line segmentation in freestyle handwritten documents.自由手写文档中与脚本无关的文本行分割
IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1313-29. doi: 10.1109/TPAMI.2007.70792.
3
Texture for script identification.用于脚本识别的纹理。
IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1720-32. doi: 10.1109/TPAMI.2005.227.
4
Machine printed text and handwriting identification in noisy document images.噪声文档图像中的机器打印文本和手写识别。
IEEE Trans Pattern Anal Mach Intell. 2004 Mar;26(3):337-53. doi: 10.1109/TPAMI.2004.1262324.
5
An approach to offline handwritten Chinese character recognition based on segment evaluation of adaptive duration.一种基于自适应时长片段评估的离线手写汉字识别方法。
J Zhejiang Univ Sci. 2004 Nov;5(11):1392-7. doi: 10.1631/jzus.2004.1392.
6
Utilization of hierarchical, stochastic relationship modeling for Hangul character recognition.用于韩文文字识别的分层随机关系建模的应用
IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1185-96. doi: 10.1109/TPAMI.2004.74.
7
Artificial neural networks for document analysis and recognition.用于文档分析与识别的人工神经网络。
IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):23-35. doi: 10.1109/TPAMI.2005.4.
8
Signature detection and matching for document image retrieval.用于文档图像检索的签名检测与匹配。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2015-31. doi: 10.1109/TPAMI.2008.237.
9
A scale space approach for automatically segmenting words from historical handwritten documents.一种用于从历史手写文档中自动分割单词的尺度空间方法。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1212-25. doi: 10.1109/TPAMI.2005.150.
10
Goal-oriented rectification of camera-based document images.基于目标的相机文档图像校正。
IEEE Trans Image Process. 2011 Apr;20(4):910-20. doi: 10.1109/TIP.2010.2080280. Epub 2010 Sep 27.

引用本文的文献

1
Minimizing Bleed-Through Effect in Medieval Manuscripts with Machine Learning and Robust Statistics.利用机器学习和稳健统计方法减少中世纪手稿中的渗色效应
J Imaging. 2025 Apr 28;11(5):136. doi: 10.3390/jimaging11050136.
2
High spatiotemporal mapping of cortical blood flow velocity with an enhanced accuracy.具有更高精度的皮质血流速度的高时空映射。
Biomed Opt Express. 2024 Mar 15;15(4):2419-2432. doi: 10.1364/BOE.520886. eCollection 2024 Apr 1.
3
Using Paper Texture for Choosing a Suitable Algorithm for Scanned Document Image Binarization.
利用纸张纹理选择适用于扫描文档图像二值化的算法。
J Imaging. 2022 Oct 5;8(10):272. doi: 10.3390/jimaging8100272.
4
Spectrum decomposition in Gaussian scale space for uneven illumination image binarization.高斯尺度空间中的谱分解用于不均匀光照图像二值化。
PLoS One. 2021 Apr 30;16(4):e0251014. doi: 10.1371/journal.pone.0251014. eCollection 2021.
5
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes.使用局部熵滤波图像预处理改进图像二值化方法用于字母数字字符识别目的。
Entropy (Basel). 2019 Jun 4;21(6):562. doi: 10.3390/e21060562.
6
Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition.用于数字字符识别的非均匀光照文档图像的鲁棒联合二值化方法。
Sensors (Basel). 2020 May 21;20(10):2914. doi: 10.3390/s20102914.