• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于目标的相机文档图像校正。

Goal-oriented rectification of camera-based document images.

机构信息

Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece.

出版信息

IEEE Trans Image Process. 2011 Apr;20(4):910-20. doi: 10.1109/TIP.2010.2080280. Epub 2010 Sep 27.

DOI:10.1109/TIP.2010.2080280
PMID:20876019
Abstract

Document digitization with either flatbed scanners or camera-based systems results in document images which often suffer from warping and perspective distortions that deteriorate the performance of current OCR approaches. In this paper, we present a goal-oriented rectification methodology to compensate for undesirable document image distortions aiming to improve the OCR result. Our approach relies upon a coarse-to-fine strategy. First, a coarse rectification is accomplished with the aid of a computationally low cost transformation which addresses the projection of a curved surface to a 2-D rectangular area. The projection of the curved surface on the plane is guided only by the textual content's appearance in the document image while incorporating a transformation which does not depend on specific model primitives or camera setup parameters. Second, pose normalization is applied on the word level aiming to restore all the local distortions of the document image. Experimental results on various document images with a variety of distortions demonstrate the robustness and effectiveness of the proposed rectification methodology using a consistent evaluation methodology that encounters OCR accuracy and a newly introduced measure using a semi-automatic procedure.

摘要

文档的数字化无论是使用平板扫描仪还是基于摄像头的系统,都会导致文档图像产生扭曲和透视变形等问题,从而降低当前光学字符识别(OCR)方法的性能。在本文中,我们提出了一种面向目标的校正方法,以补偿文档图像的不良变形,从而提高 OCR 结果的质量。我们的方法依赖于一种从粗到精的策略。首先,借助一种计算成本低的变换来完成粗略校正,该变换解决了将曲面投影到 2D 矩形区域的问题。曲面在平面上的投影仅由文档图像中文本内容的外观引导,同时采用一种不依赖于特定模型基元或相机设置参数的变换。其次,在单词级别应用位姿归一化,以恢复文档图像的所有局部变形。在各种具有不同变形的文档图像上进行的实验结果表明,所提出的校正方法具有很强的鲁棒性和有效性,并且使用了一致的评估方法来衡量 OCR 精度和使用半自动过程引入的新度量。

相似文献

1
Goal-oriented rectification of camera-based document images.基于目标的相机文档图像校正。
IEEE Trans Image Process. 2011 Apr;20(4):910-20. doi: 10.1109/TIP.2010.2080280. Epub 2010 Sep 27.
2
Geometric rectification of camera-captured document images.相机拍摄的文档图像的几何校正
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):591-605. doi: 10.1109/TPAMI.2007.70724.
3
An improved physically-based method for geometric restoration of distorted document images.一种改进的基于物理模型的扭曲文档图像几何恢复方法。
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):728-34. doi: 10.1109/TPAMI.2007.70831.
4
Restoring warped document images through 3D shape modeling.通过三维形状建模恢复扭曲的文档图像。
IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):195-208. doi: 10.1109/TPAMI.2006.40.
5
Texture for script identification.用于脚本识别的纹理。
IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1720-32. doi: 10.1109/TPAMI.2005.227.
6
Document image retrieval through word shape coding.通过单词形状编码进行文档图像检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1913-8. doi: 10.1109/TPAMI.2008.89.
7
Font adaptive word indexing of modern printed documents.现代印刷文档的字体自适应词索引
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1187-99. doi: 10.1109/TPAMI.2006.162.
8
A comparative study of staff removal algorithms.员工移除算法的比较研究。
IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):753-66. doi: 10.1109/TPAMI.2007.70749.
9
Restoring 2D content from distorted documents.从失真文档中恢复二维内容。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1904-16. doi: 10.1109/TPAMI.2007.1118.
10
Script and language identification in noisy and degraded document images.嘈杂且退化的文档图像中的脚本和语言识别
IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):14-24. doi: 10.1109/TPAMI.2007.1158.