度量弯曲文档图像的校正。

Metric rectification of curved document images.

机构信息

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Zhongguancun East Road, No. 95, Haidian District, Beijing 100190, P.R. China.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):707-22. doi: 10.1109/TPAMI.2011.151.

DOI:10.1109/TPAMI.2011.151

PMID:21808093

Abstract

In this paper, we propose a metric rectification method to restore an image from a single camera-captured document image. The core idea is to construct an isometric image mesh by exploiting the geometry of page surface and camera. Our method uses a general cylindrical surface (GCS) to model the curved page shape. Under a few proper assumptions, the printed horizontal text lines are shown to be line convergent symmetric. This property is then used to constrain the estimation of various model parameters under perspective projection. We also introduce a paraperspective projection to approximate the nonlinear perspective projection. A set of close-form formulas is thus derived for the estimate of GCS directrix and document aspect ratio. Our method provides a straightforward framework for image metric rectification. It is insensitive to camera positions, viewing angles, and the shapes of document pages. To evaluate the proposed method, we implemented comprehensive experiments on both synthetic and real-captured images. The results demonstrate the efficiency of our method. We also carried out a comparative experiment on the public CBDAR2007 data set. The experimental results show that our method outperforms the state-of-the-art methods in terms of OCR accuracy and rectification errors.

摘要

在本文中，我们提出了一种度量校正方法，从单相机捕获的文档图像中恢复图像。其核心思想是通过利用页面表面和相机的几何形状来构建等距图像网格。我们的方法使用一般圆柱面（GCS）来建模弯曲的页面形状。在几个适当的假设下，显示打印的水平文本行是线收敛对称的。然后利用该属性约束透视投影下的各种模型参数的估计。我们还引入了一种平行透视投影来近似非线性透视投影。因此，导出了一组用于 GCS 准线和文档纵横比估计的闭式公式。我们的方法为图像度量校正提供了一个直接的框架。它对相机位置、视角和文档页面的形状不敏感。为了评估所提出的方法，我们在合成和真实捕获的图像上进行了全面的实验。结果表明了我们方法的效率。我们还在公共 CBDAR2007 数据集上进行了对比实验。实验结果表明，在 OCR 准确性和校正误差方面，我们的方法优于最先进的方法。

相似文献

Metric rectification of curved document images.度量弯曲文档图像的校正。

IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):707-22. doi: 10.1109/TPAMI.2011.151.

Geometric rectification of camera-captured document images.相机拍摄的文档图像的几何校正

IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):591-605. doi: 10.1109/TPAMI.2007.70724.

Rectification of curved document images based on single view three-dimensional reconstruction.基于单视图三维重建的弯曲文档图像校正

J Opt Soc Am A Opt Image Sci Vis. 2016 Oct 1;33(10):2089-2098. doi: 10.1364/JOSAA.33.002089.

Goal-oriented rectification of camera-based document images.基于目标的相机文档图像校正。

IEEE Trans Image Process. 2011 Apr;20(4):910-20. doi: 10.1109/TIP.2010.2080280. Epub 2010 Sep 27.

Composition of a dewarped and enhanced document image from two view images.从两个视图图像合成去扭曲和增强后的文档图像。

IEEE Trans Image Process. 2009 Jul;18(7):1551-62. doi: 10.1109/TIP.2009.2019301. Epub 2009 May 12.

Baselines Extraction from Curved Document Images via Slope Fields Recovery.通过斜率场恢复从弯曲文档图像中提取基线

IEEE Trans Pattern Anal Mach Intell. 2020 Apr;42(4):793-808. doi: 10.1109/TPAMI.2018.2886900. Epub 2018 Dec 14.

Metric 3D reconstruction and texture acquisition of surfaces of revolution from a single uncalibrated view.基于单幅未校准视图的旋转曲面的度量三维重建与纹理获取。

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):99-114. doi: 10.1109/TPAMI.2005.14.

Text-Line Detection in Camera-Captured Document Images Using the State Estimation of Connected Components.利用连通分量的状态估计在相机拍摄的文档图像中进行文本行检测

IEEE Trans Image Process. 2016 Nov;25(11):5358-5368. doi: 10.1109/TIP.2016.2607418. Epub 2016 Sep 8.

The effect of border noise on the performance of projection-based page segmentation methods.基于投影的页面分割方法中边界噪声的影响。

IEEE Trans Pattern Anal Mach Intell. 2011 Apr;33(4):846-51. doi: 10.1109/TPAMI.2010.194.

An improved physically-based method for geometric restoration of distorted document images.一种改进的基于物理模型的扭曲文档图像几何恢复方法。

IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):728-34. doi: 10.1109/TPAMI.2007.70831.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

度量弯曲文档图像的校正。

Metric rectification of curved document images.

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献