• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相机拍摄的文档图像的几何校正

Geometric rectification of camera-captured document images.

作者信息

Liang Jian, DeMenthon Daniel, Doermann David

机构信息

Amazon.com, 701 5th Avenue #614.B, Seattle, WA 98104, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):591-605. doi: 10.1109/TPAMI.2007.70724.

DOI:10.1109/TPAMI.2007.70724
PMID:18276966
Abstract

Compared to typical scanners, handheld cameras offer convenient, flexible, portable, and non-contact image capture, which enables many new applications and breathes new life into existing ones. However, camera-captured documents may suffer from distortions caused by non-planar document shape and perspective projection, which lead to failure of current OCR technologies. We present a geometric rectification framework for restoring the frontal-flat view of a document from a single camera-captured image. Our approach estimates 3D document shape from texture flow information obtained directly from the image without requiring additional 3D/metric data or prior camera calibration. Our framework provides a unified solution for both planar and curved documents and can be applied in many, especially mobile, camera-based document analysis applications. Experiments show that our method produces results that are significantly more OCR compatible than the original images.

摘要

与传统扫描仪相比,手持相机提供了便捷、灵活、便携且非接触式的图像捕捉方式,这催生了许多新应用,并为现有应用注入了新活力。然而,相机拍摄的文档可能会因文档形状非平面和透视投影而产生失真,这导致当前的光学字符识别(OCR)技术失效。我们提出了一种几何校正框架,用于从单个相机拍摄的图像中恢复文档的正面平视视图。我们的方法直接从图像中获取纹理流信息来估计三维文档形状,无需额外的三维/度量数据或预先进行相机校准。我们的框架为平面和曲面文档提供了统一的解决方案,并且可应用于许多基于相机的文档分析应用,尤其是移动应用。实验表明,我们的方法所产生的结果比原始图像与OCR的兼容性显著更高。

相似文献

1
Geometric rectification of camera-captured document images.相机拍摄的文档图像的几何校正
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):591-605. doi: 10.1109/TPAMI.2007.70724.
2
An improved physically-based method for geometric restoration of distorted document images.一种改进的基于物理模型的扭曲文档图像几何恢复方法。
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):728-34. doi: 10.1109/TPAMI.2007.70831.
3
Goal-oriented rectification of camera-based document images.基于目标的相机文档图像校正。
IEEE Trans Image Process. 2011 Apr;20(4):910-20. doi: 10.1109/TIP.2010.2080280. Epub 2010 Sep 27.
4
Restoring warped document images through 3D shape modeling.通过三维形状建模恢复扭曲的文档图像。
IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):195-208. doi: 10.1109/TPAMI.2006.40.
5
Image restoration of arbitrarily warped documents.任意扭曲文档的图像恢复。
IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1295-306. doi: 10.1109/TPAMI.2004.87.
6
Metric 3D reconstruction and texture acquisition of surfaces of revolution from a single uncalibrated view.基于单幅未校准视图的旋转曲面的度量三维重建与纹理获取。
IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):99-114. doi: 10.1109/TPAMI.2005.14.
7
Restoring 2D content from distorted documents.从失真文档中恢复二维内容。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1904-16. doi: 10.1109/TPAMI.2007.1118.
8
Camera calibration from images of spheres.基于球体图像的相机校准。
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):499-503. doi: 10.1109/TPAMI.2007.45.
9
High-accuracy and robust localization of large control markers for geometric camera calibration.用于几何相机校准的大型控制标记的高精度稳健定位。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):376-83. doi: 10.1109/TPAMI.2008.214.
10
Texture for script identification.用于脚本识别的纹理。
IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1720-32. doi: 10.1109/TPAMI.2005.227.

引用本文的文献

1
Text Detection in Natural Scene Images by Stroke Gabor Words.基于笔画Gabor词的自然场景图像文本检测
Proc Int Conf Doc Anal Recognit. 2011;2011:177-181. doi: 10.1109/ICDAR.2011.44.
2
Text string detection from natural scenes by structure-based partition and grouping.基于结构划分和分组的自然场景文本字符串检测。
IEEE Trans Image Process. 2011 Sep;20(9):2594-605. doi: 10.1109/TIP.2011.2126586. Epub 2011 Mar 14.