
On the recognition of printed characters of any font and size.

Affiliations

Department of Computer Science, University of Washington, Seattle, WA 98195; AT&T Bell Laboratories, Murray Hill, NJ 07974.

Publication information

IEEE Trans Pattern Anal Mach Intell. 1987 Feb;9(2):274-88. doi: 10.1109/tpami.1987.4767901.

DOI:10.1109/tpami.1987.4767901
PMID:21869397
Abstract

We describe the current state of a system that recognizes printed text of various fonts and sizes for the Roman alphabet. The system combines several techniques in order to improve the overall recognition rate. Thinning and shape extraction are performed directly on a graph of the run-length encoding of a binary image. The resulting strokes and other shapes are mapped, using a shape-clustering approach, into binary features which are then fed into a statistical Bayesian classifier. Large-scale trials have shown better than 97 percent top choice correct performance on mixtures of six dissimilar fonts, and over 99 percent on most single fonts, over a range of point sizes. Certain remaining confusion classes are disambiguated through contour analysis, and characters suspected of being merged are broken and reclassified. Finally, layout and linguistic context are applied. The results are illustrated by sample pages.
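The abstract states that thinning and shape extraction operate on a graph of the run-length encoding of a binary image, but it does not give the data structure. As a minimal sketch only, and not the authors' implementation, the following shows one common way to build such a graph: each horizontal run of foreground pixels becomes a node, and runs on adjacent rows are linked when their column ranges overlap. The names `run_length_encode` and `build_run_graph`, the `(row, start, end)` run layout, and the overlap rule are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# A run is (row, start_col, end_col_exclusive). This layout is an assumption.
Run = Tuple[int, int, int]


def run_length_encode(image: List[List[int]]) -> List[Run]:
    """Return the horizontal runs of foreground (1) pixels, row by row."""
    runs: List[Run] = []
    for r, row in enumerate(image):
        c = 0
        while c < len(row):
            if row[c]:
                start = c
                while c < len(row) and row[c]:
                    c += 1
                runs.append((r, start, c))
            else:
                c += 1
    return runs


def build_run_graph(runs: List[Run]) -> Dict[int, List[int]]:
    """Link runs on adjacent rows whose column ranges share at least one column."""
    graph: Dict[int, List[int]] = {i: [] for i in range(len(runs))}
    by_row: Dict[int, List[int]] = {}
    for i, (r, _, _) in enumerate(runs):
        by_row.setdefault(r, []).append(i)
    for i, (r, s, e) in enumerate(runs):
        for j in by_row.get(r + 1, []):
            _, s2, e2 = runs[j]
            if s < e2 and s2 < e:  # half-open intervals overlap
                graph[i].append(j)
                graph[j].append(i)
    return graph


# A tiny "T" shape: one wide run on top, two single-pixel runs below it.
image = [
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]
runs = run_length_encode(image)
graph = build_run_graph(runs)
print(runs)   # [(0, 0, 5), (1, 2, 3), (2, 2, 3)]
print(graph)  # {0: [1], 1: [0, 2], 2: [1]}
```

In a graph like this, a chain of short, vertically stacked runs (nodes 1 and 2 here) is a natural candidate for a vertical stroke, while a single long run suggests a horizontal one. How the paper actually derives strokes from the graph is not specified in the abstract.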

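The abstract also says that shapes are mapped to binary features and fed into a statistical Bayesian classifier, without specifying the model. One simple model that fits that description is a Bernoulli naive Bayes classifier, sketched below under that assumption. The three feature meanings (vertical stroke, closed loop, horizontal bar), the toy training data, and the add-one smoothing are hypothetical and are not taken from the paper.

```python
import math
from typing import Dict, List, Sequence, Tuple

Model = Dict[str, Tuple[float, List[float]]]  # label -> (log prior, P(feature=1))


def train(samples: Sequence[Sequence[int]], labels: Sequence[str]) -> Model:
    """Estimate per-class feature probabilities with add-one smoothing."""
    n_features = len(samples[0])
    counts: Dict[str, int] = {}
    ones: Dict[str, List[int]] = {}
    for x, y in zip(samples, labels):
        counts[y] = counts.get(y, 0) + 1
        acc = ones.setdefault(y, [0] * n_features)
        for k, bit in enumerate(x):
            acc[k] += bit
    total = len(samples)
    model: Model = {}
    for y, n in counts.items():
        probs = [(ones[y][k] + 1) / (n + 2) for k in range(n_features)]
        model[y] = (math.log(n / total), probs)
    return model


def classify(model: Model, x: Sequence[int]) -> str:
    """Return the label with the highest posterior log score (the top choice)."""
    def score(y: str) -> float:
        log_prior, probs = model[y]
        return log_prior + sum(
            math.log(p if bit else 1.0 - p) for bit, p in zip(x, probs)
        )
    return max(model, key=score)


# Hypothetical features: [vertical stroke, closed loop, horizontal bar]
samples = [[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 1, 0], [1, 0, 1], [1, 0, 1]]
labels = ["l", "l", "o", "o", "t", "t"]
model = train(samples, labels)
print(classify(model, [1, 0, 1]))  # t
```

Ranking all labels by `score` rather than taking only the maximum would give the list of candidates that later stages, such as the contour analysis and linguistic context mentioned in the abstract, could use to resolve confusions.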

Similar articles

1
On the recognition of printed characters of any font and size.
IEEE Trans Pattern Anal Mach Intell. 1987 Feb;9(2):274-88. doi: 10.1109/tpami.1987.4767901.
2
Towards a standardisation of reading charts: Font effects on reading performance-Times New Roman with serifs versus the sans serif font Helvetica.
Ophthalmic Physiol Opt. 2022 Nov;42(6):1180-1186. doi: 10.1111/opo.13039. Epub 2022 Aug 16.
3
Automatic Generation of Typographic Font From Small Font Subset.
IEEE Comput Graph Appl. 2020 Jan-Feb;40(1):99-111. doi: 10.1109/MCG.2019.2931431. Epub 2019 Jul 31.
4
Selection of the optimum font type and size interface for on screen continuous reading by young adults: an ergonomic approach.
J Hum Ergol (Tokyo). 2011 Dec;40(1-2):47-62.
5
Font tuning associated with expertise in letter perception.
Perception. 2006;35(4):541-59. doi: 10.1068/p5313.
6
Applying image descriptors to the assessment of legibility in Chinese characters.
Ergonomics. 2003 Jun 20;46(8):825-41. doi: 10.1080/0014013031000109214.
7
Can very small font size enhance memory?
Mem Cognit. 2018 Aug;46(6):979-993. doi: 10.3758/s13421-018-0816-6.
8
Multilingual character recognition dataset for Moroccan official documents.
Data Brief. 2023 Dec 13;52:109953. doi: 10.1016/j.dib.2023.109953. eCollection 2024 Feb.
9
The role of spatial frequency channels in letter identification.
Vision Res. 2002 Apr;42(9):1165-84. doi: 10.1016/s0042-6989(02)00045-7.
10
Text legibility and the letter superiority effect.
Hum Factors. 2005 Winter;47(4):797-815. doi: 10.1518/001872005775570998.

Cited by

1
Real-Time Thinning Algorithms for 2D and 3D Images using GPU processors.
J Real Time Image Process. 2020 Oct 28;17(5):1255-1266. doi: 10.1007/s11554-019-00886-7. Epub 2019 May 28.