Suppr超能文献

利用生物启发特征的手写文字识别

Handwritten-word spotting using biologically inspired features.

作者信息

van der Zant Tijn, Schomaker Lambert, Haak Koen

机构信息

AI Department, University of Groningen, Postbus 407, 9700 AK Groningen, The Netherlands.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1945-57. doi: 10.1109/TPAMI.2008.144.

Abstract

For quick access to new handwritten collections, current handwriting recognition methods are too cumbersome. They cannot deal with the lack of labeled data and would require extensive laboratory training for each individual script, style, language and collection. We propose a biologically inspired whole-word recognition method which is used to incrementally elicit word labels in a live, web-based annotation system, named Monk. Since human labor should be minimized given the massive amount of image data, it becomes important to rely on robust perceptual mechanisms in the machine. Recent computational models of the neuro-physiology of vision are applied to isolated word classification. A primate cortex-like mechanism allows to classify text-images that have a low frequency of occurrence. Typically these images are the most difficult to retrieve and often contain named entities and are regarded as the most important to people. Usually standard pattern-recognition technology cannot deal with these text-images if there are not enough labeled instances. The results of this retrieval system are compared to normalized word-image matching and appear to be very promising.

摘要

对于快速访问新的手写文集而言,当前的手写识别方法过于繁琐。它们无法处理标记数据不足的问题,并且针对每种手写体、风格、语言和文集都需要进行大量的实验室训练。我们提出了一种受生物学启发的全词识别方法,该方法用于在一个名为Monk的基于网络的实时注释系统中逐步引出单词标签。鉴于海量的图像数据,应尽量减少人工操作,因此依靠机器中强大的感知机制变得很重要。最近的视觉神经生理学计算模型被应用于孤立单词分类。一种类似灵长类动物皮层的机制能够对出现频率较低的文本图像进行分类。通常,这些图像最难检索,并且常常包含命名实体,对人们来说被视为最重要的。如果没有足够的标记实例,标准的模式识别技术通常无法处理这些文本图像。该检索系统的结果与归一化单词图像匹配进行了比较,结果看起来很有前景。

相似文献

1
Handwritten-word spotting using biologically inspired features.利用生物启发特征的手写文字识别
IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1945-57. doi: 10.1109/TPAMI.2008.144.
2
Texture for script identification.用于脚本识别的纹理。
IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1720-32. doi: 10.1109/TPAMI.2005.227.
3
Offline loop investigation for handwriting analysis.用于笔迹分析的离线循环研究。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):193-209. doi: 10.1109/TPAMI.2008.68.
6
Recognition and verification of unconstrained handwritten words.无约束手写文字的识别与验证
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1509-22. doi: 10.1109/TPAMI.2005.207.
8
Offline grammar-based recognition of handwritten sentences.基于语法的手写句子离线识别
IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):818-21. doi: 10.1109/TPAMI.2006.103.
9
Script-independent text line segmentation in freestyle handwritten documents.自由手写文档中与脚本无关的文本行分割
IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1313-29. doi: 10.1109/TPAMI.2007.70792.
10
Signature detection and matching for document image retrieval.用于文档图像检索的签名检测与匹配。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2015-31. doi: 10.1109/TPAMI.2008.237.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验