• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多上下文集成的手写中文文本识别。

Handwritten Chinese text recognition by integrating multiple contexts.

机构信息

National Laboratory of Pattern Recognition,Institute of Automation, Chinese Academy of Sciences, Beijing, P.R. China.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1469-81. doi: 10.1109/TPAMI.2011.264.

DOI:10.1109/TPAMI.2011.264
PMID:22201052
Abstract

This paper presents an effective approach for the offline recognition of unconstrained handwritten Chinese texts. Under the general integrated segmentation-and-recognition framework with character oversegmentation, we investigate three important issues: candidate path evaluation, path search, and parameter estimation. For path evaluation, we combine multiple contexts (character recognition scores, geometric and linguistic contexts) from the Bayesian decision view, and convert the classifier outputs to posterior probabilities via confidence transformation. In path search, we use a refined beam search algorithm to improve the search efficiency and, meanwhile, use a candidate character augmentation strategy to improve the recognition accuracy. The combining weights of the path evaluation function are optimized by supervised learning using a Maximum Character Accuracy criterion. We evaluated the recognition performance on a Chinese handwriting database CASIA-HWDB, which contains nearly four million character samples of 7,356 classes and 5,091 pages of unconstrained handwritten texts. The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly. On a test set of 1,015 handwritten pages, the proposed approach achieved character-level accurate rate of 90.75 percent and correct rate of 91.39 percent, which are superior by far to the best results reported in the literature.

摘要

本文提出了一种有效的脱机手写体汉字识别方法。在具有字符过分割的通用集成分割与识别框架下,研究了三个重要问题:候选路径评估、路径搜索和参数估计。对于路径评估,我们从贝叶斯决策的角度结合了多个上下文(字符识别得分、几何和语言上下文),并通过置信度转换将分类器输出转换为后验概率。在路径搜索中,我们使用改进的波束搜索算法来提高搜索效率,同时使用候选字符增强策略来提高识别精度。路径评估函数的组合权重通过使用最大字符准确率的监督学习进行优化。我们在包含近四百万个样本、7356 类、5091 页无约束手写文本的 CASIA-HWDB 汉字手写数据库上评估了识别性能。实验结果表明,置信度转换和结合多个上下文显著提高了文本行识别性能。在 1015 页手写测试集上,所提出的方法在字符级上的准确率达到了 90.75%,正确率达到了 91.39%,远优于文献中报道的最佳结果。

相似文献

1
Handwritten Chinese text recognition by integrating multiple contexts.基于多上下文集成的手写中文文本识别。
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1469-81. doi: 10.1109/TPAMI.2011.264.
2
Writer adaptation with style transfer mapping.作者改编与风格转换映射。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1773-87. doi: 10.1109/TPAMI.2012.239.
3
An approach to offline handwritten Chinese character recognition based on segment evaluation of adaptive duration.一种基于自适应时长片段评估的离线手写汉字识别方法。
J Zhejiang Univ Sci. 2004 Nov;5(11):1392-7. doi: 10.1631/jzus.2004.1392.
4
Recognition and verification of unconstrained handwritten words.无约束手写文字的识别与验证
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1509-22. doi: 10.1109/TPAMI.2005.207.
5
Improving offline handwritten text recognition with hybrid HMM/ANN models.利用混合 HMM/ANN 模型提高离线手写文字识别。
IEEE Trans Pattern Anal Mach Intell. 2011 Apr;33(4):767-79. doi: 10.1109/TPAMI.2010.141.
6
Handwritten Chinese/Japanese text recognition using semi-Markov conditional random fields.基于半马尔可夫条件随机场的手写中文/日文文本识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2413-26. doi: 10.1109/TPAMI.2013.49.
7
Offline recognition of unconstrained handwritten texts using HMMs and statistical language models.使用隐马尔可夫模型和统计语言模型对手写文本进行离线识别。
IEEE Trans Pattern Anal Mach Intell. 2004 Jun;26(6):709-20. doi: 10.1109/TPAMI.2004.14.
8
A scale space approach for automatically segmenting words from historical handwritten documents.一种用于从历史手写文档中自动分割单词的尺度空间方法。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1212-25. doi: 10.1109/TPAMI.2005.150.
9
Markov random field-based statistical character structure modeling for handwritten Chinese character recognition.基于马尔可夫随机场的手写汉字识别统计特征结构建模
IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):767-80. doi: 10.1109/TPAMI.2007.70734.
10
A novel connectionist system for unconstrained handwriting recognition.一种用于无约束手写识别的新型连接主义系统。
IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):855-68. doi: 10.1109/TPAMI.2008.137.