Handwritten Chinese/Japanese text recognition using semi-Markov conditional random fields.

Affiliation

Beijing Key Lab of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2413-26. doi: 10.1109/TPAMI.2013.49.

DOI: 10.1109/TPAMI.2013.49
PMID: 23969386
Abstract

This paper proposes a method for handwritten Chinese/Japanese text (character string) recognition based on semi-Markov conditional random fields (semi-CRFs). The high-order semi-CRF model is defined on a lattice containing all possible segmentation-recognition hypotheses of a string to elegantly fuse the scores of candidate character recognition and the compatibilities of geometric and linguistic contexts by representing them in the feature functions. Based on given models of character recognition and compatibilities, the fusion parameters are optimized by minimizing the negative log-likelihood loss with a margin term on a training string sample set. A forward-backward lattice pruning algorithm is proposed to reduce the computation in training when trigram language models are used, and beam search techniques are investigated to accelerate the decoding speed. We evaluate the performance of the proposed method on unconstrained online handwritten text lines of three databases. On the test sets of databases CASIA-OLHWDB (Chinese) and TUAT Kondate (Japanese), the character level correct rates are 95.20 and 95.44 percent, and the accurate rates are 94.54 and 94.55 percent, respectively. On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition.
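
The lattice formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a first-order model (a bigram compatibility instead of the trigram language model mentioned above), omits the margin term, lattice pruning, and beam search, and all names (`edge_score`, `forward_log_partition`, `viterbi`, the toy edges and weights) are hypothetical. Each lattice edge `(start, end, label, feats)` is one candidate character spanning primitive segments `start..end`.

```python
import math
from collections import defaultdict

def logsumexp(values):
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def edge_score(weights, feats):
    # Weighted fusion of per-candidate scores (e.g. classifier, geometry).
    return sum(w * f for w, f in zip(weights, feats))

def group_by_start(edges):
    by_start = defaultdict(list)
    for e in edges:
        by_start[e[0]].append(e)
    return by_start

def forward_log_partition(n, edges, bigram, weights, lm_weight):
    """log Z: log-sum of scores over all full paths from position 0 to n."""
    by_start = group_by_start(edges)
    alpha = {(0, "<s>"): 0.0}  # state = (end position, last label)
    for pos in range(n):  # every edge has start < end, so alpha at pos is final
        prev_states = [(lab, a) for (p, lab), a in alpha.items() if p == pos]
        for (_, end, label, feats) in by_start[pos]:
            terms = [a + edge_score(weights, feats) + lm_weight * bigram(prev, label)
                     for prev, a in prev_states]
            if not terms:
                continue  # position not reachable from the start
            val = logsumexp(terms)
            key = (end, label)
            alpha[key] = logsumexp([alpha[key], val]) if key in alpha else val
    return logsumexp([a for (p, _), a in alpha.items() if p == n])

def viterbi(n, edges, bigram, weights, lm_weight):
    """Highest-scoring segmentation-recognition path through the lattice."""
    by_start = group_by_start(edges)
    best = {(0, "<s>"): (0.0, [])}
    for pos in range(n):
        prev_states = [(lab, sc, path) for (p, lab), (sc, path) in best.items() if p == pos]
        for (start, end, label, feats) in by_start[pos]:
            for prev, sc, path in prev_states:
                cand = sc + edge_score(weights, feats) + lm_weight * bigram(prev, label)
                key = (end, label)
                if key not in best or cand > best[key][0]:
                    best[key] = (cand, path + [(start, end, label)])
    return max((v for (p, _), v in best.items() if p == n), key=lambda v: v[0])

def path_score(path_edges, bigram, weights, lm_weight):
    total, prev = 0.0, "<s>"
    for (_, _, label, feats) in path_edges:
        total += edge_score(weights, feats) + lm_weight * bigram(prev, label)
        prev = label
    return total

# Toy lattice over 2 primitive segments: either "A"+"B" or one merged "C".
edges = [(0, 1, "A", [1.0]), (1, 2, "B", [0.5]), (0, 2, "C", [1.2])]
table = {("A", "B"): 0.2}  # hypothetical bigram compatibilities

def bigram(prev, cur):
    return table.get((prev, cur), 0.0)

weights, lm_weight = [1.0], 1.0
log_z = forward_log_partition(2, edges, bigram, weights, lm_weight)
best_score, best_path = viterbi(2, edges, bigram, weights, lm_weight)
# Negative log-likelihood of the ground-truth path "A","B" (no margin term).
nll = log_z - path_score(edges[:2], bigram, weights, lm_weight)
print(best_path, round(nll, 4))
```

In training, the fusion parameters (`weights` and `lm_weight` here) would be adjusted to reduce `nll` summed over the training strings; decoding only needs `viterbi`.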

Similar articles

1
Handwritten Chinese/Japanese text recognition using semi-Markov conditional random fields.
IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2413-26. doi: 10.1109/TPAMI.2013.49.
2
Offline recognition of unconstrained handwritten texts using HMMs and statistical language models.
IEEE Trans Pattern Anal Mach Intell. 2004 Jun;26(6):709-20. doi: 10.1109/TPAMI.2004.14.
3
An approach to offline handwritten Chinese character recognition based on segment evaluation of adaptive duration.
J Zhejiang Univ Sci. 2004 Nov;5(11):1392-7. doi: 10.1631/jzus.2004.1392.
4
Markov random field-based statistical character structure modeling for handwritten Chinese character recognition.
IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):767-80. doi: 10.1109/TPAMI.2007.70734.
5
Preprocessing of low-quality handwritten documents using Markov random fields.
IEEE Trans Pattern Anal Mach Intell. 2009 Jul;31(7):1184-94. doi: 10.1109/TPAMI.2008.126.
6
Handwritten Chinese text recognition by integrating multiple contexts.
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1469-81. doi: 10.1109/TPAMI.2011.264.
7
Recognition and verification of unconstrained handwritten words.
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1509-22. doi: 10.1109/TPAMI.2005.207.
8
Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings.
IEEE Trans Pattern Anal Mach Intell. 2004 Nov;26(11):1395-407. doi: 10.1109/tpami.2004.104.
9
Online handwritten shape recognition using segmental hidden Markov models.
IEEE Trans Pattern Anal Mach Intell. 2007 Feb;29(2):205-17. doi: 10.1109/TPAMI.2007.38.
10
Hidden Markov models combining discrete symbols and continuous attributes in handwriting recognition.
IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):458-62. doi: 10.1109/TPAMI.2006.55.

Cited by

1
Generative vs. Discriminative Recognition Models for Off-Line Arabic Handwriting.
Sensors (Basel). 2018 Aug 24;18(9):2786. doi: 10.3390/s18092786.