• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

作为服务的单词识别:一种用于手写文档的无监督且无分割的框架。

Word Spotting as a Service: An Unsupervised and Segmentation-Free Framework for Handwritten Documents.

作者信息

Zagoris Konstantinos, Amanatiadis Angelos, Pratikakis Ioannis

机构信息

Department of Computer Science, Neapolis University, Pafos 8042, Cyprus.

Department of Production and Management Engineering, Democritus University of Thrace, 67132 Xanthi, Greece.

出版信息

J Imaging. 2021 Dec 17;7(12):278. doi: 10.3390/jimaging7120278.

DOI:10.3390/jimaging7120278
PMID:34940745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8709349/
Abstract

Word spotting strategies employed in historical handwritten documents face many challenges due to variation in the writing style and intense degradation. In this paper, a new method that permits efficient and effective word spotting in handwritten documents is presented that relies upon document-oriented local features that take into account information around representative keypoints and a matching process that incorporates a spatial context in a local proximity search without using any training data. The method relies on a document-oriented keypoint and feature extraction, along with a fast feature matching method. This enables the corresponding methodological pipeline to be both effectively and efficiently employed in the cloud so that word spotting can be realised as a service in modern mobile devices. The effectiveness and efficiency of the proposed method in terms of its matching accuracy, along with its fast retrieval time, respectively, are shown after a consistent evaluation of several historical handwritten datasets.

摘要

由于书写风格的变化和严重的退化,历史手写文档中采用的单词识别策略面临诸多挑战。本文提出了一种新方法,该方法依赖于面向文档的局部特征(考虑代表性关键点周围的信息)和匹配过程(在局部邻近搜索中纳入空间上下文,且不使用任何训练数据),从而能够在手写文档中高效且有效地进行单词识别。该方法依赖于面向文档的关键点和特征提取,以及快速特征匹配方法。这使得相应的方法管道能够在云端有效且高效地应用,从而使单词识别能够在现代移动设备中作为一种服务得以实现。在对多个历史手写数据集进行一致评估后,分别展示了所提方法在匹配准确性方面的有效性以及快速检索时间方面的效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/5f049d6ac63d/jimaging-07-00278-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/4db3e779d6ba/jimaging-07-00278-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/f768164ef72e/jimaging-07-00278-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/e1123b2688e5/jimaging-07-00278-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/35ccccab7b19/jimaging-07-00278-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/43a8d42b7af8/jimaging-07-00278-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/1e621c5cc206/jimaging-07-00278-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/ff31548c26b1/jimaging-07-00278-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/3ab28c4a28e2/jimaging-07-00278-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/5f049d6ac63d/jimaging-07-00278-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/4db3e779d6ba/jimaging-07-00278-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/f768164ef72e/jimaging-07-00278-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/e1123b2688e5/jimaging-07-00278-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/35ccccab7b19/jimaging-07-00278-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/43a8d42b7af8/jimaging-07-00278-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/1e621c5cc206/jimaging-07-00278-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/ff31548c26b1/jimaging-07-00278-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/3ab28c4a28e2/jimaging-07-00278-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64df/8709349/5f049d6ac63d/jimaging-07-00278-g009.jpg

相似文献

1
Word Spotting as a Service: An Unsupervised and Segmentation-Free Framework for Handwritten Documents.作为服务的单词识别:一种用于手写文档的无监督且无分割的框架。
J Imaging. 2021 Dec 17;7(12):278. doi: 10.3390/jimaging7120278.
2
Unsupervised Word Spotting in Historical Handwritten Document Images Using Document-Oriented Local Features.基于面向文档局部特征的无监督历史手写文档图像文字定位。
IEEE Trans Image Process. 2017 Aug;26(8):4032-4041. doi: 10.1109/TIP.2017.2700721. Epub 2017 May 3.
3
Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting.基于 Hough 变换的角度特征用于无学习的手写关键词定位。
Sensors (Basel). 2021 Jul 7;21(14):4648. doi: 10.3390/s21144648.
4
Efficient Learning-Free Keyword Spotting.高效的免学习关键词识别
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1587-1600. doi: 10.1109/TPAMI.2018.2845880. Epub 2018 Jun 11.
5
A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents.一种用于评估历史手写文档转录的多关键词识别系统性能的模型。
J Imaging. 2020 Nov 3;6(11):117. doi: 10.3390/jimaging6110117.
6
A novel word spotting method based on recurrent neural networks.基于循环神经网络的新型字词定位方法。
IEEE Trans Pattern Anal Mach Intell. 2012 Feb;34(2):211-24. doi: 10.1109/TPAMI.2011.113.
7
A scale space approach for automatically segmenting words from historical handwritten documents.一种用于从历史手写文档中自动分割单词的尺度空间方法。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1212-25. doi: 10.1109/TPAMI.2005.150.
8
Word Spotting and Recognition with Embedded Attributes.基于嵌入式属性的字词定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2014 Dec;36(12):2552-66. doi: 10.1109/TPAMI.2014.2339814.
9
HTR for Greek Historical Handwritten Documents.希腊历史手写文献的高通量转录组测序
J Imaging. 2021 Dec 2;7(12):260. doi: 10.3390/jimaging7120260.
10
One Step Is Not Enough: A Multi-Step Procedure for Building the Training Set of a Query by String Keyword Spotting System to Assist the Transcription of Historical Document.一步是不够的:一种用于通过字符串关键词识别系统构建查询训练集以辅助历史文献转录的多步骤程序。
J Imaging. 2020 Oct 13;6(10):109. doi: 10.3390/jimaging6100109.

本文引用的文献

1
Efficient Learning-Free Keyword Spotting.高效的免学习关键词识别
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1587-1600. doi: 10.1109/TPAMI.2018.2845880. Epub 2018 Jun 11.
2
Unsupervised Word Spotting in Historical Handwritten Document Images Using Document-Oriented Local Features.基于面向文档局部特征的无监督历史手写文档图像文字定位。
IEEE Trans Image Process. 2017 Aug;26(8):4032-4041. doi: 10.1109/TIP.2017.2700721. Epub 2017 May 3.
3
Word Spotting and Recognition with Embedded Attributes.基于嵌入式属性的字词定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2014 Dec;36(12):2552-66. doi: 10.1109/TPAMI.2014.2339814.
4
Improving offline handwritten text recognition with hybrid HMM/ANN models.利用混合 HMM/ANN 模型提高离线手写文字识别。
IEEE Trans Pattern Anal Mach Intell. 2011 Apr;33(4):767-79. doi: 10.1109/TPAMI.2010.141.