
An online multilingual numeral dataset on Devnagari and English languages for pattern recognition research.

Authors

Jabde Meenal K, Patil Chandrashekhar H, Vibhute Amol D, Mali Shankar

Affiliations

School of Computer Science, Dr. Vishwanath Karad MIT World Peace University, Pune, MH, India.

Symbiosis Institute of Computer Studies and Research (SICSR), Symbiosis International (Deemed University), Pune-411016, MH, India.

Publication information

Data Brief. 2023 Oct 31;51:109743. doi: 10.1016/j.dib.2023.109743. eCollection 2023 Dec.

DOI: 10.1016/j.dib.2023.109743
PMID: 38020443
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10654529/

Abstract

Real-time air-writing multilingual datasets are widely used for tasks such as handwritten character and numeral pattern recognition. Air-writing systems are commonly used in operating theatres, online education systems, banking, reservation counters, and similar settings. However, few air-written numeral datasets exist for Devanagari and English, although such data are needed for pattern detection. This article therefore introduces novel air-written numeral datasets for Devanagari and English. It also proposes a systematic strategy for collecting the air-written multilingual numeral dataset from 100 individuals aged 20 to 40. Each individual wrote the digits 0-9 in the air ten times in both Devanagari and English, yielding 10,000 images per language and 20,000 images in total, which were stored in the databases. The proposed dataset is freely available and could be a useful resource for pattern recognition research.
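
The image counts in the abstract follow from simple multiplication. The short Python sketch below reproduces that arithmetic; the variable names are illustrative and are not taken from the dataset or its documentation, and the per-digit figure is derived here on the assumption that every participant completed every repetition.

```python
# Dataset-size arithmetic using the figures stated in the abstract.
# All names are illustrative assumptions, not identifiers from the dataset.
PARTICIPANTS = 100          # individuals who wrote in the air
DIGITS = 10                 # numerals 0-9
REPETITIONS = 10            # times each digit was written per person
LANGUAGES = ("Devanagari", "English")

images_per_language = PARTICIPANTS * DIGITS * REPETITIONS
total_images = images_per_language * len(LANGUAGES)

# Derived (not stated in the abstract): samples per digit class per language,
# assuming every participant completed every repetition.
samples_per_digit = PARTICIPANTS * REPETITIONS

print(images_per_language, total_images, samples_per_digit)
```

Under these assumptions each digit class holds the same number of samples per language, so the dataset as described would be class-balanced.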

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb14/10654529/07a5c528984f/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb14/10654529/8ecd2c9dafae/gr1.jpg
