BanglaWriting: A multi-purpose offline Bangla handwriting dataset.

Authors

Mridha M F, Ohi Abu Quwsar, Ali M Ameer, Emon Mazedul Islam, Kabir Muhammad Mohsin

Affiliation

Department of Computer Science & Engineering, Bangladesh University of Business & Technology, Dhaka, Bangladesh.

Publication information

Data Brief. 2020 Dec 9;34:106633. doi: 10.1016/j.dib.2020.106633. eCollection 2021 Feb.

DOI: 10.1016/j.dib.2020.106633
PMID: 33354607
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7744928/
Abstract

This article presents a Bangla handwriting dataset named BanglaWriting that contains single-page handwriting samples from 260 individuals of different personalities and ages. Each page includes bounding boxes that enclose each word, along with the Unicode representation of the writing. The dataset contains 21,234 words and 32,787 characters in total, including 5,470 unique words of Bangla vocabulary. Apart from the usual words, the dataset comprises 261 comprehensible overwritings and 450 handwritten strikes and mistakes. All of the bounding boxes and word labels were generated manually. The dataset can be used for complex optical character/word recognition, writer identification, handwritten word segmentation, and word generation. Furthermore, it is suitable for extracting age-based and gender-based variation in handwriting.
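The statistics quoted in the abstract (total words, unique words, characters) can be illustrated with a minimal sketch. It assumes each page has already been loaded as a list of word-level records holding a bounding box and a Unicode label; the `WordBox` type, its field names, and the `summarize` helper are hypothetical and do not reflect the dataset's actual file format. Characters are counted here as Unicode code points, which may differ from how the authors counted them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordBox:
    """Hypothetical word-level annotation: pixel bounding box plus Unicode label."""
    x_min: int
    y_min: int
    x_max: int
    y_max: int
    label: str

    def size(self) -> tuple[int, int]:
        # Width and height of the box in pixels.
        return (self.x_max - self.x_min, self.y_max - self.y_min)


def summarize(pages: list[list[WordBox]]) -> dict[str, int]:
    """Count words, unique words, and Unicode code points across all pages."""
    labels = [box.label for page in pages for box in page]
    return {
        "words": len(labels),
        "unique_words": len(set(labels)),
        # Assumption: one "character" = one Unicode code point.
        "code_points": sum(len(label) for label in labels),
    }


# Two toy pages; labels are written as escapes so the code point counts are explicit.
AMI = "\u0986\u09ae\u09bf"                # 3 code points
BANGLA = "\u09ac\u09be\u0982\u09b2\u09be"  # 5 code points

pages = [
    [WordBox(10, 10, 60, 40, AMI), WordBox(70, 10, 150, 40, BANGLA)],
    [WordBox(12, 15, 58, 42, AMI)],
]
stats = summarize(pages)
```

Keeping the label as a plain Unicode string means the same records could feed word recognition (image crop to label) or segmentation (image to boxes) without conversion.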

Cited by

1
Kurdish standard EMNIST-like character dataset.
Data Brief. 2024 Jan 9;52:110038. doi: 10.1016/j.dib.2024.110038. eCollection 2024 Feb.
