Suppr超能文献

A vast dataset for Kurdish handwritten digits and isolated characters recognition.

作者信息

Abdalla Peshraw Ahmed, Qadir Abdalbasit Mohammed, Shakor Mohammed Y, Saeed Ari M, Jabar Abdalla Taha, Salam Ali Abdalla, Amin Hedi Hamid Hama

机构信息

Department of Computer Science, College of Science, University of Halabja, Halabja, Iraq.

Department of Computer Science, College of Science and Technology, University of Human Development, Sulaimaniyah, Iraq.

出版信息

Data Brief. 2023 Mar 2;47:109014. doi: 10.1016/j.dib.2023.109014. eCollection 2023 Apr.

Abstract

This article presents two massive datasets for central Kurdish handwriting digits and isolated characters named and . The first dataset, named dataset, contains 70,000 images of Kurdish digits, 7000 images for each digit, and a printed A4 paper with a grid of 10 × 10 is used for data collection. Apart from digits, the dataset includes 245,000 images of all Kurdish characters, 7000 images for each character; data was collected via a printed A4 paper with a grid of 12 × 10 for this dataset. Moreover, both datasets include 315,000 images. Python programming has been used to scan each piece of paper, segment, crop, resize, binarize, and invert the images via edge detection and image processing techniques.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a764/10018436/aaf682d2f78e/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验