• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用定制深度学习技术的普什图语手写不变字符轨迹预测

Pashto Handwritten Invariant Character Trajectory Prediction Using a Customized Deep Learning Technique.

作者信息

Khaliq Fazli, Shabir Muhammad, Khan Inayat, Ahmad Shafiq, Usman Muhammad, Zubair Muhammad, Huda Shamsul

机构信息

Department of Computer Science, Islamia College University Peshawar, Peshawar 25000, Pakistan.

Department of Computer Science, University of Buner, Buner 19290, Pakistan.

出版信息

Sensors (Basel). 2023 Jun 30;23(13):6060. doi: 10.3390/s23136060.

DOI:10.3390/s23136060
PMID:37447909
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10346912/
Abstract

Before the 19th century, all communication and official records relied on handwritten documents, cherished as valuable artefacts by different ethnic groups. While significant efforts have been made to automate the transcription of major languages like English, French, Arabic, and Chinese, there has been less research on regional and minor languages, despite their importance from geographical and historical perspectives. This research focuses on detecting and recognizing Pashto handwritten characters and ligatures, which is essential for preserving this regional cursive language in Pakistan and its status as the national language of Afghanistan. Deep learning techniques were employed to detect and recognize Pashto characters and ligatures, utilizing a newly developed dataset specific to Pashto. A further enhancement was done on the dataset by implementing data augmentation, i.e., scaling and rotation on Pashto handwritten characters and ligatures, which gave us many variations of a single trajectory. Different morphological operations for minimizing gaps in the trajectories were also performed. The median filter was used for the removal of different noises. This dataset will be combined with the existing PHWD-V2 dataset. Various deep-learning techniques were evaluated, including VGG19, MobileNetV2, MobileNetV3, and a customized CNN. The customized CNN demonstrated the highest accuracy and minimal loss, achieving a training accuracy of 93.98%, validation accuracy of 92.08% and testing accuracy of 92.99%.

摘要

19世纪以前,所有的通信和官方记录都依赖手写文件,不同民族将其视为珍贵的文物。尽管已经做出了巨大努力来实现英语、法语、阿拉伯语和中文等主要语言的转录自动化,但对于地区性和小语种语言的研究却较少,尽管从地理和历史角度来看它们很重要。这项研究专注于检测和识别普什图语手写字符和连字,这对于在巴基斯坦保存这种地区性草书语言以及其作为阿富汗国语的地位至关重要。采用深度学习技术来检测和识别普什图语字符和连字,利用一个新开发的特定于普什图语的数据集。通过对普什图语手写字符和连字进行数据增强,即缩放和旋转,对数据集进行了进一步增强,这为我们提供了单个轨迹的许多变体。还执行了不同的形态学操作以最小化轨迹中的间隙。使用中值滤波器去除不同的噪声。这个数据集将与现有的PHWD-V2数据集合并。评估了各种深度学习技术,包括VGG19、MobileNetV2、MobileNetV3和一个定制的卷积神经网络(CNN)。定制的CNN表现出最高的准确率和最小的损失,训练准确率达到93.98%,验证准确率达到92.08%,测试准确率达到92.99%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5e1e42d38c5d/sensors-23-06060-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/bd40ee000a97/sensors-23-06060-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/7aa1eb984fdf/sensors-23-06060-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/46f9fa590b45/sensors-23-06060-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/73cae9e020ff/sensors-23-06060-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/be8c02fcddef/sensors-23-06060-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/026ade814ab4/sensors-23-06060-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5ae4b11e1180/sensors-23-06060-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/8b11e10d138f/sensors-23-06060-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/2b933c8de973/sensors-23-06060-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/84e81af6a969/sensors-23-06060-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5ba1856da56e/sensors-23-06060-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/487c03b602ac/sensors-23-06060-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/c10c288c3bee/sensors-23-06060-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5e1e42d38c5d/sensors-23-06060-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/bd40ee000a97/sensors-23-06060-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/7aa1eb984fdf/sensors-23-06060-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/46f9fa590b45/sensors-23-06060-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/73cae9e020ff/sensors-23-06060-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/be8c02fcddef/sensors-23-06060-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/026ade814ab4/sensors-23-06060-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5ae4b11e1180/sensors-23-06060-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/8b11e10d138f/sensors-23-06060-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/2b933c8de973/sensors-23-06060-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/84e81af6a969/sensors-23-06060-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5ba1856da56e/sensors-23-06060-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/487c03b602ac/sensors-23-06060-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/c10c288c3bee/sensors-23-06060-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c89/10346912/5e1e42d38c5d/sensors-23-06060-g014.jpg

相似文献

1
Pashto Handwritten Invariant Character Trajectory Prediction Using a Customized Deep Learning Technique.使用定制深度学习技术的普什图语手写不变字符轨迹预测
Sensors (Basel). 2023 Jun 30;23(13):6060. doi: 10.3390/s23136060.
2
Recognition of Pashto Handwritten Characters Based on Deep Learning.基于深度学习的普什图文手写字符识别。
Sensors (Basel). 2020 Oct 17;20(20):5884. doi: 10.3390/s20205884.
3
Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach.使用尺度、旋转和位置不变方法对普什图文草体进行稳健光学识别。
PLoS One. 2015 Sep 14;10(9):e0133648. doi: 10.1371/journal.pone.0133648. eCollection 2015.
4
Deep learning-based recognition system for pashto handwritten text: benchmark on PHTI.基于深度学习的普什图语手写文本识别系统:PHTI基准测试
PeerJ Comput Sci. 2024 Mar 27;10:e1925. doi: 10.7717/peerj-cs.1925. eCollection 2024.
5
CBD2023: A Hypercomplex Bangla Handwriting Character Recognition Data for Hierarchical Class Expansion.CBD2023:用于分层类别扩展的超复杂孟加拉语手写字符识别数据
Data Brief. 2023 Dec 8;52:109909. doi: 10.1016/j.dib.2023.109909. eCollection 2024 Feb.
6
Deep Learning-Based Child Handwritten Arabic Character Recognition and Handwriting Discrimination.基于深度学习的儿童手写阿拉伯字符识别与书写鉴别。
Sensors (Basel). 2023 Jul 28;23(15):6774. doi: 10.3390/s23156774.
7
Kurdish Handwritten character recognition using deep learning techniques.基于深度学习技术的库尔德手写字符识别。
Gene Expr Patterns. 2022 Dec;46:119278. doi: 10.1016/j.gep.2022.119278. Epub 2022 Oct 3.
8
Pashto offensive language detection: a benchmark dataset and monolingual Pashto BERT.普什图语冒犯性语言检测:一个基准数据集和单语普什图语BERT
PeerJ Comput Sci. 2023 Oct 18;9:e1617. doi: 10.7717/peerj-cs.1617. eCollection 2023.
9
Pashto script and graphics detection in camera captured Pashto document images using deep learning model.使用深度学习模型在相机拍摄的普什图文文档图像中进行普什图文脚本和图形检测。
PeerJ Comput Sci. 2024 Jul 26;10:e2089. doi: 10.7717/peerj-cs.2089. eCollection 2024.
10
Leveraging ShuffleNet transfer learning to enhance handwritten character recognition.利用 ShuffleNet 迁移学习来增强手写字符识别。
Gene Expr Patterns. 2022 Sep;45:119263. doi: 10.1016/j.gep.2022.119263. Epub 2022 Jul 16.

引用本文的文献

1
Deep learning-based recognition system for pashto handwritten text: benchmark on PHTI.基于深度学习的普什图语手写文本识别系统:PHTI基准测试
PeerJ Comput Sci. 2024 Mar 27;10:e1925. doi: 10.7717/peerj-cs.1925. eCollection 2024.

本文引用的文献

1
Skin Lesion Classification on Imbalanced Data Using Deep Learning with Soft Attention.基于深度学习的软注意力机制处理数据不平衡的皮肤损伤分类。
Sensors (Basel). 2022 Oct 4;22(19):7530. doi: 10.3390/s22197530.
2
Transfer learning for image classification using VGG19: Caltech-101 image data set.使用VGG19进行图像分类的迁移学习:加州理工学院101图像数据集。
J Ambient Intell Humaniz Comput. 2023;14(4):3609-3620. doi: 10.1007/s12652-021-03488-z. Epub 2021 Sep 17.
3
Recognition of Pashto Handwritten Characters Based on Deep Learning.基于深度学习的普什图文手写字符识别。
Sensors (Basel). 2020 Oct 17;20(20):5884. doi: 10.3390/s20205884.
4
Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach.使用尺度、旋转和位置不变方法对普什图文草体进行稳健光学识别。
PLoS One. 2015 Sep 14;10(9):e0133648. doi: 10.1371/journal.pone.0133648. eCollection 2015.