Pashto isolated digits recognition using deep convolutional neural network.

Authors

Zada Bakht, Ullah Rahim

Affiliation

Government Degree College Samar Bagh, Pakistan.

Publication

Heliyon. 2020 Feb 12;6(2):e03372. doi: 10.1016/j.heliyon.2020.e03372. eCollection 2020 Feb.

DOI:10.1016/j.heliyon.2020.e03372
PMID:32083214
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7016387/
Abstract

Speech recognition has become one of the most significant parts of human-computer interaction with the emergence of devices such as smartphones and smart watches, which creates a need for automatic speech recognition (ASR) in local languages. The aim of this paper is to develop an isolated digit recognition system for the Pashto language using a deep convolutional neural network (CNN). The database contains the Pashto digits 0 to 9, with 50 utterances of each digit. Twenty Mel-frequency cepstral coefficient (MFCC) features are extracted from each isolated digit and fed as input to the CNN. The network used in the proposed system has up to four convolutional layers, followed by ReLU and max-pooling layers. The network was trained on 50% of the data, and the rest was used for testing. An overall average test accuracy of 84.17% was achieved, which is 7.32% better performance than existing similar works.
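
The data budget and layer depth described in the abstract can be made concrete with a small sketch. It is not the authors' code. The corpus index is a hypothetical stand-in for the recordings. The per-digit (stratified) split, the "same"-padded convolutions, the 2x2 pooling after every convolutional layer, and the 64-frame input width are all assumptions that the abstract does not state. Only the counts (10 digits, 50 utterances each, 20 MFCC features, 4 convolutional layers, a 50% training share) come from the abstract.

```python
import random
from collections import defaultdict

NUM_DIGITS = 10            # digits 0-9, per the abstract
UTTERANCES_PER_DIGIT = 50  # per the abstract
NUM_MFCC = 20              # MFCC features per utterance, per the abstract
NUM_CONV_LAYERS = 4        # per the abstract


def build_index():
    """Hypothetical stand-in for the corpus: one (digit, utterance_id) pair per recording."""
    return [(d, u) for d in range(NUM_DIGITS) for u in range(UTTERANCES_PER_DIGIT)]


def split_half(samples, seed=0):
    """50/50 train/test split done per digit (stratification is an assumption)."""
    rng = random.Random(seed)
    by_digit = defaultdict(list)
    for sample in samples:
        by_digit[sample[0]].append(sample)
    train, test = [], []
    for digit in sorted(by_digit):
        items = list(by_digit[digit])
        rng.shuffle(items)
        half = len(items) // 2
        train.extend(items[:half])
        test.extend(items[half:])
    return train, test


def trace_shapes(height, width, layers=NUM_CONV_LAYERS, pool=2):
    """Feature-map size after each assumed conv + ReLU + max-pool block.

    Assumes the convolution keeps the size ("same" padding) and that max
    pooling uses a pool x pool window with stride equal to the window size.
    """
    shapes = [(height, width)]
    for _ in range(layers):
        height //= pool
        width //= pool
        shapes.append((height, width))
    return shapes


samples = build_index()
train, test = split_half(samples)
print(len(samples), len(train), len(test))  # 500 250 250
print(trace_shapes(NUM_MFCC, 64))           # 64 frames is an assumed input width
```

Under these assumptions the network sees 250 training and 250 test utterances, and the 20-coefficient axis shrinks to a single row after the fourth pooling step (20, 10, 5, 2, 1). This suggests that four pooled layers is close to the depth limit for an input of this height.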

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379b/7016387/ca5cefc3e888/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379b/7016387/88faf0ddfb44/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379b/7016387/3a7c23d9c0b0/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379b/7016387/499e9e359625/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/379b/7016387/d9e16f9b5506/gr5.jpg

Cited by

1
Deep Learning-Based Classification of Spoken English Digits.
Comput Intell Neurosci. 2022 Sep 28;2022:3364141. doi: 10.1155/2022/3364141. eCollection 2022.