• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

卷积神经网络(CNN)和循环神经网络(RNN)在语音病理学检测中的比较分析。

Comparative Analysis of CNN and RNN for Voice Pathology Detection.

机构信息

Department of Biomedical Engineering and Department of Electrical Engineering, Ziauddin University Faculty of Engineering Science, Technology, and Management, Karachi, Pakistan.

Department of Electrical Engineering and Department of Software Engineering, Ziauddin University Faculty of Engineering Science, Technology, and Management, Karachi, Pakistan.

出版信息

Biomed Res Int. 2021 Apr 14;2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.

DOI:10.1155/2021/6635964
PMID:33937404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8062167/
Abstract

Diagnosis on the basis of a computerized acoustic examination may play an incredibly important role in early diagnosis and in monitoring and even improving effective pathological speech diagnostics. Various acoustic metrics test the health of the voice. The precision of these parameters also has to do with algorithms for the detection of speech noise. The idea is to detect the disease pathology from the voice. First, we apply the feature extraction on the SVD dataset. After the feature extraction, the system input goes into the 27 neuronal layer neural networks that are convolutional and recurrent neural network. We divided the dataset into training and testing, and after 10 k-fold validation, the reported accuracies of CNN and RNN are 87.11% and 86.52%, respectively. A 10-fold cross-validation is used to evaluate the performance of the classifier. On a Linux workstation with one NVidia Titan X GPU, program code was written in Python using the TensorFlow package.

摘要

基于计算机声学检查的诊断可能在早期诊断以及监测甚至改善有效的病理性语音诊断中发挥非常重要的作用。各种声学指标测试声音的健康状况。这些参数的精度也与语音噪声检测算法有关。其理念是从声音中检测出疾病病理。首先,我们在奇异值分解数据集上应用特征提取。在特征提取之后,系统输入进入 27 个神经元层的卷积和递归神经网络。我们将数据集分为训练和测试,在 10k 折验证后,CNN 和 RNN 的报告准确率分别为 87.11%和 86.52%。使用 10 折交叉验证来评估分类器的性能。在具有一个 NVIDIA Titan X GPU 的 Linux 工作站上,使用 TensorFlow 包在 Python 中编写了程序代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/c42b374d9847/BMRI2021-6635964.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/963f82d0a399/BMRI2021-6635964.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/a4af7f6706c4/BMRI2021-6635964.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/7c5fa56414f2/BMRI2021-6635964.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/54abd18a5cb0/BMRI2021-6635964.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/812bb45025f0/BMRI2021-6635964.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/3a643fef4b94/BMRI2021-6635964.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/e520bba6fd27/BMRI2021-6635964.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/c42b374d9847/BMRI2021-6635964.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/963f82d0a399/BMRI2021-6635964.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/a4af7f6706c4/BMRI2021-6635964.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/7c5fa56414f2/BMRI2021-6635964.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/54abd18a5cb0/BMRI2021-6635964.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/812bb45025f0/BMRI2021-6635964.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/3a643fef4b94/BMRI2021-6635964.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/e520bba6fd27/BMRI2021-6635964.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5298/8062167/c42b374d9847/BMRI2021-6635964.008.jpg

相似文献

1
Comparative Analysis of CNN and RNN for Voice Pathology Detection.卷积神经网络(CNN)和循环神经网络(RNN)在语音病理学检测中的比较分析。
Biomed Res Int. 2021 Apr 14;2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.
2
Fully Automated Convolutional Neural Network Method for Quantification of Breast MRI Fibroglandular Tissue and Background Parenchymal Enhancement.全自动卷积神经网络方法定量分析乳腺 MRI 的纤维腺体组织和背景实质增强。
J Digit Imaging. 2019 Feb;32(1):141-147. doi: 10.1007/s10278-018-0114-7.
3
Convolutional Neural Networks for Pathological Voice Detection.用于病理性语音检测的卷积神经网络
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:1-4. doi: 10.1109/EMBC.2018.8513222.
4
Predicting Breast Cancer Molecular Subtype with MRI Dataset Utilizing Convolutional Neural Network Algorithm.利用卷积神经网络算法对 MRI 数据集进行乳腺癌分子亚型预测。
J Digit Imaging. 2019 Apr;32(2):276-282. doi: 10.1007/s10278-019-00179-2.
5
Classification of Voice Disorders Using a One-Dimensional Convolutional Neural Network.基于一维卷积神经网络的嗓音障碍分类
J Voice. 2022 Jan;36(1):15-20. doi: 10.1016/j.jvoice.2020.02.009. Epub 2020 Mar 13.
6
A novel CNN algorithm for pathological complete response prediction using an I-SPY TRIAL breast MRI database.一种使用 I-SPY TRIAL 乳腺 MRI 数据库预测病理完全缓解的新型 CNN 算法。
Magn Reson Imaging. 2020 Nov;73:148-151. doi: 10.1016/j.mri.2020.08.021. Epub 2020 Sep 2.
7
Axillary Lymph Node Evaluation Utilizing Convolutional Neural Networks Using MRI Dataset.利用 MRI 数据集的卷积神经网络进行腋窝淋巴结评估。
J Digit Imaging. 2018 Dec;31(6):851-856. doi: 10.1007/s10278-018-0086-7.
8
Meta-analysis of voice disorders databases and applied machine learning techniques.嗓音障碍数据库的元分析及应用机器学习技术。
Math Biosci Eng. 2020 Nov 11;17(6):7958-7979. doi: 10.3934/mbe.2020404.
9
Development of a machine-learning based voice disorder screening tool.基于机器学习的语音障碍筛查工具的开发。
Am J Otolaryngol. 2022 Mar-Apr;43(2):103327. doi: 10.1016/j.amjoto.2021.103327. Epub 2021 Dec 15.
10
Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning.基于梅尔频率倒谱系数的深度学习在声谱域中的神经退行性疾病诊断
Comput Math Methods Med. 2022 Apr 4;2022:4364186. doi: 10.1155/2022/4364186. eCollection 2022.

引用本文的文献

1
Applications of flexible materials in health management assisted by machine learning.柔性材料在机器学习辅助健康管理中的应用。
RSC Adv. 2025 Jun 30;15(28):22386-22410. doi: 10.1039/d5ra02594j.
2
Research on automatic assessment of the severity of unilateral vocal cord paralysis based on Mel-spectrogram and convolutional neural networks.基于梅尔频谱图和卷积神经网络的单侧声带麻痹严重程度自动评估研究
Biomed Eng Online. 2025 Jun 21;24(1):76. doi: 10.1186/s12938-025-01401-9.
3
Deep learning-based classification of speech disorder in stroke and hearing impairment.

本文引用的文献

1
Inter classifier comparison to detect voice pathologies.嗓音病变的分类器比较。
Math Biosci Eng. 2021 Mar 5;18(3):2258-2273. doi: 10.3934/mbe.2021114.
2
Meta-analysis of voice disorders databases and applied machine learning techniques.嗓音障碍数据库的元分析及应用机器学习技术。
Math Biosci Eng. 2020 Nov 11;17(6):7958-7979. doi: 10.3934/mbe.2020404.
3
Multiple Vowels Repair Based on Pitch Extraction and Line Spectrum Pair Feature for Voice Disorder.基于基频提取和线谱对特征的多重元音修复用于语音障碍。
基于深度学习的中风和听力障碍语音障碍分类
PLoS One. 2025 May 28;20(5):e0315286. doi: 10.1371/journal.pone.0315286. eCollection 2025.
4
Detecting Forged Audio Files Using "Mixed Paste" Command: A Deep Learning Approach Based on Korean Phonemic Features.使用“混合粘贴”命令检测伪造音频文件:一种基于韩语音素特征的深度学习方法
Sensors (Basel). 2024 Mar 14;24(6):1872. doi: 10.3390/s24061872.
5
Multitask and Transfer Learning Approach for Joint Classification and Severity Estimation of Dysphonia.多任务和迁移学习方法在联合分类和嗓音障碍严重程度估计中的应用。
IEEE J Transl Eng Health Med. 2023 Dec 7;12:233-244. doi: 10.1109/JTEHM.2023.3340345. eCollection 2024.
6
End-to-end deep learning classification of vocal pathology using stacked vowels.使用叠加元音的端到端深度学习进行嗓音病理学分类
Laryngoscope Investig Otolaryngol. 2023 Aug 31;8(5):1312-1318. doi: 10.1002/lio2.1144. eCollection 2023 Oct.
7
A deep learning-based approach to diagnose mild traumatic brain injury using audio classification.基于深度学习的音频分类方法诊断轻度创伤性脑损伤。
PLoS One. 2022 Sep 28;17(9):e0274395. doi: 10.1371/journal.pone.0274395. eCollection 2022.
8
Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery.用于评估喉癌手术后替代发声和语音的轻量级深度学习模型
Cancers (Basel). 2022 May 11;14(10):2366. doi: 10.3390/cancers14102366.
9
Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning.基于梅尔频率倒谱系数的深度学习在声谱域中的神经退行性疾病诊断
Comput Math Methods Med. 2022 Apr 4;2022:4364186. doi: 10.1155/2022/4364186. eCollection 2022.
IEEE J Biomed Health Inform. 2020 Jul;24(7):1940-1951. doi: 10.1109/JBHI.2020.2978103. Epub 2020 Mar 3.
4
A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders.机器学习方法在自动语音障碍检测中的研究进展
J Voice. 2019 Nov;33(6):947.e11-947.e33. doi: 10.1016/j.jvoice.2018.07.014. Epub 2018 Oct 11.
5
Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms.开发阿拉伯语语音病理学数据库及其通过语音特征和机器学习算法进行评估。
J Healthc Eng. 2017;2017:8783751. doi: 10.1155/2017/8783751. Epub 2017 Oct 19.
6
An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification.在三个不同数据库中用于语音病理学检测和分类的多维语音程序参数研究
J Voice. 2017 Jan;31(1):113.e9-113.e18. doi: 10.1016/j.jvoice.2016.03.019. Epub 2016 Apr 19.
7
Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions.基于相关函数的不同频率区域语音病理学检测与分类研究
J Voice. 2017 Jan;31(1):3-15. doi: 10.1016/j.jvoice.2016.01.014. Epub 2016 Mar 15.
8
Multi parametric voice assessment: sri ramachandra university protocol.多参数语音评估:斯里兰卡拉马钱德拉大学方案
Indian J Otolaryngol Head Neck Surg. 2014 Jan;66(Suppl 1):246-51. doi: 10.1007/s12070-011-0460-y. Epub 2012 Jan 25.
9
Vocal dose measures: quantifying accumulated vibration exposure in vocal fold tissues.嗓音剂量测量:量化声带组织中的累积振动暴露。
J Speech Lang Hear Res. 2003 Aug;46(4):919-32. doi: 10.1044/1092-4388(2003/072).
10
A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS).一种用于嗓音病理学功能评估的基本方案,尤其用于研究(嗓音外科)治疗的疗效和评估新的评估技术。由欧洲喉科学会(ELS)嗓音病学委员会制定的指南。
Eur Arch Otorhinolaryngol. 2001 Feb;258(2):77-82. doi: 10.1007/s004050000299.