• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的语音障碍筛查工具的开发。

Development of a machine-learning based voice disorder screening tool.

机构信息

Division of Otolaryngology-Head and Neck Surgery, Department of Surgery, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB, Canada.

Department of Physics, Faculty of Science, University of Alberta, Edmonton, AB, Canada.

出版信息

Am J Otolaryngol. 2022 Mar-Apr;43(2):103327. doi: 10.1016/j.amjoto.2021.103327. Epub 2021 Dec 15.

DOI:10.1016/j.amjoto.2021.103327
PMID:34923280
Abstract

OBJECTIVE

Early recognition and referral are crucial for voice disorder management. Limited availability of subspecialists, poor primary care awareness, and the need for specialized equipment impede effective care. Thus, there is a need for a tool to improve voice pathology screening. Machine learning algorithms (MLAs) have shown promise in analyzing acoustic characteristics of phonation. However, few studies report clinical applications of MLAs for voice pathology detection. The objective of this study was to design and validate a MLA for detecting pathological voices.

METHODS

A MLA was developed for voice analysis. Audio samples converted into spectrograms were inputted into a pre-existing VGG19 convolutional neural network (CNN) and image-classifier. The resulting feature map was classified as either pathological or healthy using a Support Vector Machine (SVM) binary linear classifier. This combined MLA was "trained" with 950 sustained "/i/" vowel audio samples from the Saarbrucken Voice Database (SVD), which contains subjects with and without voice disorders. The trained MLA was "tested" with 406 SVD samples to determine sensitivity, specificity, and overall accuracy. External validation of the MLA was performed using clinical voice samples collected from patients attending a subspecialty voice clinic.

RESULTS

The MLA detected pathologies in SVD samples with 98.5% sensitivity, 97.1% specificity and 97.8% overall accuracy. In 30 samples obtained prospectively from voice clinic patients, the MLA detected pathologies with 100% sensitivity, 96.3% specificity and 96.7% overall accuracy.

CONCLUSIONS

This study demonstrates that a MLA using a simple audio input can detect diverse vocal pathologies with high sensitivity and specificity. Thus, this algorithm shows promise as a potential screening tool.

摘要

目的

早期识别和转介对于嗓音障碍管理至关重要。由于专家资源有限、初级保健意识较差以及对专业设备的需求,有效治疗受到阻碍。因此,需要有一种工具来改善嗓音病理学筛查。机器学习算法(MLA)已显示出在分析发声的声学特征方面的潜力。然而,很少有研究报告 MLA 用于嗓音病理学检测的临床应用。本研究旨在设计和验证一种用于检测病理性嗓音的 MLA。

方法

我们开发了一种用于语音分析的 MLA。转换为声谱图的音频样本被输入到现有的 VGG19 卷积神经网络(CNN)和图像分类器中。使用支持向量机(SVM)二进制线性分类器对生成的特征图进行分类,分为病理性或健康性。这种组合的 MLA 使用来自 Saarbrucken 嗓音数据库(SVD)的 950 个持续的/i/元音音频样本进行“训练”,该数据库包含有声带疾病和无嗓音疾病的受试者。使用 406 个 SVD 样本对经过训练的 MLA 进行“测试”,以确定灵敏度、特异性和总体准确性。使用从参加嗓音专科诊所的患者中收集的临床嗓音样本对 MLA 进行外部验证。

结果

该 MLA 检测 SVD 样本中的病变,灵敏度为 98.5%,特异性为 97.1%,总体准确性为 97.8%。在从嗓音诊所患者前瞻性获得的 30 个样本中,MLA 检测到的病变具有 100%的灵敏度、96.3%的特异性和 96.7%的总体准确性。

结论

本研究表明,使用简单音频输入的 MLA 可以以高灵敏度和特异性检测多种嗓音病变。因此,该算法作为一种潜在的筛查工具具有很大的应用前景。

相似文献

1
Development of a machine-learning based voice disorder screening tool.基于机器学习的语音障碍筛查工具的开发。
Am J Otolaryngol. 2022 Mar-Apr;43(2):103327. doi: 10.1016/j.amjoto.2021.103327. Epub 2021 Dec 15.
2
The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database.使用马来西亚语音病理学数据库检测语音病理学的在线序贯极限学习机的准确性。
J Otolaryngol Head Neck Surg. 2023 Sep 20;52(1):62. doi: 10.1186/s40463-023-00661-6.
3
Design and Validation of a New Diagnostic Tool for the Differentiation of Pathological Voices in Parkinsonian Patients.设计和验证一种用于帕金森病患者病理性声音鉴别诊断的新工具。
Adv Exp Med Biol. 2021;1339:77-83. doi: 10.1007/978-3-030-78787-5_11.
4
Classification of Voice Disorders Using a One-Dimensional Convolutional Neural Network.基于一维卷积神经网络的嗓音障碍分类
J Voice. 2022 Jan;36(1):15-20. doi: 10.1016/j.jvoice.2020.02.009. Epub 2020 Mar 13.
5
Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients.基于机器学习的语音评估用于检测 COVID-19 阳性和康复患者。
J Voice. 2024 May;38(3):796.e1-796.e13. doi: 10.1016/j.jvoice.2021.11.004. Epub 2021 Nov 26.
6
Discrimination between pathological and normal voices using GMM-SVM approach.基于 GMM-SVM 方法的病理性嗓音与正常嗓音的区分。
J Voice. 2011 Jan;25(1):38-43. doi: 10.1016/j.jvoice.2009.08.002. Epub 2010 Feb 4.
7
A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.一种使用声振图从喉部高速视频分析持续和动态声带振动的通用程序。
Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30.
8
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.基于倒谱向量的病理性嗓音检测:深度学习方法。
J Voice. 2019 Sep;33(5):634-641. doi: 10.1016/j.jvoice.2018.02.003. Epub 2018 Mar 19.
9
The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: Systematic Review and Meta-analysis.监督机器学习在筛查和诊断嗓音障碍中的有效性:系统评价和荟萃分析。
J Med Internet Res. 2022 Oct 14;24(10):e38472. doi: 10.2196/38472.
10
Meta-analysis of voice disorders databases and applied machine learning techniques.嗓音障碍数据库的元分析及应用机器学习技术。
Math Biosci Eng. 2020 Nov 11;17(6):7958-7979. doi: 10.3934/mbe.2020404.

引用本文的文献

1
A Deep-Learning Model for Multi-class Audio Classification of Vocal Fold Pathologies in Office Stroboscopy.一种用于办公室频闪喉镜检查中声带病变多类别音频分类的深度学习模型。
Laryngoscope. 2025 Jul;135(7):2428-2436. doi: 10.1002/lary.32036. Epub 2025 Feb 5.
2
New developments in the application of artificial intelligence to laryngology.人工智能在喉科学中的应用新进展。
Curr Opin Otolaryngol Head Neck Surg. 2024 Dec 1;32(6):391-397. doi: 10.1097/MOO.0000000000000999. Epub 2024 Jul 24.
3
Stakeholder perspectives on ethical and trustworthy voice AI in health care.
医疗保健领域中利益相关者对符合伦理道德且值得信赖的语音人工智能的看法。
Digit Health. 2024 Jul 23;10:20552076241260407. doi: 10.1177/20552076241260407. eCollection 2024 Jan-Dec.
4
An introduction to machine learning and generative artificial intelligence for otolaryngologists-head and neck surgeons: a narrative review.耳鼻喉科-头颈外科医师的机器学习和生成式人工智能入门:叙述性综述。
Eur Arch Otorhinolaryngol. 2024 May;281(5):2723-2731. doi: 10.1007/s00405-024-08512-4. Epub 2024 Feb 23.
5
Multitask and Transfer Learning Approach for Joint Classification and Severity Estimation of Dysphonia.多任务和迁移学习方法在联合分类和嗓音障碍严重程度估计中的应用。
IEEE J Transl Eng Health Med. 2023 Dec 7;12:233-244. doi: 10.1109/JTEHM.2023.3340345. eCollection 2024.
6
Survey of Transfer Learning Approaches in the Machine Learning of Digital Health Sensing Data.数字健康传感数据机器学习中的迁移学习方法综述
J Pers Med. 2023 Dec 12;13(12):1703. doi: 10.3390/jpm13121703.
7
Correlating Perceptual Voice Quality in Adductor Spasmodic Dysphonia With Computer Vision Assessment of Glottal Geometry Dynamics.关联评估喉肌痉挛性发音障碍的计算机视觉与感知声音质量的评估。
J Speech Lang Hear Res. 2022 Oct 17;65(10):3695-3708. doi: 10.1044/2022_JSLHR-22-00053. Epub 2022 Sep 21.