基于 GMM-SVM 方法的病理性嗓音与正常嗓音的区分。

Discrimination between pathological and normal voices using GMM-SVM approach.

机构信息

Thinkit Speech Lab, Institute of Acoustics, Chinese Academy of Science, Beijing, China.

出版信息

J Voice. 2011 Jan;25(1):38-43. doi: 10.1016/j.jvoice.2009.08.002. Epub 2010 Feb 4.

DOI:10.1016/j.jvoice.2009.08.002

PMID:20137892

Abstract

Acoustic features of vocal tract function are used widely in the study of pathological voices detection. Classification of normal and pathological voices by acoustic parameters is a useful way to diagnose voice diseases. In this aspect, mel-frequency cepstral coefficients are proved to be effective with traditional classifiers such as Gaussian Mixture Model (GMM). However, the accuracy of the classification method can be further improved. In this article, a Gaussian mixture model supervector kernel-support vector machine (GMM-SVM) classifier is compared with GMM classifier for the detection of voice pathology. We found that a sustain vowel phonation can be classified as normal or pathological with an accuracy of 96.1%. Voice recordings are selected from the Kay database to carry out the experiments. Experimental results show that equal error rates decrease from 8.0% for GMM to 4.6% for GMM-SVM.

摘要

声道功能的声学特征在病理嗓音检测研究中得到了广泛应用。通过声学参数对正常和病理嗓音进行分类是诊断嗓音疾病的一种有效方法。在这方面，梅尔频率倒谱系数与传统分类器（如高斯混合模型 (GMM)）相结合被证明是有效的。然而，分类方法的准确性可以进一步提高。在本文中，将高斯混合模型超矢量核支持向量机 (GMM-SVM) 分类器与 GMM 分类器进行了比较，用于检测嗓音病理。我们发现，持续元音发声可以以 96.1%的准确率被分类为正常或病理。实验从 Kay 数据库中选择了语音录音进行实验。实验结果表明，对于 GMM，等错误率从 8.0%降低到了 GMM-SVM 的 4.6%。

相似文献

Discrimination between pathological and normal voices using GMM-SVM approach.基于 GMM-SVM 方法的病理性嗓音与正常嗓音的区分。

J Voice. 2011 Jan;25(1):38-43. doi: 10.1016/j.jvoice.2009.08.002. Epub 2010 Feb 4.

On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.结合调制谱和梅尔频率倒谱系数信息用于病理性嗓音自动检测

Logoped Phoniatr Vocol. 2011 Jul;36(2):60-9. doi: 10.3109/14015439.2010.528788. Epub 2010 Nov 12.

Multidirectional regression (MDR)-based features for automatic voice disorder detection.基于多方向回归 (MDR) 的特征用于自动语音障碍检测。

J Voice. 2012 Nov;26(6):817.e19-27. doi: 10.1016/j.jvoice.2012.05.002.

Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.基于全极点模型，通过估计听觉频谱和倒谱系数，对连续语音进行自动语音病理学检测。

J Voice. 2016 Nov;30(6):757.e7-757.e19. doi: 10.1016/j.jvoice.2015.08.010. Epub 2015 Oct 27.

Towards objective evaluation of perceived roughness and breathiness: an approach based on mel-frequency cepstral analysis.迈向对感知粗糙度和呼吸声的客观评估：一种基于梅尔频率倒谱分析的方法。

Logoped Phoniatr Vocol. 2011 Jul;36(2):52-9. doi: 10.3109/14015439.2010.517551. Epub 2010 Sep 17.

Automatic detection of pathological voices using complexity measures, noise parameters, and mel-cepstral coefficients.使用复杂度测度、噪声参数和梅尔倒谱系数自动检测病理性嗓音。

IEEE Trans Biomed Eng. 2011 Feb;58(2):370-9. doi: 10.1109/TBME.2010.2089052.

Discrimination of "hot potato voice" caused by upper airway obstruction utilizing a support vector machine.

Laryngoscope. 2019 Jun;129(6):1301-1307. doi: 10.1002/lary.27584. Epub 2018 Nov 28.

Instrumental dimensioning of normal and pathological phonation using acoustic measurements.使用声学测量对正常和病理性发声进行仪器测量。

Clin Linguist Phon. 2008 Jun;22(6):407-20. doi: 10.1080/02699200701830869.

Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task.在语音诊所中进行可靠的抖动和颤抖测量：在典型临床任务中母音、性别、发声强度和基频效应的相关性。

J Voice. 2011 Jan;25(1):44-53. doi: 10.1016/j.jvoice.2009.07.002. Epub 2010 Apr 8.

Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters.基于高斯混合模型和短时倒谱参数的病理性嗓音质量评估系统的降维

IEEE Trans Biomed Eng. 2006 Oct;53(10):1943-53. doi: 10.1109/TBME.2006.871883.

引用本文的文献

The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: Systematic Review and Meta-analysis.监督机器学习在筛查和诊断嗓音障碍中的有效性：系统评价和荟萃分析。

J Med Internet Res. 2022 Oct 14;24(10):e38472. doi: 10.2196/38472.

Diagnosis of COVID-19 via acoustic analysis and artificial intelligence by monitoring breath sounds on smartphones.通过在智能手机上监测呼吸声的声学分析和人工智能诊断 COVID-19。

J Biomed Inform. 2022 Jun;130:104078. doi: 10.1016/j.jbi.2022.104078. Epub 2022 Apr 27.

Age estimation based on children's voice: a fuzzy-based decision fusion strategy.基于儿童声音的年龄估计：一种基于模糊的决策融合策略。

ScientificWorldJournal. 2014;2014:534064. doi: 10.1155/2014/534064. Epub 2014 Jun 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于 GMM-SVM 方法的病理性嗓音与正常嗓音的区分。

Discrimination between pathological and normal voices using GMM-SVM approach.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献