通过短期倒谱参数和基于神经网络的检测器自动检测语音损伤。

Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors.

作者信息

Godino-Llorente J I, Gómez-Vilda P

机构信息

Universidad Politécnica de Madrid, Escuela Universitaria de Ingeniería Técnica de Telecomunicación, Dpt. of Ingeniería de Circuitos y Sistemas, Ctra. Valencia Km. 7, 28031, Madrid.

出版信息

IEEE Trans Biomed Eng. 2004 Feb;51(2):380-4. doi: 10.1109/TBME.2003.820386.

DOI:10.1109/TBME.2003.820386

PMID:14765711

Abstract

It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders--including glottic cancer--under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.

摘要

众所周知，嗓音和语音疾病不一定会在声学语音信号中引起可察觉的变化。声学分析是诊断语音疾病的一种有用工具，是喉镜直接观察声带的其他方法的补充技术。通过本文，将研究两种基于神经网络的分类方法应用于语音障碍的自动检测。所研究的结构是多层感知器和学习向量量化，使用根据著名的梅尔频率系数倒谱参数化计算的短期向量进行馈送。本文表明，这些架构能够在高度可靠的条件下检测语音障碍，包括声门癌。在此背景下，学习向量量化方法被证明比多层感知器架构更可靠，在类似工作条件下产生96%的帧准确率。

相似文献

Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors.通过短期倒谱参数和基于神经网络的检测器自动检测语音损伤。

IEEE Trans Biomed Eng. 2004 Feb;51(2):380-4. doi: 10.1109/TBME.2003.820386.

Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters.基于高斯混合模型和短时倒谱参数的病理性嗓音质量评估系统的降维

IEEE Trans Biomed Eng. 2006 Oct;53(10):1943-53. doi: 10.1109/TBME.2006.871883.

Discrimination of pathological voices using a time-frequency approach.使用时频方法鉴别病理性嗓音。

IEEE Trans Biomed Eng. 2005 Mar;52(3):421-30. doi: 10.1109/TBME.2004.842962.

Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.基于全极点模型，通过估计听觉频谱和倒谱系数，对连续语音进行自动语音病理学检测。

J Voice. 2016 Nov;30(6):757.e7-757.e19. doi: 10.1016/j.jvoice.2015.08.010. Epub 2015 Oct 27.

Automatic assessment of voice quality according to the GRBAS scale.根据GRBAS量表自动评估嗓音质量。

Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2478-81. doi: 10.1109/IEMBS.2006.260603.

Acoustic analysis and detection of hypernasality using a group delay function.使用群延迟函数对高鼻音进行声学分析与检测。

IEEE Trans Biomed Eng. 2007 Apr;54(4):621-9. doi: 10.1109/TBME.2006.889191.

On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.结合调制谱和梅尔频率倒谱系数信息用于病理性嗓音自动检测

Logoped Phoniatr Vocol. 2011 Jul;36(2):60-9. doi: 10.3109/14015439.2010.528788. Epub 2010 Nov 12.

Towards objective evaluation of perceived roughness and breathiness: an approach based on mel-frequency cepstral analysis.迈向对感知粗糙度和呼吸声的客观评估：一种基于梅尔频率倒谱分析的方法。

Logoped Phoniatr Vocol. 2011 Jul;36(2):52-9. doi: 10.3109/14015439.2010.517551. Epub 2010 Sep 17.

Automated speech analysis applied to laryngeal disease categorization.应用于喉疾病分类的自动语音分析。

Comput Methods Programs Biomed. 2008 Jul;91(1):36-47. doi: 10.1016/j.cmpb.2008.01.008. Epub 2008 Mar 17.

Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology?阿拉伯语、英语和德语数据库的库内及库间研究：传统语音特征能否检测语音病理学？

J Voice. 2017 May;31(3):386.e1-386.e8. doi: 10.1016/j.jvoice.2016.09.009. Epub 2016 Oct 10.

引用本文的文献

Improving Voice Spoofing Detection Through Extensive Analysis of Multicepstral Feature Reduction.通过对多倒谱特征约简的广泛分析改进语音欺骗检测

Sensors (Basel). 2025 Aug 5;25(15):4821. doi: 10.3390/s25154821.

Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.使用可解释机器学习和临床医生评级来识别从音频记录中检测声带麻痹的模型中的偏差。

PLOS Digit Health. 2024 May 30;3(5):e0000516. doi: 10.1371/journal.pdig.0000516. eCollection 2024 May.

End-to-end deep learning classification of vocal pathology using stacked vowels.使用叠加元音的端到端深度学习进行嗓音病理学分类

Laryngoscope Investig Otolaryngol. 2023 Aug 31;8(5):1312-1318. doi: 10.1002/lio2.1144. eCollection 2023 Oct.

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.多倒谱投影表示策略在嗓音障碍检测中的实验分析。

Sensors (Basel). 2023 May 30;23(11):5196. doi: 10.3390/s23115196.

Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders.利用能量和统计特征进行语音障碍的自动诊断。

Diagnostics (Basel). 2022 Nov 11;12(11):2758. doi: 10.3390/diagnostics12112758.

The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: Systematic Review and Meta-analysis.监督机器学习在筛查和诊断嗓音障碍中的有效性：系统评价和荟萃分析。

J Med Internet Res. 2022 Oct 14;24(10):e38472. doi: 10.2196/38472.

Do you have COVID-19? An artificial intelligence-based screening tool for COVID-19 using acoustic parameters.你是否感染了 COVID-19？一种使用声学参数的基于人工智能的 COVID-19 筛查工具。

J Acoust Soc Am. 2021 Sep;150(3):1945. doi: 10.1121/10.0006104.

A new approach: information gain algorithm-based k-nearest neighbors hybrid diagnostic system for Parkinson's disease.一种新方法：基于信息增益算法的 k-最近邻混合诊断系统用于帕金森病。

Phys Eng Sci Med. 2021 Jun;44(2):511-524. doi: 10.1007/s13246-021-01001-6. Epub 2021 Apr 14.

X-Vectors: New Quantitative Biomarkers for Early Parkinson's Disease Detection From Speech.X向量：用于早期帕金森病语音检测的新型定量生物标志物。

Front Neuroinform. 2021 Feb 19;15:578369. doi: 10.3389/fninf.2021.578369. eCollection 2021.

medRxiv. 2024 Mar 20:2020.11.23.20235945. doi: 10.1101/2020.11.23.20235945.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过短期倒谱参数和基于神经网络的检测器自动检测语音损伤。

Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献