Voice Pathology Detection Using Modulation Spectrum-Optimized Metrics

Affiliation

Center for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain.

Publication information

Front Bioeng Biotechnol. 2016 Jan 20;4:1. doi: 10.3389/fbioe.2016.00001. eCollection 2016.

DOI: 10.3389/fbioe.2016.00001

PMID: 26835449

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4718980/

Abstract

Many acoustic parameters are used in pathological assessment tasks and serve clinicians as tools for distinguishing between normophonic and pathological voices. However, many of these parameters require appropriate tuning to maximize their efficiency. In this work, a group of new and previously proposed modulation spectrum (MS) metrics is optimized over different time and frequency ranges to maximize efficiency in the detection of pathological voices. The metrics are optimized simultaneously on two different voice databases to identify which tuning ranges generalize better. The experiments were cross-validated to ensure the validity of the results. A third database is used to test the optimized metrics. Despite some differences, the results indicate that the behavior of the metrics during optimization follows similar tendencies in both tuning databases, confirming the generalization capabilities of the proposed MS metrics. In addition, the tuning process reveals which bands of the modulation spectra carry relevant information for each metric, which has a physical interpretation with respect to the phonatory system. Efficiency values of up to 90.6% are obtained in one tuning database, while in the other the maximum efficiency reaches 71.1%. The results also show separability between normophonic and pathological states using the proposed metrics, which can be exploited for voice pathology detection or assessment.
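
The abstract does not spell out how a modulation spectrum is computed. As a rough illustration only, the sketch below uses one common formulation: take short-time magnitude spectra of the signal, then apply a second Fourier transform along time to the envelope of each acoustic frequency bin. This is not the authors' implementation; the function names, the Hann window, the frame length, the hop size, and the synthetic test signal are all assumptions chosen to keep the example small and dependency-free.

```python
import cmath
import math


def dft_magnitude(x):
    """Magnitude of the DFT of a real sequence, bins 0..N/2 (naive O(N^2))."""
    n = len(x)
    return [
        abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def modulation_spectrum(signal, frame_len, hop):
    """Return ms[acoustic_bin][modulation_bin] for a real-valued signal."""
    # Stage 1: Hann-windowed short-time magnitude spectra.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
              for i in range(frame_len)]
    frames = [
        dft_magnitude([s * w for s, w in zip(signal[start:start + frame_len], window)])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]
    # Stage 2: per acoustic bin, spectrum of the mean-removed envelope over time.
    ms = []
    for k in range(frame_len // 2 + 1):
        envelope = [frame[k] for frame in frames]
        mean = sum(envelope) / len(envelope)
        ms.append(dft_magnitude([e - mean for e in envelope]))
    return ms


# Hypothetical test signal: a 200 Hz tone, amplitude-modulated at 10 Hz.
FS = 1000                 # sampling rate in Hz (assumed)
FRAME_LEN, HOP = 50, 10   # 20 Hz acoustic resolution, 100 frames per second
N_SAMPLES = 1040          # gives exactly 100 frames, so 1 Hz modulation resolution
signal = [
    (1 + 0.5 * math.cos(2 * math.pi * 10 * n / FS))
    * math.sin(2 * math.pi * 200 * n / FS)
    for n in range(N_SAMPLES)
]

ms = modulation_spectrum(signal, FRAME_LEN, HOP)
n_frames = (N_SAMPLES - FRAME_LEN) // HOP + 1

# Locate the strongest component and convert bin indices to Hz.
peak_value, a_bin, m_bin = max(
    (value, a, m) for a, row in enumerate(ms) for m, value in enumerate(row)
)
acoustic_hz = a_bin * FS / FRAME_LEN
modulation_hz = m_bin * (FS / HOP) / n_frames
print(acoustic_hz, modulation_hz)
```

For this synthetic signal the strongest component is expected at the carrier's acoustic frequency and the modulator's modulation frequency. Restricting the two axes of `ms` to sub-ranges is one way to picture the kind of time and frequency range tuning the abstract describes, although the metrics actually optimized in the paper are not reproduced here.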

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ab9/4718980/c9a1b0915290/fbioe-04-00001-g001.jpg

Similar articles

Voice Pathology Detection Using Modulation Spectrum-Optimized Metrics.

Front Bioeng Biotechnol. 2016 Jan 20;4:1. doi: 10.3389/fbioe.2016.00001. eCollection 2016.

Corrigendum: Voice Pathology Detection Using Modulation Spectrum-Optimized Metrics.

Front Bioeng Biotechnol. 2016 Aug 24;4:67. doi: 10.3389/fbioe.2016.00067. eCollection 2016.

Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions.

J Voice. 2017 Jan;31(1):3-15. doi: 10.1016/j.jvoice.2016.01.014. Epub 2016 Mar 15.

Voice pathology detection based on short-term jitter estimations in running speech.

Folia Phoniatr Logop. 2009;61(3):153-70. doi: 10.1159/000219951. Epub 2009 Jul 1.

[The application of short-term efficiency analysis in diagnosing occupational voice disorders].

Med Pr. 2015;66(2):225-34. doi: 10.13075/mp.5893.00155.

The Acoustic Voice Quality Index: toward improved treatment outcomes assessment in voice disorders.

J Commun Disord. 2010 May-Jun;43(3):161-74. doi: 10.1016/j.jcomdis.2009.12.004. Epub 2009 Dec 23.

Convolutional Neural Networks for Pathological Voice Detection.

Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:1-4. doi: 10.1109/EMBC.2018.8513222.

Using modulation spectra for voice pathology detection and classification.

Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:2514-7. doi: 10.1109/IEMBS.2009.5334850.

An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification.

J Voice. 2017 Jan;31(1):113.e9-113.e18. doi: 10.1016/j.jvoice.2016.03.019. Epub 2016 Apr 19.

Validation of the Acoustic Voice Quality Index in the Japanese Language.

J Voice. 2017 Mar;31(2):260.e1-260.e9. doi: 10.1016/j.jvoice.2016.05.010. Epub 2016 Jun 7.

Cited by

Viscous damping of tremor using a wearable robot with an optimized mechanical metamaterial.

Wearable Technol. 2024 Dec 10;5:e20. doi: 10.1017/wtc.2024.15. eCollection 2024.

References

Modulation Spectra Morphological Parameters: A New Method to Assess Voice Pathologies according to the GRBAS Scale.

Biomed Res Int. 2015;2015:259239. doi: 10.1155/2015/259239. Epub 2015 Oct 18.

Discriminating simulated vocal tremor source using amplitude modulation spectra.

J Voice. 2015 Mar;29(2):140-7. doi: 10.1016/j.jvoice.2014.07.020. Epub 2014 Dec 19.

Assessment of voice quality: Current state-of-the-art.

Auris Nasus Larynx. 2015 Jun;42(3):183-8. doi: 10.1016/j.anl.2014.11.001. Epub 2014 Nov 28.

Automatic detection of pathological voices using complexity measures, noise parameters, and mel-cepstral coefficients.

IEEE Trans Biomed Eng. 2011 Feb;58(2):370-9. doi: 10.1109/TBME.2010.2089052.

On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.

Logoped Phoniatr Vocol. 2011 Jul;36(2):60-9. doi: 10.3109/14015439.2010.528788. Epub 2010 Nov 12.

Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: Comparisons with auditory-perceptual judgements from the CAPE-V.

Clin Linguist Phon. 2010 Sep;24(9):742-58. doi: 10.3109/02699206.2010.492446.

Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness.

J Voice. 2010 Nov;24(6):667-77. doi: 10.1016/j.jvoice.2009.04.003. Epub 2010 Mar 6.

The effectiveness of the glottal to noise excitation ratio for the screening of voice disorders.

J Voice. 2010 Jan;24(1):47-56. doi: 10.1016/j.jvoice.2008.04.006. Epub 2009 Jan 9.

Acoustic analysis of voice using WPCVox: a comparative study with Multi Dimensional Voice Program.

Eur Arch Otorhinolaryngol. 2008 Apr;265(4):465-76. doi: 10.1007/s00405-007-0467-x. Epub 2007 Oct 9.

Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features.

J Voice. 2008 Mar;22(2):155-63. doi: 10.1016/j.jvoice.2006.09.003. Epub 2006 Oct 30.
