• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合调制谱和梅尔频率倒谱系数信息用于病理性嗓音自动检测

On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.

作者信息

Arias-Londoño Julián David, Godino-Llorente Juan I, Markaki Maria, Stylianou Yannis

机构信息

Universidad Politécnica de Madrid, Circuits & Systems Engineering, EUIT de Telecomunicación, Universidad Politécnica de Madrid, Ctra. Valencia, km 7, Madrid 28031, Spain.

出版信息

Logoped Phoniatr Vocol. 2011 Jul;36(2):60-9. doi: 10.3109/14015439.2010.528788. Epub 2010 Nov 12.

DOI:10.3109/14015439.2010.528788
PMID:21073260
Abstract

This work presents a novel approach for the automatic detection of pathological voices based on fusing the information extracted by means of mel-frequency cepstral coefficients (MFCC) and features derived from the modulation spectra (MS). The system proposed uses a two-stepped classification scheme. First, the MFCC and MS features were used to feed two different and independent classifiers; and then the outputs of each classifier were used in a second classification stage. In order to establish the best configuration which provides the highest accuracy in the detection, the fusion of information was carried out employing different classifier combination strategies. The experiments were carried out using two different databases: the one developed by The Massachusetts Eye and Ear Infirmary Voice Laboratory, and a database recorded by the Universidad Politécnica de Madrid. The results show that the combination of MFCC and MS features employing the proposed approach yields an improvement in the detection accuracy, demonstrating that both methods of parameterization are complementary.

摘要

这项工作提出了一种基于融合通过梅尔频率倒谱系数(MFCC)提取的信息和从调制谱(MS)导出的特征来自动检测病理性嗓音的新方法。所提出的系统采用两步分类方案。首先,MFCC和MS特征用于输入两个不同且独立的分类器;然后每个分类器的输出用于第二个分类阶段。为了确定在检测中提供最高准确率的最佳配置,采用不同的分类器组合策略进行信息融合。实验使用了两个不同的数据库:一个由马萨诸塞州眼耳医院嗓音实验室开发,另一个由马德里理工大学录制的数据库。结果表明,采用所提出的方法融合MFCC和MS特征可提高检测准确率,表明两种参数化方法是互补的。

相似文献

1
On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.结合调制谱和梅尔频率倒谱系数信息用于病理性嗓音自动检测
Logoped Phoniatr Vocol. 2011 Jul;36(2):60-9. doi: 10.3109/14015439.2010.528788. Epub 2010 Nov 12.
2
Towards objective evaluation of perceived roughness and breathiness: an approach based on mel-frequency cepstral analysis.迈向对感知粗糙度和呼吸声的客观评估:一种基于梅尔频率倒谱分析的方法。
Logoped Phoniatr Vocol. 2011 Jul;36(2):52-9. doi: 10.3109/14015439.2010.517551. Epub 2010 Sep 17.
3
Multidirectional regression (MDR)-based features for automatic voice disorder detection.基于多方向回归 (MDR) 的特征用于自动语音障碍检测。
J Voice. 2012 Nov;26(6):817.e19-27. doi: 10.1016/j.jvoice.2012.05.002.
4
Automatic detection of pathological voices using complexity measures, noise parameters, and mel-cepstral coefficients.使用复杂度测度、噪声参数和梅尔倒谱系数自动检测病理性嗓音。
IEEE Trans Biomed Eng. 2011 Feb;58(2):370-9. doi: 10.1109/TBME.2010.2089052.
5
Discrimination between pathological and normal voices using GMM-SVM approach.基于 GMM-SVM 方法的病理性嗓音与正常嗓音的区分。
J Voice. 2011 Jan;25(1):38-43. doi: 10.1016/j.jvoice.2009.08.002. Epub 2010 Feb 4.
6
Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.基于全极点模型,通过估计听觉频谱和倒谱系数,对连续语音进行自动语音病理学检测。
J Voice. 2016 Nov;30(6):757.e7-757.e19. doi: 10.1016/j.jvoice.2015.08.010. Epub 2015 Oct 27.
7
Validity of jitter measures in non-quasi-periodic voices. Part I: perceptual and computer performances in cycle pattern recognition.非准周期性嗓音中抖动测量的有效性。第一部分:周期模式识别中的感知与计算机性能
Logoped Phoniatr Vocol. 2011 Jul;36(2):70-7. doi: 10.3109/14015439.2011.578078.
8
Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors.通过短期倒谱参数和基于神经网络的检测器自动检测语音损伤。
IEEE Trans Biomed Eng. 2004 Feb;51(2):380-4. doi: 10.1109/TBME.2003.820386.
9
Intra- and Inter-database Study for Arabic, English, and German Databases: Do Conventional Speech Features Detect Voice Pathology?阿拉伯语、英语和德语数据库的库内及库间研究:传统语音特征能否检测语音病理学?
J Voice. 2017 May;31(3):386.e1-386.e8. doi: 10.1016/j.jvoice.2016.09.009. Epub 2016 Oct 10.
10
Validity of jitter measures in non-quasi-periodic voices. Part II: the effect of noise.非准周期性嗓音中抖动测量的有效性。第二部分:噪声的影响。
Logoped Phoniatr Vocol. 2011 Jul;36(2):78-89. doi: 10.3109/14015439.2011.578077. Epub 2011 May 24.

引用本文的文献

1
Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.用于嗓音障碍语音检测的稳健嗓音质量特征嵌入
IEEE/ACM Trans Audio Speech Lang Process. 2023;31:1348-1359. doi: 10.1109/taslp.2023.3261753. Epub 2023 Mar 28.
2
An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.多倒谱投影表示策略在嗓音障碍检测中的实验分析。
Sensors (Basel). 2023 May 30;23(11):5196. doi: 10.3390/s23115196.
3
The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: Systematic Review and Meta-analysis.
监督机器学习在筛查和诊断嗓音障碍中的有效性:系统评价和荟萃分析。
J Med Internet Res. 2022 Oct 14;24(10):e38472. doi: 10.2196/38472.
4
TrackUSF, a novel tool for automated ultrasonic vocalization analysis, reveals modified calls in a rat model of autism.TrackUSF,一种用于自动超声发声分析的新型工具,揭示了自闭症大鼠模型中经过修饰的叫声。
BMC Biol. 2022 Jul 12;20(1):159. doi: 10.1186/s12915-022-01299-y.
5
An Analytical Study of Speech Pathology Detection Based on MFCC and Deep Neural Networks.基于 MFCC 和深度神经网络的语音病理学检测分析研究。
Comput Math Methods Med. 2022 Apr 4;2022:7814952. doi: 10.1155/2022/7814952. eCollection 2022.
6
Voice Pathology Detection Using Modulation Spectrum-Optimized Metrics.基于调制谱优化指标的语音病理学检测
Front Bioeng Biotechnol. 2016 Jan 20;4:1. doi: 10.3389/fbioe.2016.00001. eCollection 2016.
7
Modulation Spectra Morphological Parameters: A New Method to Assess Voice Pathologies according to the GRBAS Scale.调制频谱形态学参数:一种根据GRBAS量表评估嗓音疾病的新方法。
Biomed Res Int. 2015;2015:259239. doi: 10.1155/2015/259239. Epub 2015 Oct 18.