• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用迭代期望最大化算法进行声带病变检测的直接语音特征估计

Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection.

作者信息

Gavidia-Ceballos L, Hansen J H

机构信息

Department of Biomedical Engineering, Duke University, Durham, NC 37708-0291, USA.

出版信息

IEEE Trans Biomed Eng. 1996 Apr;43(4):373-83. doi: 10.1109/10.486257.

DOI:10.1109/10.486257
PMID:8626186
Abstract

The focus of this study is to formulate a speech parameter estimation algorithm for analysis/detection of vocal fold pathology. The speech processing algorithm proposed estimates features necessary to formulate a stochastic model to characterize healthy and pathology conditions from speech recordings. The general idea is to separate speech components under healthy and assumed pathology conditions. This problem is addressed using an iterative maximum-likelihood (ML) estimation procedure, based on the estimation-maximization (EM) algorithm. A new feature for characterizing pathology, termed enhanced-spectral-pathology component (ESPC), is estimated and shown to vary consistently between healthy and pathology conditions. It is also shown that the mean-area-peak-value (MAPV) and the weighted-slope (WSLOPE) indexes, which are obtained from the ESPC estimate, are meaningful measures of speech pathology conditions. For classification purposes, a five-state hidden-Markov-model (HMM) recognizer was formulated, based on the MAPV, WSLOPE, and ESPC spectral features. A set of log Mel-frequency filter bank coefficients were used to parameterize the ESPC feature. An evaluation of the HMM-based classifier was performed using speech recordings from healthy and vocal fold cancer patients of sustained vowel sounds. It is shown that while both MAPV and WSLOPE are useful features for vocal fold pathology detection, superior performance was achieved using a finer spectral representation of ESPC (e.g., a detection rate of 88.7% for pathology and 92.8% for healthy condition). One main advantage of the proposed method is that it does not require direct estimation of the glottal flow waveform. Therefore, the limitation of the inability to characterize vocal fold pathology, due to incomplete glottal closure, is no longer an issue. The results suggest that general analysis of the ESPC feature can provide a quantitative, noninvasive approach for analysis, detection, and characterization of speech production under vocal fold pathology.

摘要

本研究的重点是制定一种语音参数估计算法,用于分析/检测声带病变。所提出的语音处理算法估计了构建随机模型所需的特征,以便根据语音记录来表征健康和病变状况。总体思路是在健康和假定的病变状况下分离语音成分。基于期望最大化(EM)算法,使用迭代最大似然(ML)估计程序来解决这个问题。估计了一种用于表征病变的新特征,称为增强频谱病变成分(ESPC),并表明其在健康和病变状况之间存在一致变化。还表明,从ESPC估计中获得的平均面积峰值(MAPV)和加权斜率(WSLOPE)指标是语音病变状况的有意义度量。为了进行分类,基于MAPV、WSLOPE和ESPC频谱特征构建了一个五状态隐马尔可夫模型(HMM)识别器。使用一组对数梅尔频率滤波器组系数对ESPC特征进行参数化。使用来自健康人和声带癌患者的持续元音语音记录对基于HMM的分类器进行了评估。结果表明,虽然MAPV和WSLOPE都是用于声带病变检测的有用特征,但使用ESPC更精细的频谱表示可实现更好的性能(例如,病变检测率为88.7%,健康状况检测率为92.8%)。所提出方法的一个主要优点是它不需要直接估计声门流波形。因此,由于声门闭合不完全而无法表征声带病变的局限性不再是一个问题。结果表明,对ESPC特征的一般分析可以为声带病变情况下语音产生的分析、检测和表征提供一种定量、非侵入性的方法。

相似文献

1
Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection.使用迭代期望最大化算法进行声带病变检测的直接语音特征估计
IEEE Trans Biomed Eng. 1996 Apr;43(4):373-83. doi: 10.1109/10.486257.
2
A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment.
IEEE Trans Biomed Eng. 1998 Mar;45(3):300-13. doi: 10.1109/10.661155.
3
Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients' speech signal with unilateral vocal fold paralysis.基于遗传算法的小波包特征优化选择在单侧声带麻痹患者语音信号病理评估中的应用
Comput Biol Med. 2007 Apr;37(4):474-85. doi: 10.1016/j.compbiomed.2006.08.016. Epub 2006 Oct 10.
4
Classification of unilateral vocal fold paralysis by endoscopic digital high-speed recordings and inversion of a biomechanical model.通过内镜数字高速记录和生物力学模型反演对单侧声带麻痹进行分类
IEEE Trans Biomed Eng. 2006 Jun;53(6):1099-108. doi: 10.1109/TBME.2006.873396.
5
[The use of the expectation-maximization (EM) algorithm for maximum likelihood estimation of gametic frequencies of multilocus polymorphic codominant systems based on sampled population data].[基于抽样群体数据,使用期望最大化(EM)算法对多位点共显性系统的配子频率进行最大似然估计]
Genetika. 2002 Mar;38(3):407-18.
6
A constraint-based evolutionary learning approach to the expectation maximization for optimal estimation of the hidden Markov model for speech signal modeling.一种基于约束的进化学习方法,用于语音信号建模的隐马尔可夫模型最优估计的期望最大化。
IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):182-97. doi: 10.1109/TSMCB.2008.2004051. Epub 2008 Dec 9.
7
Voice pathology detection based eon short-term jitter estimations in running speech.基于连续语音中短期抖动估计的嗓音病理学检测
Folia Phoniatr Logop. 2009;61(3):153-70. doi: 10.1159/000219951. Epub 2009 Jul 1.
8
Discrimination of pathological voices using a time-frequency approach.使用时频方法鉴别病理性嗓音。
IEEE Trans Biomed Eng. 2005 Mar;52(3):421-30. doi: 10.1109/TBME.2004.842962.
9
Vibration parameter extraction from endoscopic image series of the vocal folds.从声带的内镜图像序列中提取振动参数。
IEEE Trans Biomed Eng. 2002 Aug;49(8):773-81. doi: 10.1109/TBME.2002.800755.
10
Spectral pattern complexity analysis and the quantification of voice normality in healthy and radiotherapy patient groups.健康组和放疗患者组的频谱模式复杂性分析及嗓音正常度量化
Med Eng Phys. 2004 May;26(4):291-301. doi: 10.1016/j.medengphy.2004.01.005.

引用本文的文献

1
Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy.卷积神经网络可高精度地对喉癌中的病理性声音变化进行分类。
J Clin Med. 2020 Oct 25;9(11):3415. doi: 10.3390/jcm9113415.