• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

嗓音障碍患者合成声门区波形中额外脉冲的检测。

Detection of extra pulses in synthesized glottal area waveforms of dysphonic voices.

作者信息

Aichinger P, Pernkopf F, Schoentgen J

机构信息

Division of Phoniatrics-Logopedics, Department of Otorhinolaryngology, Medical University of Vienna, Waehringer Guertel 18-20, 1090, Vienna, Austria.

Signal Processing and Speech Communication Laboratory, Graz University of Technology, Inffeldgasse 16c/EG, 8010, Graz, Austria.

出版信息

Biomed Signal Process Control. 2019 Apr;50:158-167. doi: 10.1016/j.bspc.2019.01.007.

DOI:10.1016/j.bspc.2019.01.007
PMID:30996730
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6464090/
Abstract

BACKGROUND AND OBJECTIVES

The description of production kinematics of dysphonic voices plays an important role in the clinical care of voice disorders. However, high-speed videolaryngoscopy is not routinely used in clinical practice, partly because there is a lack of diagnostic markers that may be obtained from high-speed videos automatically. Aim of the study is to propose and test a procedure that automatically detects extra pulses, which may occur in voiced source signals of pathological voices in addition to cyclic pulses.

MATERIAL AND METHODS

Glottal area waveforms (GAW) are synthesized and used to test a detector for extra pulses. Regarding synthesis, for each GAW a cyclic pulse train is mixed with an extra pulse train, and additive noise. The cyclic pulse trains are varied across GAWs in terms of fundamental frequency, pulse shape, and modulation noise, i.e., jitter and shimmer. The extra pulse trains are varied across GAWs in terms of the height of the extra pulses, and their rates of occurrence. The energy level of the additive noise is also varied. Regarding detection, first, the fundamental frequency is estimated jointly with the cyclic pulse train waveform, second, the modulation noise is estimated, and finally the extra pulse train waveform is estimated. Two versions of the detector are compared, i.e., one that parameterizes the shapes of the cyclic pulses, and one that uses unparameterized pulse shape estimates. Two corpora are used for testing, i.e., one with 100 GAWs containing random extra pulses, and one with 25 GAWs containing extra pulses in the closed phases of each glottal phase representing subharmonic voices.

RESULTS AND DISCUSSION

With pulse shape parameterization (PSP) a maximum mean accuracy of 88.3% is achieved when detecting random extra pulses. Without PSP, the maximum mean accuracy reduces to 82.9%. Detection performance decreases if the energy level of additive noise is higher than -25 dB with respect to the energy of the cyclic pulse train, and if the irregularity strength exceeds 0.1. For bicyclic, i.e., subharmonic voices, the approach fails without PSP, whereas with PSP, a mean sensitivity of 87.4% is achieved for subharmonic voices.

CONCLUSION

A synthesizer for GAWs containing extra pulses, and a detector for extra pulses are proposed. With PSP, favorable detector performance is observed for not too high levels of additive noise and irregularity strengths. In signals with high noise levels, the detector without PSP outperforms the other one. Detection of extra pulses fails if irregularity strength is large. For subharmonic voices PSP must be used.

摘要

背景与目的

嗓音障碍的发声运动学描述在嗓音疾病的临床护理中起着重要作用。然而,高速视频喉镜在临床实践中并未常规使用,部分原因是缺乏可从高速视频中自动获取的诊断标志物。本研究的目的是提出并测试一种程序,该程序能自动检测额外脉冲,这些额外脉冲可能出现在病理性嗓音的发声源信号中,除了周期性脉冲之外。

材料与方法

合成声门面积波形(GAW)并用于测试额外脉冲的检测器。关于合成,对于每个GAW,将一个周期性脉冲序列与一个额外脉冲序列以及加性噪声混合。周期性脉冲序列在不同的GAW之间,在基频、脉冲形状和调制噪声(即抖动和闪烁)方面有所变化。额外脉冲序列在不同的GAW之间,在额外脉冲的高度及其出现率方面有所变化。加性噪声的能量水平也有所变化。关于检测,首先,联合估计基频和周期性脉冲序列波形,其次,估计调制噪声,最后估计额外脉冲序列波形。比较了检测器的两个版本,即一个对周期性脉冲的形状进行参数化的版本,和一个使用未参数化脉冲形状估计的版本。使用两个语料库进行测试,一个包含100个带有随机额外脉冲的GAW,另一个包含25个在每个声门相位的闭合阶段带有额外脉冲的GAW,这些额外脉冲代表次谐波嗓音。

结果与讨论

对于检测随机额外脉冲,采用脉冲形状参数化(PSP)时,最大平均准确率达到88.3%。不采用PSP时,最大平均准确率降至82.9%。如果加性噪声的能量水平相对于周期性脉冲序列的能量高于 -25 dB,并且不规则强度超过0.1,则检测性能会下降。对于双周期的,即次谐波嗓音,不采用PSP时该方法失败,而采用PSP时,对于次谐波嗓音平均灵敏度达到87.4%。

结论

提出了一种用于包含额外脉冲的GAW的合成器和一种用于额外脉冲的检测器。采用PSP时,对于不太高的加性噪声水平和不规则强度,观察到检测器性能良好。在高噪声水平的信号中,不采用PSP的检测器优于另一个。如果不规则强度较大,则无法检测到额外脉冲。对于次谐波嗓音必须使用PSP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dd4/6464090/924dc537ecd2/emss-82231-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dd4/6464090/924dc537ecd2/emss-82231-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dd4/6464090/924dc537ecd2/emss-82231-f001.jpg

相似文献

1
Detection of extra pulses in synthesized glottal area waveforms of dysphonic voices.嗓音障碍患者合成声门区波形中额外脉冲的检测。
Biomed Signal Process Control. 2019 Apr;50:158-167. doi: 10.1016/j.bspc.2019.01.007.
2
Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals.合成语音信号基频微扰和幅度微扰测量的源-滤波器比较
J Voice. 2008 Mar;22(2):125-37. doi: 10.1016/j.jvoice.2006.09.007. Epub 2006 Dec 4.
3
Spectral characterization of jitter, shimmer, and additive noise in synthetically generated voice signals.
J Acoust Soc Am. 2000 Feb;107(2):978-88. doi: 10.1121/1.428272.
4
Diplophonia Disturbs Jitter and Shimmer Measurement.双音干扰基频微扰和振幅微扰测量。
Folia Phoniatr Logop. 2016;68(1):22-8. doi: 10.1159/000447589. Epub 2016 Jul 21.
5
Severity of voice disorders: integration of perceptual and acoustic data in dysphonic patients.嗓音障碍的严重程度:发音障碍患者感知和声学数据的整合
Codas. 2014 Sep-Oct;26(5):382-8. doi: 10.1590/2317-1782/20142013033.
6
Multiparametric analysis of vocal fold vibrations in healthy and disordered voices in high-speed imaging.高速成像中健康和失调嗓音的声带振动的多参数分析。
J Voice. 2011 Sep;25(5):576-90. doi: 10.1016/j.jvoice.2010.04.004. Epub 2010 Aug 21.
7
Validity of jitter measures in non-quasi-periodic voices. Part II: the effect of noise.非准周期性嗓音中抖动测量的有效性。第二部分:噪声的影响。
Logoped Phoniatr Vocol. 2011 Jul;36(2):78-89. doi: 10.3109/14015439.2011.578077. Epub 2011 May 24.
8
Speech synthesis by glottal excited linear prediction.基于声门激励线性预测的语音合成。
J Acoust Soc Am. 1994 Oct;96(4):2026-36. doi: 10.1121/1.411319.
9
Suitability of acoustic perturbation measures in analysing periodic and nearly periodic voice signals.声学微扰测量在分析周期性和近周期性语音信号中的适用性。
Folia Phoniatr Logop. 2005 Jan-Feb;57(1):38-47. doi: 10.1159/000081960.
10
Reliable determination of pulse-shape instability in trains of ultrashort laser pulses using frequency-resolved optical gating.利用频率分辨光学门控技术可靠地测定超短激光脉冲序列中的脉冲形状不稳定性。
Sci Rep. 2022 Dec 5;12(1):21006. doi: 10.1038/s41598-022-25193-3.

引用本文的文献

1
How to analyse and manipulate nonlinear phenomena in voice recordings.如何分析和处理语音记录中的非线性现象。
Philos Trans R Soc Lond B Biol Sci. 2025 Apr 3;380(1923):20240003. doi: 10.1098/rstb.2024.0003.

本文引用的文献

1
Testing two crackle criteria using modified jet noise waveforms.使用修正的喷射噪声波形测试两种啰音标准。
J Acoust Soc Am. 2017 Jun;141(6):EL549. doi: 10.1121/1.4984819.
2
Advanced waveform decomposition for high-speed videoendoscopy analysis.高速视频内镜分析的先进波形分解。
J Voice. 2013 May;27(3):369-75. doi: 10.1016/j.jvoice.2013.01.004. Epub 2013 Mar 13.
3
Development and perceptual assessment of a synthesizer of disordered voices.紊乱语音合成器的开发与感知评估。
J Acoust Soc Am. 2012 Oct;132(4):2603-15. doi: 10.1121/1.4751536.
4
Mitigation of temporal aliasing via harmonic modeling of laryngeal waveforms in high-speed videoendoscopy.通过高速视频内镜中声带波的谐波建模来减轻时间混淆。
J Acoust Soc Am. 2012 Sep;132(3):1636-45. doi: 10.1121/1.4742730.
5
Current role of stroboscopy in laryngeal imaging.频闪喉镜在喉部成像中的当前作用。
Curr Opin Otolaryngol Head Neck Surg. 2012 Dec;20(6):429-36. doi: 10.1097/MOO.0b013e3283585f04.
6
Kymographic imaging of laryngeal vibrations.喉部振动的记波成像
Curr Opin Otolaryngol Head Neck Surg. 2012 Dec;20(6):458-65. doi: 10.1097/MOO.0b013e3283581feb.
7
Videokymography in voice disorders: what to look for?嗓音障碍中的视频动态镜检查:应关注什么?
Ann Otol Rhinol Laryngol. 2007 Mar;116(3):172-80. doi: 10.1177/000348940711600303.
8
A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS).一种用于嗓音病理学功能评估的基本方案,尤其用于研究(嗓音外科)治疗的疗效和评估新的评估技术。由欧洲喉科学会(ELS)嗓音病学委员会制定的指南。
Eur Arch Otorhinolaryngol. 2001 Feb;258(2):77-82. doi: 10.1007/s004050000299.
9
Analysis, synthesis, and perception of voice quality variations among female and male talkers.对男女说话者语音质量变化的分析、综合和感知。
J Acoust Soc Am. 1990 Feb;87(2):820-57. doi: 10.1121/1.398894.