• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用改进的扩散混沌技术对人类发声中的嗓音类型成分进行量化。

Quantification of Voice Type Components Present in Human Phonation Using a Modified Diffusive Chaos Technique.

作者信息

Liu Boquan, Polce Evan, Raj Hayley, Jiang Jack

机构信息

1 Department of Surgery-Division of Otolaryngology, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA.

出版信息

Ann Otol Rhinol Laryngol. 2019 Oct;128(10):921-931. doi: 10.1177/0003489419848451. Epub 2019 May 14.

DOI:10.1177/0003489419848451
PMID:31084359
Abstract

PURPOSE

Signal typing has been used to categorize healthy and disordered voices; however, human voices are likely comprised of differing proportions of periodic type 1 elements, type 2 elements that are periodic with modulations, aperiodic type 3 elements, and stochastic type 4 elements. A novel diffusive chaos method is presented to detect the distribution of voice types within a signal with the goal of providing an objective and clinically useful tool for evaluating the voice. It was predicted that continuous calculation of the diffusive chaos parameter throughout the voice sample would allow for construction of comprehensive voice type component profiles (VTCP).

METHODS

One hundred thirty-five voice samples of sustained /a/ vowels were randomly selected from the Disordered Voice Database Model 4337. All samples were classified according to the voice type paradigm using spectrogram analysis, yielding 34 type 1, 35 type 2, 42 type 3, and 24 type 4 voice samples. All samples were then analyzed using the diffusive chaos method, and VTCPs were generated to show the distribution of the 4 voice type components (VTC).

RESULTS

The proportions of VTC varied significantly between the majority of the traditional voice types ( < .001). Three of the 4 VTCs of type 3 voices were significantly different from the VTCs of type 4 voices ( < .001). These results were compared to calculations of spectrum convergence ratio, which did not vary significantly between voice types 1 and 2 or 2 and 3.

CONCLUSION

The diffusive chaos method demonstrates proficiency in generating comprehensive VTCPs for disordered voices with varying severity. In contrast to acoustic parameters that provide a single measure of disorder, VTCPs can be used to detect subtler changes by observing variations in each VTC over time. This method also provides the advantage of quantifying stochastic noise components that are due to breathiness in the voice.

摘要

目的

信号分型已被用于对健康和紊乱的嗓音进行分类;然而,人类嗓音可能由不同比例的周期性1型元素、带有调制的周期性2型元素、非周期性3型元素和随机性4型元素组成。本文提出一种新的扩散混沌方法来检测信号中嗓音类型的分布,目的是为嗓音评估提供一种客观且临床有用的工具。据预测,在整个嗓音样本中持续计算扩散混沌参数将有助于构建全面的嗓音类型成分剖面图(VTCP)。

方法

从紊乱嗓音数据库模型4337中随机选取135个持续发/a/元音的嗓音样本。使用频谱图分析根据嗓音类型范式对所有样本进行分类,得到34个1型、35个2型、42个3型和24个4型嗓音样本。然后使用扩散混沌方法对所有样本进行分析,并生成VTCP以显示4种嗓音类型成分(VTC)的分布。

结果

大多数传统嗓音类型之间的VTC比例差异显著(<.001)。3型嗓音的4个VTC中有3个与4型嗓音的VTC显著不同(<.001)。将这些结果与频谱收敛率的计算结果进行比较,频谱收敛率在1型和2型或2型和3型嗓音之间没有显著差异。

结论

扩散混沌方法在为不同严重程度的紊乱嗓音生成全面的VTCP方面表现出优势。与提供单一紊乱测量值的声学参数不同,VTCP可通过观察每个VTC随时间的变化来检测更细微的变化。该方法还具有量化因嗓音呼吸声导致的随机噪声成分的优势。

相似文献

1
Quantification of Voice Type Components Present in Human Phonation Using a Modified Diffusive Chaos Technique.使用改进的扩散混沌技术对人类发声中的嗓音类型成分进行量化。
Ann Otol Rhinol Laryngol. 2019 Oct;128(10):921-931. doi: 10.1177/0003489419848451. Epub 2019 May 14.
2
Application of Local Intrinsic Dimension for Acoustical Analysis of Voice Signal Components.局部本征维数在语音信号成分声学分析中的应用。
Ann Otol Rhinol Laryngol. 2018 Sep;127(9):588-597. doi: 10.1177/0003489418780439. Epub 2018 Jun 17.
3
Using Rate of Divergence as an Objective Measure to Differentiate between Voice Signal Types Based on the Amount of Disorder in the Signal.使用发散率作为一种客观度量,基于信号中的紊乱程度来区分语音信号类型。
J Voice. 2017 Jan;31(1):16-23. doi: 10.1016/j.jvoice.2016.01.005. Epub 2016 Feb 23.
4
An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution.基于能量分布变化的语音信号分类的客观参数。
J Voice. 2019 Sep;33(5):591-602. doi: 10.1016/j.jvoice.2018.02.011. Epub 2018 May 18.
5
The Influence of Voice Training on Vocal Learner's Objective Acoustic Voice Components.
J Voice. 2023 May;37(3):355-361. doi: 10.1016/j.jvoice.2021.01.011. Epub 2021 Feb 27.
6
Evaluating the Voice Type Component Distributions of Excised Larynx Phonations at Three Subglottal Pressures.评估三种声门下压力下切除喉发声的声音类型分量分布。
J Speech Lang Hear Res. 2021 May 11;64(5):1447-1456. doi: 10.1044/2021_JSLHR-20-00429. Epub 2021 Apr 22.
7
Automatic modeling of acoustic perception of breathiness in pathological voices.病理性嗓音中呼吸音声学感知的自动建模
IEEE Trans Biomed Eng. 2009 Apr;56(4):932-40. doi: 10.1109/TBME.2008.2007910.
8
Acoustic analysis of aperiodic voice: perturbation and nonlinear dynamic properties in esophageal phonation.非周期性嗓音的声学分析:食管发声中的微扰与非线性动力学特性
J Voice. 2009 May;23(3):283-90. doi: 10.1016/j.jvoice.2007.10.004. Epub 2008 Apr 14.
9
Applied Chaos Level Test for Validation of Signal Conditions Underlying Optimal Performance of Voice Classification Methods.应用混沌水平测试验证语音分类方法最佳性能背后的信号条件
J Speech Lang Hear Res. 2018 May 17;61(5):1130-1139. doi: 10.1044/2018_JSLHR-S-17-0250.
10
Instrumental dimensioning of normal and pathological phonation using acoustic measurements.使用声学测量对正常和病理性发声进行仪器测量。
Clin Linguist Phon. 2008 Jun;22(6):407-20. doi: 10.1080/02699200701830869.

引用本文的文献

1
Acoustic Analysis of Blues, Country, Folk, Italian Opera, and Rock Singing.蓝调、乡村、民谣、意大利歌剧和摇滚演唱的声学分析。
J Voice. 2025 Jun 24. doi: 10.1016/j.jvoice.2025.05.018.
2
Voice Type Component Profile Model of Glottal Gap Voice in Ex Vivo Canine Larynges.离体犬喉声门间隙性嗓音的嗓音类型成分剖析模型
J Voice. 2024 Oct 22. doi: 10.1016/j.jvoice.2024.09.045.
3
[Current methods of acoustic analysis of voice: a review].[当前嗓音声学分析方法综述]
Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2022 Dec;36(12):966-970;976. doi: 10.13201/j.issn.2096-7993.2022.12.016.
4
Do the Nonlinear Dynamic Acoustic Measurements, Nonlinear Energy Difference Ratio and Spectrum Convergence Ratio, Correlate with Perceptual Evaluation of Esophageal Voice Speakers?非线性动态声测量、非线性能量差比和频谱收敛比是否与食管语音患者的感知评估相关?
J Voice. 2024 Nov;38(6):1278-1287. doi: 10.1016/j.jvoice.2022.06.004. Epub 2022 Jul 9.
5
Monitoring the Outcome of Phonosurgery and Vocal Exercises with Established and New Diagnostic Tools.监测嗓音外科手术和发声练习的结果:使用既定和新的诊断工具。
Biomed Res Int. 2020 Jan 23;2020:4208189. doi: 10.1155/2020/4208189. eCollection 2020.