Vocal quality factors: analysis, synthesis, and perception.

Authors

Childers D G, Lee C K

Affiliation

Department of Electrical Engineering, University of Florida, Gainesville 32611-2024.

Publication

J Acoust Soc Am. 1991 Nov;90(5):2394-410. doi: 10.1121/1.402044.

DOI: 10.1121/1.402044
PMID: 1837797

Abstract

The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.
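
The abstract names pulse width, pulse skewness, and a turbulent noise component as source parameters. As a rough illustration only, the sketch below generates one period of a Rosenberg-style glottal flow pulse, which is a swapped-in textbook shape and not the source model developed in the paper. The function names, the `open_quotient` and `speed_quotient` parameters (standing in for width and skewness), and the flow-modulated Gaussian noise are all assumptions made for this example; abruptness of closure is not modeled separately here.

```python
import math
import random


def rosenberg_pulse(n_samples, open_quotient, speed_quotient):
    """One period of a Rosenberg-style glottal flow pulse (illustrative only).

    open_quotient  : fraction of the period with the glottis open (pulse width)
    speed_quotient : opening time divided by closing time (pulse skewness)
    """
    n_open = int(round(open_quotient * n_samples))
    n_rise = int(round(n_open * speed_quotient / (1.0 + speed_quotient)))
    n_fall = n_open - n_rise
    pulse = []
    for n in range(n_samples):
        if n < n_rise:
            # Opening phase: raised-cosine rise from 0 toward 1.
            value = 0.5 * (1.0 - math.cos(math.pi * n / n_rise))
        elif n < n_open:
            # Closing phase: quarter-cosine fall from 1 toward 0.
            value = math.cos(0.5 * math.pi * (n - n_rise) / n_fall)
        else:
            # Closed phase: no flow.
            value = 0.0
        pulse.append(value)
    return pulse


def add_turbulent_noise(pulse, noise_level, seed=0):
    """Add Gaussian noise scaled by the flow, so the closed phase stays silent.

    This flow-modulated noise is an assumption of the sketch, not a claim
    about how the paper models turbulence.
    """
    rng = random.Random(seed)
    return [g + noise_level * g * rng.gauss(0.0, 1.0) for g in pulse]


if __name__ == "__main__":
    # Hypothetical settings, chosen only to show the effect of each parameter.
    narrow_skewed = rosenberg_pulse(100, open_quotient=0.4, speed_quotient=3.0)
    wide_symmetric = rosenberg_pulse(100, open_quotient=0.8, speed_quotient=1.0)
    noisy = add_turbulent_noise(wide_symmetric, noise_level=0.1)
    print(sum(1 for g in narrow_skewed if g > 0.0), "samples with flow (narrow)")
    print(sum(1 for g in wide_symmetric if g > 0.0), "samples with flow (wide)")
```

A larger `open_quotient` widens the pulse, and a larger `speed_quotient` pushes the peak later in the open phase so that the closing side becomes steeper. The parameter values used above are placeholders and are not taken from the study.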
