Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehab Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA.
J Acoust Soc Am. 2024 Jul 1;156(1):278-283. doi: 10.1121/10.0027913.
How we produce and perceive voice is constrained by laryngeal physiology and biomechanics. Such constraints may present themselves as principal dimensions in the voice outcome space that are shared among speakers. This study attempts to identify such principal dimensions in the voice outcome space and the underlying laryngeal control mechanisms in a three-dimensional computational model of voice production. A large-scale voice simulation was performed with parametric variations in vocal fold geometry and stiffness, glottal gap, vocal tract shape, and subglottal pressure. Principal component analysis was applied to data combining both the physiological control parameters and voice outcome measures. The results showed three dominant dimensions accounting for at least 50% of the total variance. The first two dimensions describe respiratory-laryngeal coordination in controlling the energy balance between low- and high-frequency harmonics in the produced voice, and the third dimension describes control of the fundamental frequency. The dominance of these three dimensions suggests that voice changes along these principal dimensions are likely to be more consistently produced and perceived by most speakers than other voice changes, and thus are more likely to have emerged during evolution and be used to convey important personal information, such as emotion and larynx size.
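The analysis step described above (principal component analysis on data combining control parameters and voice outcome measures) can be illustrated with a minimal sketch. It is not the authors' code: the data are random placeholders, the 8-column layout and the 3 latent factors are arbitrary assumptions, and the helper names `pca_explained_variance` and `components_for_variance` are hypothetical. It only shows how one might z-score a combined matrix, run PCA, and count the leading components needed to reach a variance threshold such as 50%.

```python
import numpy as np


def pca_explained_variance(data):
    """Z-score each column, then return (explained-variance ratios, loadings)."""
    z = (data - data.mean(axis=0)) / data.std(axis=0)
    # SVD of the standardized matrix: rows of vt are the principal directions,
    # and the squared singular values are proportional to the variance explained.
    _, s, vt = np.linalg.svd(z, full_matrices=False)
    var = s ** 2
    return var / var.sum(), vt


def components_for_variance(ratio, threshold=0.5):
    """Smallest number of leading components whose cumulative ratio >= threshold."""
    return int(np.searchsorted(np.cumsum(ratio), threshold) + 1)


# Placeholder data only (NOT the study's simulations): 500 hypothetical
# conditions, 8 columns standing in for control parameters and outcome
# measures, generated from 3 arbitrary latent factors plus noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 3))
mixing = rng.normal(size=(3, 8))
data = latent @ mixing + 0.3 * rng.normal(size=(500, 8))

ratio, loadings = pca_explained_variance(data)
k = components_for_variance(ratio, threshold=0.5)
print("explained variance ratios:", np.round(ratio, 3))
print("components needed for 50% of variance:", k)
```

Z-scoring before the decomposition matters when columns mix physical units (for example pressure, length, and frequency), since otherwise the largest-scaled variable would dominate the leading components. Each row of `loadings` shows how strongly every column contributes to one principal dimension.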