Xing Fangxu, Zhuo Jiachen, Stone Maureen, Liu Xiaofeng, Reese Timothy G, Wedeen Van J, Prince Jerry L, Woo Jonghye
Department of Radiology, Harvard Medical School/Massachusetts General Hospital, Boston, Massachusetts 02114, USA.
Department of Radiology and Nuclear Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.
J Acoust Soc Am. 2024 Dec 1;156(6):4000-4009. doi: 10.1121/10.0034639.
Identification and quantification of speech variations in velar production across various phonological environments have always been an interesting topic in speech motor control studies. Dynamic magnetic resonance imaging has become a favorable tool for visualizing articulatory deformations and providing quantitative insights into speech activities over time. Based on this modality, it is proposed to employ a workflow of image analysis techniques to uncover potential deformation variations in the human tongue caused by changes in phonological environments by altering the placement of velar consonants in utterances. The speech deformations of four human subjects in three different consonant positions were estimated from magnetic resonance images using a spatiotemporal tracking method before being warped via image registration into a common space-a dynamic atlas space constructed using four-dimensional alignments-for normalized quantitative comparisons. Statistical tests and principal component analyses were conducted on the magnitude of deformations, consonant-specific deformations, and internal muscle strains. The results revealed an overall decrease in deformation intensity following the initial consonant production, indicating potential muscle adaptation behaviors at a later temporal position in one speech utterance.
识别和量化不同语音环境下软腭发音的语音变化,一直是语音运动控制研究中一个有趣的话题。动态磁共振成像已成为一种有利的工具,用于可视化发音变形,并随着时间推移对语音活动提供定量见解。基于这种模态,有人提出采用一种图像分析技术工作流程,通过改变话语中软腭辅音的位置,来揭示语音环境变化导致的人类舌头潜在变形差异。使用时空跟踪方法从磁共振图像中估计了四名人类受试者在三种不同辅音位置的语音变形,然后通过图像配准将其扭曲到一个公共空间——使用四维对齐构建的动态图谱空间——进行标准化定量比较。对变形幅度、辅音特异性变形和内部肌肉应变进行了统计测试和主成分分析。结果显示,在最初的辅音发音后,变形强度总体下降,表明在一个语音话语的后期时间位置可能存在肌肉适应行为。