朝向基于磁共振图像的声道自动研究。

Towards the automatic study of the vocal tract from magnetic resonance images.

机构信息

Faculty of Engineering, University of Porto/Laboratory of Optics and Experimental Mechanics, Institute of Mechanical Engineering and Industrial Management, Porto, Portugal.

出版信息

J Voice. 2011 Nov;25(6):732-42. doi: 10.1016/j.jvoice.2010.05.002. Epub 2010 Oct 16.

DOI:10.1016/j.jvoice.2010.05.002

PMID:20952159

Abstract

Over the last few decades, researchers have been investigating the mechanisms involved in speech production. Image analysis can be a valuable aid in the understanding of the morphology of the vocal tract. The application of magnetic resonance imaging to study these mechanisms has been proven to be reliable and safe. We have applied deformable models in magnetic resonance images to conduct an automatic study of the vocal tract; mainly, to evaluate the shape of the vocal tract in the articulation of some European Portuguese sounds, and then to successfully automatically segment the vocal tract's shape in new images. Thus, a point distribution model has been built from a set of magnetic resonance images acquired during artificially sustained articulations of 21 sounds, which successfully extracts the main characteristics of the movements of the vocal tract. The combination of that statistical shape model with the gray levels of its points is subsequently used to build active shape models and active appearance models. Those models have then been used to segment the modeled vocal tract into new images in a successful and automatic manner. The computational models have thus been revealed to be useful for the specific area of speech simulation and rehabilitation, namely to simulate and recognize the compensatory movements of the articulators during speech production.

摘要

在过去的几十年中，研究人员一直在研究言语产生过程中的机制。图像分析可以成为理解声道形态的有价值的辅助手段。磁共振成像在研究这些机制中的应用已被证明是可靠和安全的。我们已经在磁共振图像中应用了可变形模型，以对声道进行自动研究；主要是评估在一些欧洲葡萄牙语发音时声道的形状，然后成功地自动分割新图像中的声道形状。因此，从一组在人为持续发音 21 个声音期间获得的磁共振图像中构建了一个点分布模型，该模型成功地提取了声道运动的主要特征。随后，将该统计形状模型与各点的灰度值相结合，以构建主动形状模型和主动外观模型。然后，使用这些模型成功且自动地将建模声道分割到新的图像中。因此，计算模型被证明对于言语模拟和康复的特定领域很有用，即模拟和识别言语产生过程中构音器官的代偿运动。