Akgul Y S, Kambhamettu C, Stone M
Department of Computer and Information Sciences, University of Delaware, Newark 19716, USA.
IEEE Trans Med Imaging. 1999 Oct;18(10):1035-45. doi: 10.1109/42.811315.
Computerized analysis of tongue surface movement can provide valuable information for speech and swallowing research. Ultrasound is currently the most attractive modality for tongue imaging, mainly because of its high video frame rate. However, problems with ultrasound imaging, such as noise, echo artifacts, refractions, and unrelated reflections, pose significant challenges for computer analysis of tongue images, so specific methods must be developed. This paper presents a system for automatic extraction and tracking of tongue surface movements from ultrasound image sequences. The ultrasound images are supplied by the head and transducer support system (HATS), which was developed to fix the head and support the transducer under the chin in a known position without disturbing speech. In this work, we propose a novel scheme for the analysis of tongue images using deformable contours. We incorporate novel mechanisms to 1) impose speech-related constraints on the deformations; 2) perform spatiotemporal smoothing using a contour postprocessing stage; 3) utilize optical flow techniques to speed up the search process; and 4) propagate user-supplied information to the analysis of all image frames. We tested the system's performance qualitatively and quantitatively in consultation with speech scientists. The system produced contours that are within the range of manual measurement variations. These results are very encouraging, and the system can be used in practical speech and swallowing research in the field of otolaryngology.
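The abstract names spatiotemporal smoothing of the extracted contours but does not say how it is done. The following is only a minimal sketch of the general idea, not the authors' method: it assumes each contour is stored as a list of surface heights sampled at the same fixed positions in every frame, and it uses a plain moving average along the contour (space) and then across frames (time). The names `moving_average` and `smooth_spatiotemporal`, the contour representation, and the window radii are all hypothetical.

```python
from typing import List

# Hypothetical representation: one tongue-surface height per fixed sample position.
Contour = List[float]


def moving_average(values: List[float], radius: int) -> List[float]:
    """Average each value with its neighbours, shrinking the window at the ends."""
    n = len(values)
    out = []
    for i in range(n):
        lo = max(0, i - radius)
        hi = min(n, i + radius + 1)
        window = values[lo:hi]
        out.append(sum(window) / len(window))
    return out


def smooth_spatiotemporal(
    frames: List[Contour], space_radius: int = 1, time_radius: int = 1
) -> List[Contour]:
    """Smooth along each contour (space), then across frames (time)."""
    if not frames:
        return []
    n_points = len(frames[0])
    if any(len(f) != n_points for f in frames):
        raise ValueError("all contours must have the same number of points")
    # Spatial pass: smooth each frame's contour independently.
    spatial = [moving_average(f, space_radius) for f in frames]
    # Temporal pass: smooth each point's trajectory over the frame sequence.
    columns = [
        moving_average([f[j] for f in spatial], time_radius)
        for j in range(n_points)
    ]
    # Transpose back to one contour per frame.
    return [[columns[j][t] for j in range(n_points)] for t in range(len(frames))]


if __name__ == "__main__":
    # Three frames of a five-point contour; the spike in the middle frame
    # stands in for a noise artifact.
    frames = [
        [1.0, 2.0, 3.0, 2.0, 1.0],
        [1.0, 2.0, 9.0, 2.0, 1.0],
        [1.0, 2.0, 3.0, 2.0, 1.0],
    ]
    for contour in smooth_spatiotemporal(frames):
        print([round(v, 2) for v in contour])
```

A moving average is chosen here only because it is easy to follow. It pulls an isolated spike toward its neighbours in both space and time, but it also spreads part of the spike into adjacent points and frames, so a practical system would likely need a more careful filter.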