Chen Gang, Kreiman Jody, Alwan Abeer
Department of Electrical Engineering, University of California Los Angeles, 63-134 Engr IV, Los Angeles, CA 90095-1594.
Department of Head and Neck Surgery, University of California Los Angeles School of Medicine, 31-24 Rehab Center, Los Angeles, CA 90095-1794.
Comput Speech Lang. 2014 Sep 1;28(5):1156-1169. doi: 10.1016/j.csl.2013.11.006.
Laryngeal high-speed videoendoscopy is a state-of-the-art technique for examining the physiological vibratory patterns of the vocal folds. With sampling rates of thousands of frames per second, it produces a large amount of data that is difficult to analyze subjectively. To visualize high-speed video in a straightforward and intuitive way, many methods have been proposed that condense the three-dimensional data (two spatial dimensions plus time) into a few static images that preserve the characteristics of the underlying vocal fold vibratory patterns. In this paper, we propose the "glottaltopogram," which is based on principal component analysis of the changes over time in the brightness of each pixel across consecutive video images. This method reveals the overall synchronization of the vibratory patterns of the vocal folds over the entire laryngeal area. Experimental results showed that the method is effective in visualizing both pathological and normal vocal fold vibratory patterns.
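The core idea in the abstract, principal component analysis of each pixel's brightness change over time, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pixel_pca_maps`, the use of simple frame-to-frame differences, the centering choice, and the synthetic two-region "video" are all assumptions made for illustration. Each pixel is treated as one observation whose features are its brightness changes between consecutive frames, and the per-pixel scores on the leading components are reshaped back into images.

```python
import numpy as np


def pixel_pca_maps(video, n_components=3):
    """Hypothetical helper: PCA over per-pixel brightness changes.

    video: array of shape (T, H, W) holding grayscale brightness.
    Returns (maps, explained), where maps has shape (n_components, H, W)
    and explained holds the variance fraction of each returned component.
    """
    frames = np.asarray(video, dtype=np.float64)
    n_frames, height, width = frames.shape

    # Brightness change of every pixel between consecutive frames: (T-1, H, W).
    diffs = np.diff(frames, axis=0)

    # One row per pixel, one column per time step: (H*W, T-1).
    series = diffs.reshape(n_frames - 1, height * width).T

    # Center each time step across pixels (an assumed preprocessing choice).
    centered = series - series.mean(axis=0, keepdims=True)

    # PCA via singular value decomposition of the centered data matrix.
    u, s, _ = np.linalg.svd(centered, full_matrices=False)

    # Score of each pixel on the leading principal components.
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)

    # Reshape the per-pixel scores back into image-shaped maps.
    maps = scores.T.reshape(n_components, height, width)
    return maps, explained[:n_components]


# Synthetic stand-in for a recording (an assumption, not real data):
# the left and right halves oscillate at the same rate in opposite phase.
t = np.arange(50)
wave = np.sin(2 * np.pi * t / 10)
video = np.zeros((50, 8, 8))
video[:, :, :4] = wave[:, None, None]
video[:, :, 4:] = -wave[:, None, None]

maps, explained = pixel_pca_maps(video, n_components=2)
print(maps[0].round(2))
print(explained)
```

In this toy case the first map assigns opposite signs to the two halves, which is the kind of synchronization contrast a static summary image is meant to expose. The sign of a principal component is arbitrary, so only the relative pattern across pixels is meaningful.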