Clements M A, Braida L D, Durlach N I
Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge 02139.
J Rehabil Res Dev. 1988 Fall;25(4):25-44.
Two methods of encoding speech for tactile displays were compared in discrimination experiments using speech segments. One display represented the short-term speech spectrum in time-swept mode and used vibration amplitude to encode spectral amplitude. The other represented the linear predictive coding (LPC)-derived vocal tract shape as a filled bar graph in which the number of active vibrators was used to encode cross sectional area. The displays were applied to the thigh via a matrix of vibrators. The vibrators were driven at 250 Hz during voiced segments, and by random noise during unvoiced segments. Overall results show a slight superiority for the spectral display in vowel discrimination. Detailed results were analyzed in terms of an articulatory description of the speech stimuli, a multidimensional scaling (MDS) analysis of confusions, and an ideal receiver analysis. The results of these analyses suggest that the detailed characteristics of the tactile patterns were only crudely discriminated.
在使用语音片段的辨别实验中,对两种用于触觉显示器的语音编码方法进行了比较。一种显示器以时间扫描模式呈现短期语音频谱,并使用振动幅度对频谱幅度进行编码。另一种则将线性预测编码(LPC)得出的声道形状表示为填充条形图,其中活动振动器的数量用于对横截面积进行编码。这些显示器通过振动器矩阵施加到大腿上。在有声片段期间,振动器以250赫兹驱动,在无声片段期间则由随机噪声驱动。总体结果表明,频谱显示器在元音辨别方面略具优势。从语音刺激的发音描述、混淆的多维缩放(MDS)分析以及理想接收器分析的角度对详细结果进行了分析。这些分析结果表明,触觉模式的详细特征仅得到了粗略的辨别。