Department of Otolaryngology-Head and Neck Surgery, St. Vincent's Hospital, The Catholic University of Korea School of Medicine, Suwon, Korea.
Clin Exp Otorhinolaryngol. 2012 Jun;5(2):68-73. doi: 10.3342/ceo.2012.5.2.68. Epub 2012 Jun 12.
To investigate acoustic differences between conversational and clear speech of Korean and to evaluate the influence of the gender on the speech clarity using the long-term average speech spectrum (LTASS).
Each subject's voice was recorded using a sound level meter connected to GoldWave program. Average long-term root mean square (RMS) of one-third octave bands speech spectrum was calculated from 100 to 10,000 Hz after normalizing to 70 dB overall level using the MATLAB program. Twenty ordinary Korean were compared with 20 Korean announcers with equal numbers of men and women in each group.
Compared with the LTASS of ordinary men, that of ordinary women was lower at low frequencies, but higher at 630, 800, 1,600, 5,000, and 10,000 Hz. Compared with the LTASS of male announcers, that of female announcers was lower at low frequencies. Compared with the LTASS of ordinary men, that of male announcers was significantly lower at 100, 125, 200, and 250 Hz. Compared with the LTASS of ordinary women, that of female announcers was lower at 100, 125, 160, 200, 250, 500, and 10,000 Hz. The LTASS of announcer showed lower levels at 100, 200 Hz and higher at 500, 630, 800, and 1,000 Hz that that of ordinary Koreans.
This study showed that the drop-off of the LTASS in the low frequency region might make the ratings of women and announcers more clearly than those of men and ordinary persons respectively. This drop-off in the low frequency might result in less upward spread of masking and clearer speech. This study reduced an error resulting from a wide variability of clear speech strategies and intelligibility gains, because this study recruited professional speakers. We hope that our results demonstrate the difference in acoustic characteristics of the speech of ordinary Korean persons.
研究韩国会话语音和清晰语音之间的声学差异,并使用长期平均语音频谱(LTASS)评估性别对语音清晰度的影响。
使用与 GoldWave 程序相连的声级计录制每位受试者的声音。使用 MATLAB 程序将每个 1/3 倍频程语音频谱的平均长期均方根(RMS)归一化为 70 dB 总水平后,计算 100 Hz 至 10,000 Hz 的语音频谱。将 20 名普通韩国人和 20 名男女各半的韩国播音员进行比较。
与普通男性的 LTASS 相比,普通女性的低频较低,但在 630、800、1600、5000 和 10,000 Hz 较高。与男性播音员的 LTASS 相比,女性播音员的低频较低。与普通男性的 LTASS 相比,男性播音员在 100、125、200 和 250 Hz 显著较低。与普通女性的 LTASS 相比,女性播音员在 100、125、160、200、250、500 和 10,000 Hz 较低。与普通韩国人相比,播音员的 LTASS 在 100、200 Hz 较低,在 500、630、800 和 1,000 Hz 较高。
本研究表明,低频区域 LTASS 的下降可能使女性和播音员的评分比男性和普通韩国人分别更清晰。这种低频下降可能导致掩蔽的向上扩展减少,语音更清晰。本研究减少了由于清晰语音策略和可懂度增益的广泛变化而导致的误差,因为本研究招募了专业演讲者。我们希望我们的结果展示了普通韩国人语音的声学特征差异。