Graduate School of Biomedical Sciences, Nagasaki University, Sakamoto 1-12-4, Nagasaki 852-8523, Japan.
Neurosci Res. 2011 May;70(1):62-70. doi: 10.1016/j.neures.2011.01.010. Epub 2011 Jan 21.
Infant-directed speech (IDS) has the important functions of capturing the infants' attention and maintaining communication between the mother and the infant. It is known that three acoustic components (F0, F0 range, and tempo) in IDS and adult-directed speech (ADS) are different. However, it is not easy to discriminate between IDS and ADS using procedural approaches due to the wide range of individual differences. In this paper, we propose a novel approach to discriminate between IDS and ADS that uses mel-frequency cepstral coefficient and a hidden Markov model-based speech discrimination algorithm; this approach is not based on the prosodic features of F0, F0 range, and tempo. The average discrimination accuracy of the proposed algorithm is 84.34%. The objective accuracy of the discrimination models have been confirmed using the head-turn preference procedure, which measures infants' listening duration to auditory stimuli of IDS and ADS. These results suggest that the proposed algorithm may enable a robust and reliable classification of mothers' speech and infant attention to the mothers' speech may depend on IDS clarity.
婴儿导向语音(IDS)具有吸引婴儿注意力和维持母婴之间交流的重要功能。已知 IDS 和成人导向语音(ADS)在三个声学特征(基频 F0、F0 范围和节奏)方面存在差异。然而,由于个体差异范围广泛,使用程序方法来区分 IDS 和 ADS 并不容易。在本文中,我们提出了一种使用梅尔频率倒谱系数和基于隐马尔可夫模型的语音识别算法来区分 IDS 和 ADS 的新方法;这种方法不是基于 F0、F0 范围和节奏等韵律特征。所提出算法的平均判别准确率为 84.34%。使用头转向偏好程序(一种测量婴儿对 IDS 和 ADS 听觉刺激的聆听时间的方法)验证了判别模型的客观准确性。这些结果表明,所提出的算法可能能够实现对母亲语音的稳健和可靠分类,并且婴儿对母亲语音的注意力可能取决于 IDS 的清晰度。