Correspondence to Stephen J. Sheinkopf:
J Speech Lang Hear Res. 2013 Oct;56(5):1416-28. doi: 10.1044/1092-4388(2013/11-0298). Epub 2013 Jun 19.
In this article, the authors describe and validate the performance of a modern acoustic analyzer specifically designed for infant cry analysis.
Utilizing known algorithms, the authors developed a method to extract acoustic parameters describing infant cries from standard digital audio files. They used a frame rate of 25 ms with a frame advance of 12.5 ms. Cepstral-based acoustic analysis proceeded in 2 phases, computing frame-level data and then organizing and summarizing this information within cry utterances. Using signal detection methods, the authors evaluated the accuracy of the automated system to determine voicing and to detect fundamental frequency (F 0) as compared to voiced segments and pitch periods manually coded from spectrogram displays.
The system detected F 0 with 88% to 95% accuracy, depending on tolerances set at 10 to 20 Hz. Receiver operating characteristic analyses demonstrated very high accuracy at detecting voicing characteristics in the cry samples.
This article describes an automated infant cry analyzer with high accuracy to detect important acoustic features of cry. A unique and important aspect of this work is the rigorous testing of the system's accuracy as compared to ground-truth manual coding. The resulting system has implications for basic and applied research on infant cry development.
本文作者描述并验证了一种专门设计用于婴儿哭声分析的现代声学分析仪的性能。
作者利用已知的算法,开发了一种从标准数字音频文件中提取描述婴儿哭声的声学参数的方法。他们使用 25ms 的帧率和 12.5ms 的帧提前量。基于倒谱的声学分析分两个阶段进行,计算帧级数据,然后在哭声中组织和总结这些信息。作者使用信号检测方法,评估自动系统确定发声和检测基频 (F0) 的准确性,与手动从声谱图显示编码的浊音段和音高周期进行比较。
该系统检测 F0 的准确率为 88%至 95%,具体取决于设置的容限为 10 至 20Hz。接收者操作特性分析表明,该系统在检测哭声样本中的发声特征方面具有非常高的准确性。
本文描述了一种具有高准确性的自动婴儿哭声分析仪,可用于检测哭声的重要声学特征。这项工作的一个独特而重要的方面是,对系统的准确性与地面真实手动编码进行了严格的测试。该系统对婴儿哭声发育的基础和应用研究具有重要意义。