Schönweiler R, Kaese S, Möller S, Rinscheid A, Ptok M
Klinik für Phoniatrie und Pädaudiologie, Medizinische Hochschule Hannover.
HNO. 1996 Apr;44(4):201-6.
Subjective and auditory assessment of the voice is now more commonly being replaced by objective voice analysis. Because of the amount of data available from computer-aided voice analysis, subjective selection and interpretation of single data sets remain a matter of experience of the individual investigator. Since neuronal networks are widely used in telecommunication and speech recognition, we applied self-organizing Kohonen networks to classify voice patterns. In the phase of "learning," the Kohonen map is adapted to patterns of the primary signals obtained. If, in the phase of using the map, the input signal hits the field of the primary signals, it will resemble them closely. In this study, we recorded newborn and young infant cries using a DAT recorder and a high-quality microphone. The cries were elicited by wearing uncomfortable headphones ("cries of discomfort"). Spectrographic characteristics of the cries were classified by 20-step bark spectra and then applied to the neuronal networks. It was possible to recognize similarities of different cries of the same children and interindividual differences, as well as cries of children with profound hearing loss. In addition, delayed auditory feedback at 80 dB SL was presented to 27 children via headphone using a three-headed tape-recorder as a model for induced individual cry changes. However, it was not possible to classify short-term changes as in a delayed feedback procedure. Nevertheless, neuronal networks may be helpful as an additional tool in spectrographic voice analysis.
如今,嗓音的主观和听觉评估正越来越多地被客观嗓音分析所取代。由于计算机辅助嗓音分析可提供大量数据,单个数据集的主观选择和解读仍是个别研究者的经验问题。鉴于神经网络在电信和语音识别中广泛应用,我们应用自组织Kohonen网络对嗓音模式进行分类。在“学习”阶段,Kohonen图会适应所获得的主要信号模式。在使用该图的阶段,如果输入信号命中主要信号区域,它将与它们非常相似。在本研究中,我们使用数字音频磁带(DAT)录音机和高质量麦克风记录了新生儿和幼儿的哭声。这些哭声是通过佩戴不舒服的耳机引发的(“不适哭声”)。哭声的频谱特征通过20步巴克谱进行分类,然后应用于神经网络。能够识别同一儿童不同哭声的相似性和个体间差异,以及重度听力损失儿童的哭声。此外,使用三头磁带录音机作为诱导个体哭声变化的模型,通过耳机向27名儿童呈现80分贝感觉级(dB SL)的延迟听觉反馈。然而,在延迟反馈程序中,无法对短期变化进行分类。尽管如此,神经网络可能作为频谱嗓音分析中的一种辅助工具而有所帮助。