通过使用早期经验中从刺激派生的响应场来表示相似性，实现稳健的声音分类。

Robust sound classification through the representation of similarity using response fields derived from stimuli during early experience.

作者信息

Coath Martin, Denham Susan L

机构信息

Centre for Theoretical and Computational Neuroscience, University of Plymouth, Drakes Circus PL4 8AA, UK.

出版信息

Biol Cybern. 2005 Jul;93(1):22-30. doi: 10.1007/s00422-005-0560-4. Epub 2005 Jun 8.

DOI:10.1007/s00422-005-0560-4

PMID:15944856

Abstract

Models of auditory processing, particularly of speech, face many difficulties. Included in these are variability among speakers, variability in speech rate, and robustness to moderate distortions such as time compression. We constructed a system based on ensembles of feature detectors derived from fragments of an onset-sensitive sound representation. This method is based on the idea of 'spectro-temporal response fields' and uses convolution to measure the degree of similarity through time between the feature detectors and the stimulus. The output from the ensemble was used to derive segmentation cues and patterns of response, which were used to train an artificial neural network (ANN) classifier. This allowed us to estimate a lower bound for the mutual information between the class of the input and the class of the output. Our results suggest that there is significant information in the output of our system, and that this is robust with respect to the exact choice of feature set, time compression in the stimulus, and speaker variation. In addition, the robustness to time compression in the stimulus has features in common with human psychophysics. Similar experiments using feature detectors derived from fragments of non-speech sounds performed less well. This result is interesting in the light of results showing aberrant cortical development in animals exposed to impoverished auditory environments during the developmental phase.

摘要

听觉处理模型，尤其是语音处理模型，面临着诸多困难。其中包括说话者之间的差异、语速的变化以及对诸如时间压缩等适度失真的鲁棒性。我们构建了一个基于从起始敏感声音表示的片段中派生的特征检测器集合的系统。该方法基于“光谱 - 时间响应场”的概念，并使用卷积来测量特征检测器与刺激之间随时间的相似程度。集合的输出用于导出分割线索和响应模式，这些被用于训练人工神经网络（ANN）分类器。这使我们能够估计输入类别与输出类别之间互信息的下限。我们的结果表明，我们系统的输出中存在大量信息，并且对于特征集的精确选择、刺激中的时间压缩以及说话者变化而言，该信息具有鲁棒性。此外，对刺激中时间压缩的鲁棒性具有与人类心理物理学相同的特征。使用从非语音声音片段派生的特征检测器进行的类似实验表现较差。鉴于有结果表明在发育阶段暴露于贫困听觉环境的动物存在异常的皮层发育，这一结果很有趣。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

通过使用早期经验中从刺激派生的响应场来表示相似性，实现稳健的声音分类。

Robust sound classification through the representation of similarity using response fields derived from stimuli during early experience.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

通过使用早期经验中从刺激派生的响应场来表示相似性，实现稳健的声音分类。

Robust sound classification through the representation of similarity using response fields derived from stimuli during early experience.

作者信息

机构信息

出版信息

相似文献

引用本文的文献