Scheffers M T
J Acoust Soc Am. 1983 Dec;74(6):1716-25. doi: 10.1121/1.390280.
A model was developed for estimating the pitch of complex sounds that are partially masked by background sound. Our ultimate aim is to obtain a model that can separate two simultaneous sounds on the basis of the harmonic structure of at least one of the sounds. The MDWS model is an extension of the Duifhuis, Willems, and Sluyter pitch meter (DWS) [J. Acoust. Soc. Am. 71, 1568-1580 (1982)] which is a practical implementation of Goldstein's optimum processor theory of pitch perception [J. Acoust. Soc. Am. 54, 1496-1516 (1973)]. The main modifications incorporated in MDWS consist of a more faithful modeling of auditory frequency analysis and of an alteration to the criterion used to decide which fundamental best fits a set of resolved components. Effects of the latter modification were investigated in a comparison between model estimates of the pitch of inharmonic complex signals and results obtained for humans. Furthermore, the accuracy of model estimates of the pitch of periodic signals (among which were synthesized vowel sounds), partially masked by noise, was compared with the just noticeable difference of fundamental frequency of these sounds for human observers. The results of these two tests show that the model estimates come close to human perception.
开发了一种模型,用于估计被背景声音部分掩蔽的复杂声音的音高。我们的最终目标是获得一种模型,该模型能够基于至少一个声音的谐波结构分离两个同时发出的声音。MDWS模型是Duifhuis、Willems和Sluyter音高计(DWS)[《美国声学学会杂志》71, 1568 - 1580 (1982)]的扩展,DWS是Goldstein音高感知最优处理器理论[《美国声学学会杂志》54, 1496 - 1516 (1973)]的实际实现。MDWS中包含的主要修改包括对听觉频率分析进行更精确的建模,以及改变用于确定哪个基频最适合一组分辨出的分量的标准。在对非谐波复合信号音高的模型估计与人类获得的结果进行比较时,研究了后一种修改的效果。此外,将被噪声部分掩蔽的周期性信号(其中包括合成元音声音)音高的模型估计准确性与人类观察者对这些声音基频的刚可察觉差异进行了比较。这两项测试的结果表明,模型估计接近人类感知。