Department of Electrical Engineering, University of Texas at Dallas Richardson, TX 75083-0688, USA.
J Acoust Soc Am. 2011 Aug;130(2):986-95. doi: 10.1121/1.3605668.
The conventional articulation index (AI) measure cannot be applied in situations where non-linear operations are involved and additive noise is present. This is because the definitions of the target and masker signals become vague following non-linear processing, as both the target and masker signals are affected. The aim of the present work is to modify the basic form of the AI measure to account for non-linear processing. This was done using a new definition of the output or effective SNR obtained following non-linear processing. The proposed output SNR definition for a specific band was designed to handle cases where the non-linear processing affects predominantly the target signal rather than the masker signal. The proposed measure also takes into consideration the fact that the input SNR in a specific band cannot be improved following any form of non-linear processing. Overall, the proposed measure quantifies the proportion of input band SNR preserved or transmitted in each band after non-linear processing. High correlation (r = 0.9) was obtained with the proposed measure when evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech corrupted in four different real-world maskers.
传统的可懂度指数(AI)测量方法不能应用于涉及非线性运算和存在附加噪声的情况。这是因为在非线性处理后,目标和掩蔽信号的定义变得模糊,因为两者都受到影响。本工作的目的是修改 AI 测量的基本形式,以考虑非线性处理。这是通过使用非线性处理后获得的输出或有效 SNR 的新定义来实现的。针对特定频带提出的输出 SNR 定义旨在处理非线性处理主要影响目标信号而不是掩蔽信号的情况。所提出的措施还考虑到这样一个事实,即在任何形式的非线性处理之后,特定频带中的输入 SNR 都无法提高。总体而言,所提出的措施量化了非线性处理后每个频带中输入频带 SNR 保留或传输的比例。在所评估的 72 种噪声条件下,包括在四种不同的实际掩蔽噪声中受到噪声抑制的语音失真的情况下,与正常听力者的可懂度得分相比,所提出的措施具有很高的相关性(r=0.9)。