Yu Chengzhu, Wójcicki Kamil K, Loizou P C, Hansen John H L
Dept. of Electrical Engineering, Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX 75080.
Proc IEEE Int Conf Acoust Speech Signal Process. 2013. doi: 10.1109/ICASSP.2013.6639025.
Mask-based objective speech-intelligibility measures have been proposed for evaluating the performance of binary masking algorithms. These measures are computed directly by comparing the estimated binary mask against the ground-truth ideal binary mask (IdBM). Most of them, however, assign equal weight to all time-frequency (T-F) units. In this study, we propose to improve the existing mask-based objective measures by weighting each T-F unit according to its target or masker loudness. The proposed objective measure performs significantly better than two existing mask-based objective measures.
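The idea of weighting T-F units can be illustrated with a minimal sketch. It is not the measure defined in the paper: the abstract gives neither the underlying score nor the loudness model, so the function name `weighted_mask_agreement`, the simple agreement ratio, and the numeric weights below are all assumptions chosen for illustration. The sketch only shows how a per-unit weight changes a mask-comparison score relative to equal weighting.

```python
def weighted_mask_agreement(estimated, ideal, weights=None):
    """Weighted fraction of T-F units where two binary masks agree.

    estimated, ideal: 2-D lists (frequency channels x time frames) of 0/1.
    weights: 2-D list of non-negative per-unit weights, standing in for
             a loudness value (hypothetical; not the paper's loudness model).
             If omitted, every unit gets weight 1 (the equal-weight case).
    """
    if weights is None:
        weights = [[1.0] * len(row) for row in ideal]

    total = 0.0    # sum of all weights
    matched = 0.0  # sum of weights where the masks agree
    for est_row, ideal_row, w_row in zip(estimated, ideal, weights):
        for e, i, w in zip(est_row, ideal_row, w_row):
            total += w
            if e == i:
                matched += w

    if total == 0:
        raise ValueError("weights must not sum to zero")
    return matched / total


# Toy 2x2 masks: the estimate is wrong in exactly one T-F unit.
ideal = [[1, 0],
         [1, 0]]
estimated = [[1, 0],
             [0, 0]]
# Illustrative weights only; the mislabeled unit carries a small weight.
weights = [[4.0, 1.0],
           [1.0, 2.0]]

equal_score = weighted_mask_agreement(estimated, ideal)
loudness_score = weighted_mask_agreement(estimated, ideal, weights)
```

With equal weights the single error costs one quarter of the score. With the illustrative weights the same error costs less, because it falls on a low-weight unit. This is the kind of distinction that per-unit weighting makes possible.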