Department of Computer Science & Engineering, Center for Cognitive Science, The Ohio State University, Columbus, OH 43210, USA.
Trends Amplif. 2008 Dec;12(4):332-53. doi: 10.1177/1084713808326455. Epub 2008 Oct 30.
A new approach to the separation of speech from speech-in-noise mixtures is the use of time-frequency (T-F) masking. Originating in the field of computational auditory scene analysis, T-F masking performs separation in the time-frequency domain. This article introduces the T-F masking concept and reviews T-F masking algorithms that separate target speech from monaural mixtures, binaural mixtures, or microphone-array recordings. The review emphasizes techniques that are promising for hearing aid design. This article also surveys recent studies that evaluate the perceptual effects of T-F masking techniques, particularly their effectiveness in improving human speech recognition in noise. The potential benefits of T-F masking methods for the hearing impaired are assessed in light of the processing constraints of hearing aids. Finally, several issues pertinent to T-F masking are discussed.