Yousefian Nima, Loizou Philipos C
Department of Electrical Engineering, University of Texas at Dallas, Richardson, TX 75083 USA.
IEEE Trans Audio Speech Lang Process. 2011 Jul 18;20(2):599-609. doi: 10.1109/TASL.2011.2162406.
A novel dual-microphone speech enhancement technique is proposed in the present paper. The technique utilizes the coherence between the target and noise signals as a criterion for noise reduction and can be generally applied to arrays with closely-spaced microphones, where noise captured by the sensors is highly correlated. The proposed algorithm is simple to implement and requires no estimation of noise statistics. In addition, it offers the capability of coping with multiple interfering sources that might be located at different azimuths. The proposed algorithm was evaluated with normal hearing listeners using intelligibility listening tests and compared against a well-established beamforming algorithm. Results indicated large gains in speech intelligibility relative to the baseline (front microphone) algorithm in both single and multiple-noise source scenarios. The proposed algorithm was found to yield substantially higher intelligibility than that obtained by the beamforming algorithm, particularly when multiple noise sources or competing talker(s) were present. Objective quality evaluation of the proposed algorithm also indicated significant quality improvement over that obtained by the beamforming algorithm. The intelligibility and quality benefits observed with the proposed coherence-based algorithm make it a viable candidate for hearing aid and cochlear implant devices.
本文提出了一种新型双麦克风语音增强技术。该技术利用目标信号与噪声信号之间的相关性作为降噪标准,通常可应用于麦克风间距较小的阵列,在这种情况下传感器捕获的噪声具有高度相关性。所提出的算法易于实现,无需估计噪声统计量。此外,它具有应对可能位于不同方位的多个干扰源的能力。使用可懂度听力测试对所提出的算法进行了评估,并与一种成熟的波束形成算法进行了比较。结果表明,在单噪声源和多噪声源场景中,相对于基线(前置麦克风)算法,语音可懂度都有大幅提高。发现所提出的算法产生的可懂度明显高于波束形成算法,特别是在存在多个噪声源或竞争说话者的情况下。对所提出算法的客观质量评估也表明,其质量比波束形成算法有显著提高。所观察到的基于相关性算法在可懂度和质量方面的优势使其成为助听器和人工耳蜗设备的一个可行候选方案。