Zeremdini Jihen, Ben Messaoud Mohamed Anouar, Bouzid Aicha
National School of Engineers of Tunis, LR11ES17 Signal, Image and Information Technology Laboratory, University of Tunis El Manar, 1002, Tunis, Tunisia.
Brain Inform. 2015 Sep;2(3):155-166. doi: 10.1007/s40708-015-0016-0. Epub 2015 Aug 4.
Humans can easily separate composed speech and form perceptual representations of the constituent sources in an acoustic mixture thanks to their auditory system. Researchers have long attempted to build computer models of the high-level functions of the auditory system, yet the segregation of composed speech remains a very challenging problem. Here, we are interested in approaches that address monaural speech segregation. To this end, we study computational auditory scene analysis (CASA) for segregating speech from monaural mixtures. CASA aims to reproduce the source organization achieved by human listeners. It is based on two main stages: segmentation and grouping. In this work, we present and compare several studies that have used CASA for speech separation and recognition.