Li Lux, Rehr Robert, Bruns Patrick, Gerkmann Timo, Röder Brigitte
Biological Psychology and Neuropsychology, University of Hamburg, Hamburg, Germany.
Signal Processing (SP), Department of Informatics, University of Hamburg, Hamburg, Germany.
Front Robot AI. 2020 Jul 7;7:85. doi: 10.3389/frobt.2020.00085. eCollection 2020.
Extracting information from noisy signals is of fundamental importance for both biological and artificial perceptual systems. To provide tractable solutions to this challenge, the fields of human perception and machine signal processing (SP) have developed powerful computational models, including Bayesian probabilistic models. However, little true integration between these fields exists in their applications of the probabilistic models for solving analogous problems, such as noise reduction, signal enhancement, and source separation. In this mini review, we briefly introduce and compare selective applications of probabilistic models in machine SP and human psychophysics. We focus on audio and audio-visual processing, using examples of speech enhancement, automatic speech recognition, audio-visual cue integration, source separation, and causal inference to illustrate the basic principles of the probabilistic approach. Our goal is to identify commonalities between probabilistic models addressing brain processes and those aiming at building intelligent machines. These commonalities could constitute the closest points for interdisciplinary convergence.
从噪声信号中提取信息对于生物感知系统和人工感知系统都至关重要。为了应对这一挑战提供易于处理的解决方案,人类感知和机器信号处理(SP)领域已经开发出了强大的计算模型,包括贝叶斯概率模型。然而,在将概率模型应用于解决诸如降噪、信号增强和源分离等类似问题时,这两个领域之间几乎没有真正的整合。在这篇小型综述中,我们简要介绍并比较概率模型在机器信号处理和人类心理物理学中的选择性应用。我们专注于音频和视听处理,通过语音增强、自动语音识别、视听线索整合、源分离和因果推理的例子来说明概率方法的基本原理。我们的目标是找出处理大脑过程的概率模型与旨在构建智能机器的概率模型之间的共性。这些共性可能构成跨学科融合的最接近点。