Quality and Usability Lab, Technische Universität Berlin, D-10587 Berlin, Germany.
J Neural Eng. 2019 Oct 14;16(6):066008. doi: 10.1088/1741-2552/ab1673.
Non-invasive physiological methods like electroencephalography (EEG) are increasingly employed to assess human information processing during exposure to multimedia signals. In the quality engineering field, previous research has promoted the utility of the P300 event-related brain potential (ERP) component for indicating variation in quality perception. The present study provides a starting point to test whether the P300 and its two subcomponents, P3a and P3b, are truly reflective of changes in the perceived quality of transmitted speech signals given the presence of other, quality-unrelated changes in acoustic stimulation.
High-quality and degraded variants of spoken words were presented in a two-feature oddball task, which required participants to actively respond to rarely occurring 'target' stimuli within a series of frequent 'standard' stimuli, thereby eliciting ERP waveforms. Target presentations involved either single quality changes or concurrent double changes in quality and the initial phoneme.
In case additional phonological change was present, only varying quality of standard stimuli caused significant modulations in P3a and P3b characteristics (N = 32). Thus, the formation of different short-term quality references exerted a persisting influence on the auditory processing of transmitted speech.
The obtained results elucidate the importance of contextual and content-related influencing factors for proving the validity of the P300 as a psychophysiological indicator of speech quality change. Associated questions regarding the transfer of ERP-based quality assessment into more practically relevant measurement contexts are discussed.
脑电图(EEG)等非侵入性生理方法越来越多地被用于评估人类在暴露于多媒体信号下时的信息处理。在质量工程领域,先前的研究已经促进了 P300 事件相关脑电位(ERP)成分在指示质量感知变化方面的效用。本研究为测试 P300 及其两个子成分 P3a 和 P3b 是否真的反映了传输语音信号感知质量的变化提供了一个起点,因为在声学刺激中存在其他与质量无关的变化。
在双特征的Oddball 任务中呈现高质量和低质量的语音变体,要求参与者积极响应一系列常见“标准”刺激中很少出现的“目标”刺激,从而引出 ERP 波形。目标呈现涉及单个质量变化或质量和初始音素的并发双重变化。
如果存在额外的语音变化,只有标准刺激的质量变化会引起 P3a 和 P3b 特征的显著调制(N=32)。因此,不同的短期质量参考的形成对传输语音的听觉处理产生了持续的影响。
所得结果阐明了上下文和与内容相关的影响因素对于证明 P300 作为语音质量变化的心理生理指标的有效性的重要性。讨论了关于将基于 ERP 的质量评估转移到更实际相关的测量环境中的相关问题。