Department of Neurology, Milton S. Hershey Medical Center, Penn State College of Medicine.
Department of Neural & Behavioral Sciences, Milton S. Hershey Medical Center, Penn State College of Medicine.
Cogn Sci. 2020 Sep;44(9):e12883. doi: 10.1111/cogs.12883.
Sound symbolism refers to non-arbitrary mappings between the sounds of words and their meanings and is often studied by pairing auditory pseudowords such as "maluma" and "takete" with rounded and pointed visual shapes, respectively. However, it is unclear what auditory properties of pseudowords contribute to their perception as rounded or pointed. Here, we compared perceptual ratings of the roundedness/pointedness of large sets of pseudowords and shapes to their acoustic and visual properties using a novel application of representational similarity analysis (RSA). Representational dissimilarity matrices (RDMs) of the auditory and visual ratings of roundedness/pointedness were significantly correlated crossmodally. The auditory perceptual RDM correlated significantly with RDMs of spectral tilt, the temporal fast Fourier transform (FFT), and the speech envelope. Conventional correlational analyses showed that ratings of pseudowords transitioned from rounded to pointed as vocal roughness (as measured by the harmonics-to-noise ratio, pulse number, fraction of unvoiced frames, mean autocorrelation, shimmer, and jitter) increased. The visual perceptual RDM correlated significantly with RDMs of global indices of visual shape (the simple matching coefficient, image silhouette, image outlines, and Jaccard distance). Crossmodally, the RDMs of the auditory spectral parameters correlated weakly but significantly with those of the global indices of visual shape. Our work establishes the utility of RSA for analysis of large stimulus sets and offers novel insights into the stimulus parameters underlying sound symbolism, showing that sound-to-shape mapping is driven by acoustic properties of pseudowords and suggesting audiovisual cross-modal correspondence as a basis for language users' sensitivity to this type of sound symbolism.
音响象征主义是指单词的发音与其含义之间的非任意映射,通常通过将听觉上的伪词(如“maluma”和“takete”)分别与圆形和尖形的视觉形状配对来研究。然而,伪词的哪些听觉属性有助于将其感知为圆形或尖形,这一点尚不清楚。在这里,我们使用代表性相似性分析(RSA)的新应用程序,将大量伪词和形状的圆形/尖形感知评分与其声学和视觉特性进行了比较。圆形/尖形的听觉和视觉评分的代表性相似性矩阵(RDM)在跨模态上显著相关。听觉感知 RDM 与光谱倾斜、时间快速傅里叶变换(FFT)和语音包络的 RDM 显著相关。传统的相关分析表明,随着语音粗糙度(如谐波噪声比、脉冲数、无声帧分数、平均自相关、颤动和抖动测量)的增加,伪词的评分从圆形过渡到尖形。视觉感知 RDM 与视觉形状的全局指标 RDM 显著相关(简单匹配系数、图像轮廓、图像轮廓、Jaccard 距离)。跨模态时,听觉频谱参数的 RDM 与视觉形状全局指标的 RDM 弱相关,但具有显著相关性。我们的工作确立了 RSA 用于分析大型刺激集的效用,并为音响象征主义的刺激参数提供了新的见解,表明声音到形状的映射是由伪词的声学特性驱动的,并暗示视听跨模态对应是语言使用者对这种类型的音响象征主义敏感的基础。