Department of Computer Science, Friedrich Schiller University, Jena, Germany.
PLoS One. 2010 Aug 19;5(8):e12268. doi: 10.1371/journal.pone.0012268.
Art images and natural scenes have in common that their radially averaged (1D) Fourier spectral power falls according to a power-law with increasing spatial frequency (1/f(2) characteristics), which implies that the power spectra have scale-invariant properties. In the present study, we show that other categories of man-made images, cartoons and graphic novels (comics and mangas), have similar properties. Further on, we extend our investigations to 2D power spectra. In order to determine whether the Fourier power spectra of man-made images differed from those of other categories of images (photographs of natural scenes, objects, faces and plants and scientific illustrations), we analyzed their 2D power spectra by principal component analysis. Results indicated that the first fifteen principal components allowed a partial separation of the different image categories. The differences between the image categories were studied in more detail by analyzing whether the mean power and the slope of the power gradients from low to high spatial frequencies varied across orientations in the power spectra. Mean power was generally higher in cardinal orientations both in real-world photographs and artworks, with no systematic difference between the two types of images. However, the slope of the power gradients showed a lower degree of mean variability across spectral orientations (i.e., more isotropy) in art images, cartoons and graphic novels than in photographs of comparable subject matters. Taken together, these results indicate that art images, cartoons and graphic novels possess relatively uniform 1/f(2) characteristics across all orientations. In conclusion, the man-made stimuli studied, which were presumably produced to evoke pleasant and/or enjoyable visual perception in human observers, form a subset of all images and share statistical properties in their Fourier power spectra. Whether these properties are necessary or sufficient to induce aesthetic perception remains to be investigated.
艺术图像和自然场景的共同之处在于,它们的径向平均(1D)傅里叶频谱功率随着空间频率的增加而按幂律下降(1/f(2)特征),这意味着频谱具有标度不变性。在本研究中,我们表明其他类别的人造图像,如卡通和漫画(漫画和日本漫画),也具有类似的特性。此外,我们将研究扩展到 2D 功率谱。为了确定人造图像的傅里叶功率谱是否与其他类别的图像(自然场景、物体、人脸和植物的照片以及科学插图)不同,我们通过主成分分析来分析它们的 2D 功率谱。结果表明,前十五个主成分允许对不同的图像类别进行部分分离。通过分析从低到高空间频率的功率梯度的平均功率和斜率是否随功率谱的方向变化,我们更详细地研究了图像类别的差异。在真实世界的照片和艺术作品中,平均功率通常在主方向上更高,而这两种图像类型之间没有系统差异。然而,在艺术图像、卡通和漫画中,功率梯度的斜率显示出在频谱方向上的平均变化程度较低(即各向同性较高),而在可比主题的照片中则没有。总之,这些结果表明,艺术图像、卡通和漫画在所有方向上都具有相对均匀的 1/f(2)特征。总之,研究中使用的人为刺激,据推测是为了在人类观察者中引起愉悦和/或愉快的视觉感知,它们构成了所有图像的一个子集,并在它们的傅里叶功率谱中具有统计特性。这些特性是否是引起审美感知的必要条件或充分条件仍有待研究。