使用声音象征词来表达纹理的计算机视觉系统。

Computer Vision System for Expressing Texture Using Sound-Symbolic Words.

作者信息

Yamagata Koichi, Kwon Jinhwan, Kawashima Takuya, Shimoda Wataru, Sakamoto Maki

机构信息

Graduate School of Informatics and Engineering, The University of Electro Communications, Chofu, Japan.

Department of Education, Kyoto University of Education, Kyoto, Japan.

出版信息

Front Psychol. 2021 Oct 7;12:654779. doi: 10.3389/fpsyg.2021.654779. eCollection 2021.

DOI:10.3389/fpsyg.2021.654779

PMID:34690855

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8529034/

Abstract

The major goals of texture research in computer vision are to understand, model, and process texture and ultimately simulate human visual information processing using computer technologies. The field of computer vision has witnessed remarkable advancements in material recognition using deep convolutional neural networks (DCNNs), which have enabled various computer vision applications, such as self-driving cars, facial and gesture recognition, and automatic number plate recognition. However, for computer vision to "express" texture like human beings is still difficult because texture description has no correct or incorrect answer and is ambiguous. In this paper, we develop a computer vision method using DCNN that expresses texture of materials. To achieve this goal, we focus on Japanese "sound-symbolic" words, which can describe differences in texture sensation at a fine resolution and are known to have strong and systematic sensory-sound associations. Because the phonemes of Japanese sound-symbolic words characterize categories of texture sensations, we develop a computer vision method to generate the phonemes and structure comprising sound-symbolic words that probabilistically correspond to the input images. It was confirmed that the sound-symbolic words output by our system had about 80% accuracy rate in our evaluation.

摘要

计算机视觉中纹理研究的主要目标是理解、建模和处理纹理，并最终利用计算机技术模拟人类视觉信息处理。计算机视觉领域在使用深度卷积神经网络（DCNN）进行材料识别方面取得了显著进展，这使得各种计算机视觉应用成为可能，如自动驾驶汽车、面部和手势识别以及自动车牌识别。然而，要让计算机视觉像人类一样“表达”纹理仍然很困难，因为纹理描述没有正确或错误之分，而且具有模糊性。在本文中，我们开发了一种使用DCNN的计算机视觉方法来表达材料的纹理。为了实现这一目标，我们关注日语中的“语音象征”词，这些词可以在高分辨率下描述纹理感觉的差异，并且已知具有强烈且系统的感官 - 声音关联。由于日语语音象征词的音素表征了纹理感觉的类别，我们开发了一种计算机视觉方法来生成音素以及包含与输入图像概率对应的语音象征词的结构。在我们的评估中，证实了我们系统输出的语音象征词准确率约为80%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac5e/8529034/893fff5e36ab/fpsyg-12-654779-g001.jpg

相似文献

Computer Vision System for Expressing Texture Using Sound-Symbolic Words.使用声音象征词来表达纹理的计算机视觉系统。

Front Psychol. 2021 Oct 7;12:654779. doi: 10.3389/fpsyg.2021.654779. eCollection 2021.

Cross-Modal Associations between Sounds and Drink Tastes/Textures: A Study with Spontaneous Production of Sound-Symbolic Words.声音与饮品味道/质地之间的跨模态关联：一项关于声音象征词自发产生的研究。

Chem Senses. 2016 Mar;41(3):197-203. doi: 10.1093/chemse/bjv078. Epub 2015 Dec 28.

A new test for evaluation of marginal cognitive function deficits in idiopathic normal pressure hydrocephalus through expressing texture recognition by sound symbolic words.一种通过声音象征词表达纹理识别来评估特发性正常压力脑积水边缘认知功能缺陷的新测试。

Front Aging Neurosci. 2024 Sep 17;16:1456242. doi: 10.3389/fnagi.2024.1456242. eCollection 2024.

Bouba/Kiki in Touch: Associations Between Tactile Perceptual Qualities and Japanese Phonemes.《触摸中的布巴/基基：触觉感知特质与日语音素之间的关联》

Front Psychol. 2018 Mar 12;9:295. doi: 10.3389/fpsyg.2018.00295. eCollection 2018.

Japanese sound-symbolic words in global contexts: from translation to hybridization.全球化语境下的日语拟声词：从翻译到杂交。

F1000Res. 2021 Oct 8;10:1024. doi: 10.12688/f1000research.55546.2. eCollection 2021.

The Relationships Between Initial Consonants in Japanese Sound Symbolic Words and Familiarity, Multi-Sensory Imageability, Emotional Valence, and Arousal.日语语音象征词中声母与熟悉度、多感官可意象性、情感效价及唤醒之间的关系

J Psycholinguist Res. 2021 Aug;50(4):831-842. doi: 10.1007/s10936-020-09749-w. Epub 2021 Jan 4.

Vowel Length Expands Perceptual and Emotional Evaluations in Written Japanese Sound-Symbolic Words.元音长度扩展了日语书面形声词的感知和情感评价。

Behav Sci (Basel). 2021 Jun 21;11(6):90. doi: 10.3390/bs11060090.

Japanese Sound-Symbolic Words for Representing the Hardness of an Object Are Judged Similarly by Japanese and English Speakers.日语中表示物体硬度的声音象征词，日本人和说英语的人判断方式相似。

Front Psychol. 2022 Mar 15;13:830306. doi: 10.3389/fpsyg.2022.830306. eCollection 2022.

Automatic Estimation of Multidimensional Ratings from a Single Sound-Symbolic Word and Word-Based Visualization of Tactile Perceptual Space.基于单个语音象征词的多维评分自动估计及触觉感知空间的基于词的可视化

IEEE Trans Haptics. 2017 Apr-Jun;10(2):173-182. doi: 10.1109/TOH.2016.2615923. Epub 2016 Oct 14.

Cognitive neural responses in the semantic comprehension of sound symbolic words and pseudowords.声音象征词和假词语义理解中的认知神经反应。

Front Hum Neurosci. 2023 Oct 11;17:1208572. doi: 10.3389/fnhum.2023.1208572. eCollection 2023.

本文引用的文献

Brain networks underlying the processing of sound symbolism related to softness perception.与柔软感知相关的声音象征性处理的大脑网络。

Sci Rep. 2021 Apr 1;11(1):7399. doi: 10.1038/s41598-021-86328-6.

Neural Mechanisms of Material Perception: Quest on Shitsukan.物质知觉的神经机制：对物质知觉的探索。

Neuroscience. 2018 Nov 10;392:329-347. doi: 10.1016/j.neuroscience.2018.09.001. Epub 2018 Sep 11.

Bouba/Kiki in Touch: Associations Between Tactile Perceptual Qualities and Japanese Phonemes.《触摸中的布巴/基基：触觉感知特质与日语音素之间的关联》

Front Psychol. 2018 Mar 12;9:295. doi: 10.3389/fpsyg.2018.00295. eCollection 2018.

Five mechanisms of sound symbolic association.五种声音象征关联的机制。

Psychon Bull Rev. 2018 Oct;25(5):1619-1643. doi: 10.3758/s13423-017-1361-1.

Exploring Tactile Perceptual Dimensions Using Materials Associated with Sensory Vocabulary.使用与感官词汇相关的材料探索触觉感知维度。

Front Psychol. 2017 Apr 13;8:569. doi: 10.3389/fpsyg.2017.00569. eCollection 2017.

IEEE Trans Haptics. 2017 Apr-Jun;10(2):173-182. doi: 10.1109/TOH.2016.2615923. Epub 2016 Oct 14.

Deep Filter Banks for Texture Recognition, Description, and Segmentation.用于纹理识别、描述和分割的深度滤波器组

Int J Comput Vis. 2016;118:65-94. doi: 10.1007/s11263-015-0872-3. Epub 2016 Jan 9.

Chem Senses. 2016 Mar;41(3):197-203. doi: 10.1093/chemse/bjv078. Epub 2015 Dec 28.

Deep learning.深度学习。

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

Balloons and bavoons versus spikes and shikes: ERPs reveal shared neural processes for shape-sound-meaning congruence in words, and shape-sound congruence in pseudowords.气球和小丑与尖状物和敲击声：事件相关电位揭示了单词中形状-声音-意义一致性以及假词中形状-声音一致性的共享神经过程。

Brain Lang. 2015 Jun-Jul;145-146:11-22. doi: 10.1016/j.bandl.2015.03.011. Epub 2015 May 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用声音象征词来表达纹理的计算机视觉系统。

Computer Vision System for Expressing Texture Using Sound-Symbolic Words.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献