Noda Kamma, Soda Takafumi, Yamashita Yuichi
Department of Information Medicine, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Kodaira, Japan.
Front Neurosci. 2024 Jan 17;18:1330512. doi: 10.3389/fnins.2024.1330512. eCollection 2024.
Associating multimodal information is essential for human cognitive abilities including mathematical skills. Multimodal learning has also attracted attention in the field of machine learning, and it has been suggested that the acquisition of better latent representation plays an important role in enhancing task performance. This study aimed to explore the impact of multimodal learning on representation, and to understand the relationship between multimodal representation and the development of mathematical skills.
We employed a multimodal deep neural network as the computational model for multimodal associations in the brain. We compared the representations of numerical information, that is, handwritten digits and images containing a variable number of geometric figures learned through single- and multimodal methods. Next, we evaluated whether these representations were beneficial for downstream arithmetic tasks.
Multimodal training produced better latent representation in terms of clustering quality, which is consistent with previous findings on multimodal learning in deep neural networks. Moreover, the representations learned using multimodal information exhibited superior performance in arithmetic tasks.
Our novel findings experimentally demonstrate that changes in acquired latent representations through multimodal association learning are directly related to cognitive functions, including mathematical skills. This supports the possibility that multimodal learning using deep neural network models may offer novel insights into higher cognitive functions.
关联多模态信息对于包括数学技能在内的人类认知能力至关重要。多模态学习在机器学习领域也引起了关注,并且有人提出,获得更好的潜在表征在提高任务性能方面起着重要作用。本研究旨在探讨多模态学习对表征的影响,并了解多模态表征与数学技能发展之间的关系。
我们采用多模态深度神经网络作为大脑中多模态关联的计算模型。我们比较了通过单模态和多模态方法学习的数字信息(即手写数字和包含不同数量几何图形的图像)的表征。接下来,我们评估了这些表征是否对下游算术任务有益。
就聚类质量而言,多模态训练产生了更好的潜在表征,这与先前关于深度神经网络中多模态学习的研究结果一致。此外,使用多模态信息学习的表征在算术任务中表现出卓越的性能。
我们的新发现通过实验证明,通过多模态关联学习获得的潜在表征的变化与包括数学技能在内的认知功能直接相关。这支持了使用深度神经网络模型的多模态学习可能为更高层次的认知功能提供新见解的可能性。