Liu Shi, Li Kaiyang, Wang Yaoying, Zhu Tianyou, Li Jiwei, Chen Zhenyu
Big Data Center of State Grid Corporation, Beijing 100052, China.
Math Biosci Eng. 2023 Jun 26;20(8):14180-14200. doi: 10.3934/mbe.2023634.
Knowledge graph embedding aims to learn representation vectors for the entities and relations. Most of the existing approaches learn the representation from the structural information in the triples, which neglects the content related to the entity and relation. Though there are some approaches proposed to exploit the related multimodal content to improve knowledge graph embedding, such as the text description and images associated with the entities, they are not effective to address the heterogeneity and cross-modal correlation constraint of different types of content and network structure. In this paper, we propose a multi-modal content fusion model (MMCF) for knowledge graph embedding. To effectively fuse the heterogenous data for knowledge graph embedding, such as text description, related images and structural information, a cross-modal correlation learning component is proposed. It first learns the intra-modal and inter-modal correlation to fuse the multimodal content of each entity, and then they are fused with the structure features by a gating network. Meanwhile, to enhance the features of relation, the features of the associated head entity and tail entity are fused to learn relation embedding. To effectively evaluate the proposed model, we compare it with other baselines in three datasets, i.e., FB-IMG, WN18RR and FB15k-237. Experiment result of link prediction demonstrates that our model outperforms the state-of-the-art in most of the metrics significantly, implying the superiority of the proposed method.
知识图谱嵌入旨在学习实体和关系的表示向量。大多数现有方法从三元组中的结构信息学习表示,这忽略了与实体和关系相关的内容。尽管已经提出了一些利用相关多模态内容来改进知识图谱嵌入的方法,例如与实体相关联的文本描述和图像,但它们在解决不同类型内容和网络结构的异质性和跨模态相关性约束方面并不有效。在本文中,我们提出了一种用于知识图谱嵌入的多模态内容融合模型(MMCF)。为了有效地融合用于知识图谱嵌入的异构数据,如文本描述、相关图像和结构信息,提出了一个跨模态相关性学习组件。它首先学习模态内和模态间的相关性以融合每个实体的多模态内容,然后通过一个门控网络将它们与结构特征进行融合。同时,为了增强关系的特征,将相关头实体和尾实体的特征进行融合以学习关系嵌入。为了有效地评估所提出的模型,我们在三个数据集(即FB-IMG、WN18RR和FB15k-237)中与其他基线进行了比较。链接预测的实验结果表明,我们的模型在大多数指标上显著优于当前最先进的模型,这意味着所提出方法的优越性。