Ban Taiyu, Wang Xiangyu, Chen Lyuzhou, Wu Xingyu, Chen Qiuju, Chen Huanhuan
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):1980-1992. doi: 10.1109/TNNLS.2022.3186033. Epub 2024 Feb 5.
The evaluation of knowledge quality (KQ) in multisource knowledge graphs (KGs) is an essential step for many applications, such as fragmented knowledge fusion and knowledge base construction. Many existing quality evaluation methods for multisource knowledge are based on validation from high-quality knowledge bases or statistical analysis of knowledge related to a specific fact from multiple sources, named external consistency (EC)-based methods. However, high-quality KGs are difficult to obtain, and there might exist incorrect knowledge in multisource KGs interfering with KQ evaluation. To address the issue, this article refers to the internal structure of a KG to evaluate the degree to which the contained triples conform to the overall semantic pattern of the KG, such as KG embedding and logic inference-based approaches, defined as internal consistency (IC) evaluation. The IC is integrated with the EC to identify possible incorrect triples and reduce their influences on the KQ evaluation, thus alleviating the interference of incorrect knowledge. The proposed method is verified with multiple datasets, and the results demonstrate that the proposed method could significantly reduce wrong evaluations caused by incorrect knowledge and effectively improve the quality evaluation of triples.
多源知识图谱(KGs)中知识质量(KQ)的评估是许多应用(如碎片化知识融合和知识库构建)的关键步骤。许多现有的多源知识质量评估方法基于来自高质量知识库的验证或对来自多个源的与特定事实相关的知识进行统计分析,即基于外部一致性(EC)的方法。然而,高质量的知识图谱难以获取,并且多源知识图谱中可能存在干扰知识质量评估的错误知识。为解决该问题,本文参考知识图谱的内部结构来评估其中包含的三元组符合知识图谱整体语义模式的程度,如基于知识图谱嵌入和逻辑推理的方法,定义为内部一致性(IC)评估。将内部一致性与外部一致性相结合,以识别可能错误的三元组并减少它们对知识质量评估的影响,从而减轻错误知识的干扰。通过多个数据集对所提出的方法进行了验证,结果表明该方法可以显著减少由错误知识导致的错误评估,并有效提高三元组的质量评估。