School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China.
School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China.
Sensors (Basel). 2020 Dec 15;20(24):7180. doi: 10.3390/s20247180.
Cross-domain relation extraction has become an essential approach when target domain lacking labeled data. Most existing works adapted relation extraction models from the source domain to target domain through aligning sequential features, but failed to transfer non-local and non-sequential features such as word co-occurrence which are also critical for cross-domain relation extraction. To address this issue, in this paper, we propose a novel tripartite graph architecture to adapt non-local features when there is no labeled data in the target domain. The graph uses domain words as nodes to model the co-occurrence relation between domain-specific words and domain-independent words. Through graph convolutions on the tripartite graph, the information of domain-specific words is propagated so that the word representation can be fine-tuned to align domain-specific features. In addition, unlike the traditional graph structure, the weights of edges innovatively combine fixed weight and dynamic weight, to capture the global non-local features and avoid introducing noise to word representation. Experiments on three domains of ACE2005 datasets show that our method outperforms the state-of-the-art models by a big margin.
跨领域关系抽取已成为在目标领域缺乏标记数据时的一种重要方法。大多数现有工作通过对齐序列特征,将关系抽取模型从源领域适配到目标领域,但未能转移非局部和非序列特征,如词共现,这些特征对于跨领域关系抽取也至关重要。针对这个问题,在本文中,我们提出了一种新的三分图架构,在目标领域没有标记数据的情况下,适应非局部特征。该图使用领域词作为节点,来建模领域特定词和领域无关词之间的共现关系。通过在三分图上进行图卷积,可以传播领域特定词的信息,从而可以微调词表示以对齐领域特定特征。此外,与传统图结构不同,边的权重创新性地结合了固定权重和动态权重,以捕获全局非局部特征,并避免将噪声引入词表示。在 ACE2005 数据集的三个领域上的实验表明,我们的方法明显优于最先进的模型。