• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义和句法特征的具有长距离依存关系和噪声的实体之间的关系提取

Relationship extraction between entities with long distance dependencies and noise based on semantic and syntactic features.

作者信息

Wang Lei, Wu Fei, Liu Xiaoqing, Cao Jilong, Ma Mingwei, Qu Zhaoyang

机构信息

School of Computer Science, Northeast Electric Power University, Jilin, 132012, China.

Jilin Institute of Chemical Technology, Jilin, 132022, China.

出版信息

Sci Rep. 2025 May 6;15(1):15750. doi: 10.1038/s41598-025-00915-5.

DOI:10.1038/s41598-025-00915-5
PMID:40328850
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12056020/
Abstract

Relation extraction plays a crucial role in tasks such as text processing and knowledge graph construction. However, existing extraction algorithms struggle to maintain accuracy when dealing with long-distance dependencies between entities and noise interference. To address these challenges, this paper proposes a novel relation extraction method that integrates semantic and syntactic features for handling noisy long-distance dependencies. Specifically, we leverage contextual semantic features generated by the pre-trained BERT model alongside syntactic features derived from dependency syntax graphs, effectively utilizing the complementary strengths of both sources of information to enhance the model's performance in long-distance dependency scenarios. To further improve robustness, we introduce a Self-Attention-based Graph Convolutional Network (SA-GCN) to rank neighboring nodes within the syntactic graph, filtering out irrelevant nodes and capturing long-distance dependencies more precisely in noisy environments. Additionally, a residual shrinking network is incorporated to dynamically remove noise from the syntactic graph, further strengthening the model's noise resistance. Moreover, we propose a loss computation method based on predictive interpolation, which dynamically balances the contributions of semantic and syntactic features through weighted interpolation, thereby enhancing relation extraction accuracy. Experiments conducted on two public relation extraction datasets demonstrate that the proposed method achieves significant improvements in accuracy, particularly in handling long-distance dependencies and noise suppression.

摘要

关系抽取在文本处理和知识图谱构建等任务中起着至关重要的作用。然而,现有的抽取算法在处理实体之间的长距离依赖关系和噪声干扰时,难以保持准确性。为了应对这些挑战,本文提出了一种新颖的关系抽取方法,该方法整合了语义和句法特征来处理有噪声的长距离依赖关系。具体而言,我们利用预训练的BERT模型生成的上下文语义特征以及从依存句法图中导出的句法特征,有效地利用这两种信息源的互补优势,以提高模型在长距离依赖场景中的性能。为了进一步提高鲁棒性,我们引入了基于自注意力的图卷积网络(SA-GCN)来对句法图中的相邻节点进行排序,过滤掉不相关节点,并在有噪声的环境中更精确地捕捉长距离依赖关系。此外,还引入了一个残差收缩网络来动态地从句法图中去除噪声,进一步增强模型的抗噪声能力。此外,我们提出了一种基于预测插值的损失计算方法,该方法通过加权插值动态平衡语义和句法特征的贡献,从而提高关系抽取的准确性。在两个公共关系抽取数据集上进行的实验表明,所提出的方法在准确性方面取得了显著提高,特别是在处理长距离依赖关系和噪声抑制方面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/ff58f1134469/41598_2025_915_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/693727b0c40b/41598_2025_915_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/d54f345e3f37/41598_2025_915_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/ce6aabbb7ceb/41598_2025_915_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/c4e9af62d1f1/41598_2025_915_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/09c005bea43b/41598_2025_915_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/d118f54f4461/41598_2025_915_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/ff58f1134469/41598_2025_915_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/693727b0c40b/41598_2025_915_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/d54f345e3f37/41598_2025_915_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/ce6aabbb7ceb/41598_2025_915_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/c4e9af62d1f1/41598_2025_915_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/09c005bea43b/41598_2025_915_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/d118f54f4461/41598_2025_915_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26b5/12056020/ff58f1134469/41598_2025_915_Fig7_HTML.jpg

相似文献

1
Relationship extraction between entities with long distance dependencies and noise based on semantic and syntactic features.基于语义和句法特征的具有长距离依存关系和噪声的实体之间的关系提取
Sci Rep. 2025 May 6;15(1):15750. doi: 10.1038/s41598-025-00915-5.
2
Extracting biomedical relation from cross-sentence text using syntactic dependency graph attention network.基于句法依存图注意力网络的跨句文本生物医学关系抽取
J Biomed Inform. 2023 Aug;144:104445. doi: 10.1016/j.jbi.2023.104445. Epub 2023 Jul 17.
3
Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.基于具有句法依存特征的多头注意力模型的生物医学文本关系抽取:建模研究
JMIR Med Inform. 2022 Oct 20;10(10):e41136. doi: 10.2196/41136.
4
Document-Level Biomedical Relation Extraction Using Graph Convolutional Network and Multihead Attention: Algorithm Development and Validation.使用图卷积网络和多头注意力的文档级生物医学关系抽取:算法开发与验证
JMIR Med Inform. 2020 Jul 31;8(7):e17638. doi: 10.2196/17638.
5
BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework.BioByGANS:通过图注意力网络在节点分类框架中融合上下文和句法特征进行生物医学命名实体识别。
BMC Bioinformatics. 2022 Nov 22;23(1):501. doi: 10.1186/s12859-022-05051-9.
6
Integrating graph convolutional networks to enhance prompt learning for biomedical relation extraction.将图卷积网络集成到提示学习中,以增强生物医学关系抽取。
J Biomed Inform. 2024 Sep;157:104717. doi: 10.1016/j.jbi.2024.104717. Epub 2024 Aug 28.
7
A Graph Convolutional Network-Based Method for Chemical-Protein Interaction Extraction: Algorithm Development.一种基于图卷积网络的化学-蛋白质相互作用提取方法:算法开发
JMIR Med Inform. 2020 May 19;8(5):e17643. doi: 10.2196/17643.
8
Attention enhanced capsule network for text classification by encoding syntactic dependency trees with graph convolutional neural network.通过使用图卷积神经网络对句法依存树进行编码的注意力增强胶囊网络用于文本分类。
PeerJ Comput Sci. 2022 Jan 5;8:e831. doi: 10.7717/peerj-cs.831. eCollection 2022.
9
DAT-MT Accelerated Graph Fusion Dependency Parsing Model for Small Samples in Professional Fields.面向专业领域小样本的DAT-MT加速图融合依存句法分析模型
Entropy (Basel). 2023 Oct 12;25(10):1444. doi: 10.3390/e25101444.
10
Location-enhanced syntactic knowledge for biomedical relation extraction.基于位置增强的生物医学关系抽取句法知识。
J Biomed Inform. 2024 Aug;156:104676. doi: 10.1016/j.jbi.2024.104676. Epub 2024 Jun 12.

引用本文的文献

1
Research on a denoising model for entity-relation extraction using hierarchical contrastive learning with distant supervision.基于远距离监督的层次对比学习的实体关系抽取去噪模型研究
Sci Rep. 2025 Jul 1;15(1):21726. doi: 10.1038/s41598-025-04474-7.