• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生物医学文本共指消解的知识增强长短期记忆网络

Knowledge enhanced LSTM for coreference resolution on biomedical texts.

作者信息

Li Yufei, Ma Xiaoyong, Zhou Xiangyu, Cheng Pengzhen, He Kai, Li Chen

机构信息

Department of Computer Science and Technology, School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.

Department of Computer Science and Technology, National Engineering Lab for Big Data Analytics Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.

出版信息

Bioinformatics. 2021 Sep 9;37(17):2699-2705. doi: 10.1093/bioinformatics/btab153.

DOI:10.1093/bioinformatics/btab153
PMID:33705524
Abstract

MOTIVATION

Bio-entity Coreference Resolution focuses on identifying the coreferential links in biomedical texts, which is crucial to complete bio-events' attributes and interconnect events into bio-networks. Previously, as one of the most powerful tools, deep neural network-based general domain systems are applied to the biomedical domain with domain-specific information integration. However, such methods may raise much noise due to its insufficiency of combining context and complex domain-specific information.

RESULTS

In this article, we explore how to leverage the external knowledge base in a fine-grained way to better resolve coreference by introducing a knowledge-enhanced Long Short Term Memory network (LSTM), which is more flexible to encode the knowledge information inside the LSTM. Moreover, we further propose a knowledge attention module to extract informative knowledge effectively based on contexts. The experimental results on the BioNLP and CRAFT datasets achieve state-of-the-art performance, with a gain of 7.5 F1 on BioNLP and 10.6 F1 on CRAFT. Additional experiments also demonstrate superior performance on the cross-sentence coreferences.

AVAILABILITY AND IMPLEMENTATION

The source code will be made available at https://github.com/zxy951005/KB-CR upon publication. Data is avaliable at http://2011.bionlp-st.org/ and https://github.com/UCDenver-ccp/CRAFT/releases/tag/v3.1.3.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

生物实体共指消解专注于识别生物医学文本中的共指链接,这对于完善生物事件的属性以及将事件相互连接成生物网络至关重要。此前,作为最强大的工具之一,基于深度神经网络的通用领域系统通过集成特定领域信息被应用于生物医学领域。然而,由于其在结合上下文和复杂特定领域信息方面的不足,此类方法可能会产生大量噪声。

结果

在本文中,我们探索如何通过引入知识增强的长短期记忆网络(LSTM)以细粒度方式利用外部知识库来更好地解决共指问题,该网络在对LSTM内部的知识信息进行编码时更加灵活。此外,我们进一步提出了一个知识注意力模块,以基于上下文有效地提取信息丰富的知识。在BioNLP和CRAFT数据集上的实验结果达到了当前最优性能,在BioNLP上F1值提高了7.5,在CRAFT上F1值提高了10.6。额外的实验还证明了在跨句子共指方面的卓越性能。

可用性与实现

源代码将在发布后于https://github.com/zxy951005/KB-CR上提供。数据可在http://2011.bionlp-st.org/和https://github.com/UCDenver-ccp/CRAFT/releases/tag/v3.1.3获取。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Knowledge enhanced LSTM for coreference resolution on biomedical texts.用于生物医学文本共指消解的知识增强长短期记忆网络
Bioinformatics. 2021 Sep 9;37(17):2699-2705. doi: 10.1093/bioinformatics/btab153.
2
Integrating K+ Entities Into Coreference Resolution on Biomedical Texts.将钾离子实体整合到生物医学文本的指代消解中。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2145-2155. doi: 10.1109/TCBB.2024.3447273. Epub 2024 Dec 10.
3
Distinguished representation of identical mentions in bio-entity coreference resolution.生物实体共指消解中相同提及的出色表示。
BMC Med Inform Decis Mak. 2022 Apr 30;22(1):116. doi: 10.1186/s12911-022-01862-1.
4
A set of domain rules and a deep network for protein coreference resolution.一组用于蛋白质共指解析的领域规则和深度网络。
Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay065.
5
Bio-SCoRes: A Smorgasbord Architecture for Coreference Resolution in Biomedical Text.生物共指消解评分系统(Bio-SCoRes):一种用于生物医学文本共指消解的混合架构
PLoS One. 2016 Mar 2;11(3):e0148538. doi: 10.1371/journal.pone.0148538. eCollection 2016.
6
Transfer learning for biomedical named entity recognition with neural networks.基于神经网络的生物医学命名实体识别的迁移学习。
Bioinformatics. 2018 Dec 1;34(23):4087-4094. doi: 10.1093/bioinformatics/bty449.
7
BERT-GT: cross-sentence n-ary relation extraction with BERT and Graph Transformer.BERT-GT:使用BERT和图变换器进行跨句子n元关系提取
Bioinformatics. 2021 Apr 5;36(24):5678-5685. doi: 10.1093/bioinformatics/btaa1087.
8
Cross-type biomedical named entity recognition with deep multi-task learning.基于深度多任务学习的跨类型生物医学命名实体识别。
Bioinformatics. 2019 May 15;35(10):1745-1752. doi: 10.1093/bioinformatics/bty869.
9
HUNER: improving biomedical NER with pretraining.HUNER:通过预训练改进生物医学命名实体识别。
Bioinformatics. 2020 Jan 1;36(1):295-302. doi: 10.1093/bioinformatics/btz528.
10
Dataset-aware multi-task learning approaches for biomedical named entity recognition.基于数据集的多任务学习方法在生物医学命名实体识别中的应用。
Bioinformatics. 2020 Aug 1;36(15):4331-4338. doi: 10.1093/bioinformatics/btaa515.

引用本文的文献

1
Distinguished representation of identical mentions in bio-entity coreference resolution.生物实体共指消解中相同提及的出色表示。
BMC Med Inform Decis Mak. 2022 Apr 30;22(1):116. doi: 10.1186/s12911-022-01862-1.
2
A Comprehensive Review on Radiomics and Deep Learning for Nasopharyngeal Carcinoma Imaging.鼻咽癌成像的放射组学与深度学习综合综述
Diagnostics (Basel). 2021 Aug 24;11(9):1523. doi: 10.3390/diagnostics11091523.