• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用额外的外部语境提高生物医学命名实体识别的性能。

Improving biomedical Named Entity Recognition with additional external contexts.

机构信息

Hung Yen University of Technology and Education, Viet Nam; University of Engineering and Technology, Vietnam National University, Hanoi, Viet Nam.

Hung Yen University of Technology and Education, Viet Nam.

出版信息

J Biomed Inform. 2024 Aug;156:104674. doi: 10.1016/j.jbi.2024.104674. Epub 2024 Jun 11.

DOI:10.1016/j.jbi.2024.104674
PMID:38871012
Abstract

OBJECTIVE

Biomedical Named Entity Recognition (bio NER) is the task of recognizing named entities in biomedical texts. This paper introduces a new model that addresses bio NER by considering additional external contexts. Different from prior methods that mainly use original input sequences for sequence labeling, the model takes into account additional contexts to enhance the representation of entities in the original sequences, since additional contexts can provide enhanced information for the concept explanation of biomedical entities.

METHODS

To exploit an additional context, given an original input sequence, the model first retrieves the relevant sentences from PubMed and then ranks the retrieved sentences to form the contexts. It next combines the context with the original input sequence to form a new enhanced sequence. The original and new enhanced sequences are fed into PubMedBERT for learning feature representation. To obtain more fine-grained features, the model stacks a BiLSTM layer on top of PubMedBERT. The final named entity label prediction is done by using a CRF layer. The model is jointly trained in an end-to-end manner to take advantage of the additional context for NER of the original sequence.

RESULTS

Experimental results on six biomedical datasets show that the proposed model achieves promising performance compared to strong baselines and confirms the contribution of additional contexts for bio NER.

CONCLUSION

The promising results confirm three important points. First, the additional context from PubMed helps to improve the quality of the recognition of biomedical entities. Second, PubMed is more appropriate than the Google search engine for providing relevant information of bio NER. Finally, more relevant sentences from the context are more beneficial than irrelevant ones to provide enhanced information for the original input sequences. The model is flexible to integrate any additional context types for the NER task.

摘要

目的

生物医学命名实体识别(bio NER)是识别生物医学文本中命名实体的任务。本文介绍了一种新的模型,通过考虑额外的外部上下文来解决 bio NER 问题。与主要使用原始输入序列进行序列标记的先前方法不同,该模型考虑了额外的上下文,以增强原始序列中实体的表示,因为额外的上下文可以为生物医学实体的概念解释提供增强的信息。

方法

为了利用额外的上下文,给定原始输入序列,模型首先从 PubMed 中检索相关句子,然后对检索到的句子进行排序,形成上下文。然后,它将上下文与原始输入序列相结合,形成新的增强序列。原始序列和新增强序列都被馈送到 PubMedBERT 中进行特征表示学习。为了获得更精细的特征,模型在 PubMedBERT 上堆叠了一个 BiLSTM 层。最终的命名实体标签预测是通过使用 CRF 层完成的。模型以端到端的方式联合训练,以利用额外的上下文进行原始序列的 NER。

结果

在六个生物医学数据集上的实验结果表明,与强大的基线相比,所提出的模型取得了有前途的性能,并证实了额外上下文对 bio NER 的贡献。

结论

有前途的结果证实了三点。首先,来自 PubMed 的额外上下文有助于提高生物医学实体识别的质量。其次,与谷歌搜索引擎相比,PubMed 更适合为 bio NER 提供相关信息。最后,来自上下文的更多相关句子比不相关句子更有利于为原始输入序列提供增强信息。该模型灵活,可以集成任何类型的额外上下文来完成 NER 任务。

相似文献

1
Improving biomedical Named Entity Recognition with additional external contexts.利用额外的外部语境提高生物医学命名实体识别的性能。
J Biomed Inform. 2024 Aug;156:104674. doi: 10.1016/j.jbi.2024.104674. Epub 2024 Jun 11.
2
Short-Term Memory Impairment短期记忆障碍
3
Advancing entity recognition in biomedicine via instruction tuning of large language models.通过指令调整大型语言模型推进生物医学中的实体识别。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae163.
4
Semi-supervised learning from small annotated data and large unlabeled data for fine-grained Participants, Intervention, Comparison, and Outcomes entity recognition.从小规模标注数据和大规模未标注数据中进行半监督学习,用于细粒度的参与者、干预措施、对照和结果实体识别。
J Am Med Inform Assoc. 2025 Mar 1;32(3):555-565. doi: 10.1093/jamia/ocae326.
5
From BERT to generative AI - Comparing encoder-only vs. large language models in a cohort of lung cancer patients for named entity recognition in unstructured medical reports.从BERT到生成式人工智能——在一组肺癌患者中比较仅编码器模型与大语言模型用于非结构化医疗报告中的命名实体识别
Comput Biol Med. 2025 Sep;195:110665. doi: 10.1016/j.compbiomed.2025.110665. Epub 2025 Jun 24.
6
Partial Annotation Learning for Biomedical Entity Recognition.用于生物医学实体识别的部分标注学习
IEEE J Biomed Health Inform. 2025 Feb;29(2):1409-1418. doi: 10.1109/JBHI.2024.3466294. Epub 2025 Feb 10.
7
Discontinuous named entities in clinical text: A systematic literature review.临床文本中的不连续命名实体:系统文献综述
J Biomed Inform. 2025 Feb;162:104783. doi: 10.1016/j.jbi.2025.104783. Epub 2025 Jan 23.
8
Knowledge Graph-Enhanced Deep Learning Model (H-SYSTEM) for Hypertensive Intracerebral Hemorrhage: Model Development and Validation.用于高血压性脑出血的知识图谱增强深度学习模型(H-SYSTEM):模型开发与验证
J Med Internet Res. 2025 Jun 12;27:e66055. doi: 10.2196/66055.
9
Extracting adverse drug events from clinical Notes: A systematic review of approaches used.从临床记录中提取药物不良事件:对所用方法的系统评价
J Biomed Inform. 2024 Mar;151:104603. doi: 10.1016/j.jbi.2024.104603. Epub 2024 Feb 6.
10
Sexual Harassment and Prevention Training性骚扰与预防培训