• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于融合多特征嵌入的生物医学命名实体识别。

Biomedical named entity recognition based on fusion multi-features embedding.

出版信息

Technol Health Care. 2023;31(S1):111-121. doi: 10.3233/THC-236011.

DOI:10.3233/THC-236011
PMID:37038786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10258877/
Abstract

BACKGROUND

With the exponential increase in the volume of biomedical literature, text mining tasks are becoming increasingly important in the medical domain. Named entities are the primary identification tasks in text mining, prerequisites and critical parts for building medical domain knowledge graphs, medical question and answer systems, medical text classification.

OBJECTIVE

The study goal is to recognize biomedical entities effectively by fusing multi-feature embedding. Multiple features provide more comprehensive information so that better predictions can be obtained.

METHODS

Firstly, three different kinds of features are generated, including deep contextual word-level features, local char-level features, and part-of-speech features at the word representation layer. The word representation vectors are inputs into BiLSTM as features to obtain the dependency information. Finally, the CRF algorithm is used to learn the features of the state sequences to obtain the global optimal tagging sequences.

RESULTS

The experimental results showed that the model outperformed other state-of-the-art methods for all-around performance in six datasets among eight of four biomedical entity types.

CONCLUSION

The proposed method has a positive effect on the prediction results. It comprehensively considers the relevant factors of named entity recognition because the semantic information is enhanced by fusing multi-features embedding.

摘要

背景

随着生物医学文献数量的指数级增长,文本挖掘任务在医学领域变得越来越重要。命名实体是文本挖掘的主要识别任务,是构建医学领域知识图谱、医学问答系统、医学文本分类等的前提和关键部分。

目的

本研究旨在通过融合多特征嵌入来有效地识别生物医学实体。多个特征提供了更全面的信息,从而可以获得更好的预测结果。

方法

首先,生成三种不同类型的特征,包括深度上下文单词级特征、局部字符级特征和单词表示层的词性特征。将单词表示向量作为特征输入到 BiLSTM 中,以获取依赖信息。最后,使用 CRF 算法学习状态序列的特征,以获得全局最优标记序列。

结果

实验结果表明,在六个数据集的八个生物医学实体类型中的四个中,该模型在所有方面的性能均优于其他最先进的方法。

结论

所提出的方法对预测结果有积极影响。它通过融合多特征嵌入来增强语义信息,从而全面考虑了命名实体识别的相关因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d094/10258877/0922b4a77acc/thc-31-thc236011-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d094/10258877/693d4389e0b4/thc-31-thc236011-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d094/10258877/0922b4a77acc/thc-31-thc236011-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d094/10258877/693d4389e0b4/thc-31-thc236011-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d094/10258877/0922b4a77acc/thc-31-thc236011-g002.jpg

相似文献

1
Biomedical named entity recognition based on fusion multi-features embedding.基于融合多特征嵌入的生物医学命名实体识别。
Technol Health Care. 2023;31(S1):111-121. doi: 10.3233/THC-236011.
2
BioBBC: a multi-feature model that enhances the detection of biomedical entities.生物 BBC:一种增强生物医学实体检测的多特征模型。
Sci Rep. 2024 Apr 2;14(1):7697. doi: 10.1038/s41598-024-58334-x.
3
BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework.BioByGANS:通过图注意力网络在节点分类框架中融合上下文和句法特征进行生物医学命名实体识别。
BMC Bioinformatics. 2022 Nov 22;23(1):501. doi: 10.1186/s12859-022-05051-9.
4
Language model based on deep learning network for biomedical named entity recognition.基于深度学习网络的生物医学命名实体识别语言模型。
Methods. 2024 Jun;226:71-77. doi: 10.1016/j.ymeth.2024.04.013. Epub 2024 Apr 17.
5
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
6
A BIGRU-Based Stacked Attention Network for Biomedical Named Entity Recognition with Chinese EMRs.基于 BIGRU 的堆叠注意力网络在中文电子病历中的生物医学命名实体识别。
Stud Health Technol Inform. 2023 Nov 23;308:757-767. doi: 10.3233/SHTI230909.
7
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT.基于混合神经网络和医学 MC-BERT 的中文电子病历命名实体识别。
BMC Med Inform Decis Mak. 2022 Dec 1;22(1):315. doi: 10.1186/s12911-022-02059-2.
8
An imConvNet-based deep learning model for Chinese medical named entity recognition.基于 imConvNet 的深度学习模型在中文医疗命名实体识别中的应用。
BMC Med Inform Decis Mak. 2022 Nov 21;22(1):303. doi: 10.1186/s12911-022-02049-4.
9
Comparing general and specialized word embeddings for biomedical named entity recognition.比较用于生物医学命名实体识别的通用词嵌入和专用词嵌入。
PeerJ Comput Sci. 2021 Feb 18;7:e384. doi: 10.7717/peerj-cs.384. eCollection 2021.
10
Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.基于具有句法依存特征的多头注意力模型的生物医学文本关系抽取:建模研究
JMIR Med Inform. 2022 Oct 20;10(10):e41136. doi: 10.2196/41136.

引用本文的文献

1
Applied artificial intelligence in dentistry: emerging data modalities and modeling approaches.人工智能在牙科中的应用:新兴数据模式与建模方法
Front Artif Intell. 2024 Jul 23;7:1427517. doi: 10.3389/frai.2024.1427517. eCollection 2024.

本文引用的文献

1
A review of biomedical datasets relating to drug discovery: a knowledge graph perspective.生物医学数据集在药物发现中的应用综述:知识图谱视角。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac404.
2
BERN2: an advanced neural biomedical named entity recognition and normalization tool.BERN2:一种先进的神经生物医学命名实体识别和标准化工具。
Bioinformatics. 2022 Oct 14;38(20):4837-4839. doi: 10.1093/bioinformatics/btac598.
3
Noise Reduction Learning Based on XLNet-CRF for Biomedical Named Entity Recognition.基于XLNet-CRF的降噪学习在生物医学命名实体识别中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):595-605. doi: 10.1109/TCBB.2022.3157630. Epub 2023 Feb 3.
4
Hierarchical shared transfer learning for biomedical named entity recognition.基于层次共享迁移学习的生物医学命名实体识别。
BMC Bioinformatics. 2022 Jan 4;23(1):8. doi: 10.1186/s12859-021-04551-4.
5
Deep learning methods for biomedical named entity recognition: a survey and qualitative comparison.深度学习方法在生物医学命名实体识别中的应用:综述与定性比较。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab282.
6
Biomedical named entity recognition using BERT in the machine reading comprehension framework.基于机器阅读理解框架的 BERT 在生物医学命名实体识别中的应用。
J Biomed Inform. 2021 Jun;118:103799. doi: 10.1016/j.jbi.2021.103799. Epub 2021 May 6.
7
Application and evaluation of knowledge graph embeddings in biomedical data.知识图谱嵌入技术在生物医学数据中的应用与评估
PeerJ Comput Sci. 2021 Feb 18;7:e341. doi: 10.7717/peerj-cs.341. eCollection 2021.
8
A pre-training and self-training approach for biomedical named entity recognition.一种用于生物医学命名实体识别的预训练和自训练方法。
PLoS One. 2021 Feb 9;16(2):e0246310. doi: 10.1371/journal.pone.0246310. eCollection 2021.
9
HunFlair: an easy-to-use tool for state-of-the-art biomedical named entity recognition.HunFlair:一种用于最先进生物医学命名实体识别的易于使用的工具。
Bioinformatics. 2021 Sep 9;37(17):2792-2794. doi: 10.1093/bioinformatics/btab042.
10
DTranNER: biomedical named entity recognition with deep learning-based label-label transition model.DTranNER:基于深度学习的标签-标签转换模型的生物医学命名实体识别。
BMC Bioinformatics. 2020 Feb 11;21(1):53. doi: 10.1186/s12859-020-3393-1.