
Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.

Affiliations

Department of Mathematics, Florida State University, Tallahassee, FL, USA.

Department of Computer Science, Florida State University, Tallahassee, FL, USA.

Publication

BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):502. doi: 10.1186/s12859-019-3079-8.

Abstract

BACKGROUND

In recent years, deep learning methods have been applied to many natural language processing tasks and achieved state-of-the-art performance. In the biomedical domain, however, they have not outperformed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly due to inherent similarities between medical word senses.

RESULTS

In this paper, we propose two deep-learning-based models for supervised WSD: a model based on a bidirectional long short-term memory (BiLSTM) network, and an attention model based on a self-attention architecture. Our results show that the BiLSTM model with a suitable upper-layer structure performs even better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model runs 3 to 4 times faster than the BiLSTM model with good accuracy. In addition, we trained "universal" models that disambiguate all ambiguous words together: in these models, the embedding of the target ambiguous word is concatenated to the max-pooled vector, acting as a "hint". Our universal BiLSTM model achieved about 90 percent accuracy.
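The upper-layer structure of the universal model described above (max-pooling over the per-token contextual vectors, with the target word's embedding concatenated as a "hint" before a softmax sense classifier) can be sketched roughly as follows. This is a minimal illustration, not the paper's actual implementation: the dimensions are arbitrary, and random vectors stand in for the BiLSTM outputs and the pre-trained target-word embedding.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

# Illustrative dimensions (not the paper's hyperparameters)
seq_len, hidden, emb, n_senses = 12, 8, 8, 2

# Stand-ins for BiLSTM outputs: one contextual vector per token,
# of size 2*hidden because forward and backward states are concatenated
context = rng.normal(size=(seq_len, 2 * hidden))

# Max-pool over the time axis to get a fixed-size sentence vector
pooled = context.max(axis=0)                     # shape (2*hidden,)

# "Hint": the target ambiguous word's embedding, concatenated to the
# pooled vector so one classifier can serve all ambiguous words
target_emb = rng.normal(size=emb)
features = np.concatenate([pooled, target_emb])  # shape (2*hidden + emb,)

# Linear softmax layer over the candidate sense labels
W = rng.normal(size=(n_senses, features.size))
b = np.zeros(n_senses)
probs = softmax(W @ features + b)

predicted_sense = int(probs.argmax())
```

The concatenated hint is what makes a single "universal" classifier possible: without it, the pooled sentence vector alone would not tell the model which word's senses it is choosing between.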

CONCLUSION

Deep contextual models based on sequential information processing are able to capture the relevant contextual information from pre-trained input word embeddings, providing state-of-the-art results for supervised biomedical WSD tasks.

