利用领域信息进行医学文献的词义消歧。

Exploiting domain information for Word Sense Disambiguation of medical documents.

机构信息

Department of Computer Science, Sheffield University, Sheffield, UK.

出版信息

J Am Med Inform Assoc. 2012 Mar-Apr;19(2):235-40. doi: 10.1136/amiajnl-2011-000415. Epub 2011 Sep 7.

DOI:10.1136/amiajnl-2011-000415

PMID:21900701

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3277615/

Abstract

OBJECTIVE

Current techniques for knowledge-based Word Sense Disambiguation (WSD) of ambiguous biomedical terms rely on relations in the Unified Medical Language System Metathesaurus but do not take into account the domain of the target documents. The authors' goal is to improve these methods by using information about the topic of the document in which the ambiguous term appears.

DESIGN

The authors proposed and implemented several methods to extract lists of key terms associated with Medical Subject Heading terms. These key terms are used to represent the document topic in a knowledge-based WSD system. They are applied both alone and in combination with local context.

MEASUREMENTS

A standard measure of accuracy was calculated over the set of target words in the widely used National Library of Medicine WSD dataset.

RESULTS AND DISCUSSION

The authors report a significant improvement when combining those key terms with local context, showing that domain information improves the results of a WSD system based on the Unified Medical Language System Metathesaurus alone. The best results were obtained using key terms obtained by relevance feedback and weighted by inverse document frequency.

摘要

目的

当前基于知识的生物医学术语歧义消解（WSD）技术依赖于统一医学语言系统术语表中的关系，但并未考虑目标文档的领域。作者的目标是通过使用在出现歧义术语的文档的主题信息来改进这些方法。

设计

作者提出并实现了几种方法来提取与医学主题词相关的关键词列表。这些关键词用于在基于知识的 WSD 系统中表示文档主题。它们单独使用或与局部上下文结合使用。

测量

在广泛使用的国家医学图书馆 WSD 数据集的目标词集上计算了标准准确性度量。

结果与讨论

作者报告说，当将这些关键词与局部上下文结合使用时，准确性有了显著提高，这表明领域信息可以提高仅基于统一医学语言系统术语表的 WSD 系统的结果。使用通过相关性反馈获得的关键词并按逆文档频率加权得到的最佳结果。

相似文献

Exploiting domain information for Word Sense Disambiguation of medical documents.利用领域信息进行医学文献的词义消歧。

J Am Med Inform Assoc. 2012 Mar-Apr;19(2):235-40. doi: 10.1136/amiajnl-2011-000415. Epub 2011 Sep 7.

Collocation analysis for UMLS knowledge-based word sense disambiguation.基于 UMLS 的词汇搭配分析在词义消歧中的应用。

BMC Bioinformatics. 2011 Jun 9;12 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2105-12-S3-S4.

Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation.利用 MEDLINE 中的 MeSH 索引生成用于词义消歧的数据集合。

BMC Bioinformatics. 2011 Jun 2;12:223. doi: 10.1186/1471-2105-12-223.

Disambiguation of ambiguous biomedical terms using examples generated from the UMLS Metathesaurus.利用 UMLS Metathesaurus 生成的示例对模糊生物医学术语进行消歧。

J Biomed Inform. 2010 Oct;43(5):762-73. doi: 10.1016/j.jbi.2010.06.001. Epub 2010 Jun 10.

Determining the difficulty of Word Sense Disambiguation.确定词义消歧的难度。

J Biomed Inform. 2014 Feb;47:83-90. doi: 10.1016/j.jbi.2013.09.009. Epub 2013 Sep 26.

Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification.基于知识的生物医学词义消歧：评估及在临床文档分类中的应用。

J Am Med Inform Assoc. 2013 Sep-Oct;20(5):882-6. doi: 10.1136/amiajnl-2012-001350. Epub 2012 Oct 16.

Disambiguation in the biomedical domain: the role of ambiguity type.生物医学领域的消歧：歧义类型的作用。

J Biomed Inform. 2010 Dec;43(6):972-81. doi: 10.1016/j.jbi.2010.08.009. Epub 2010 Sep 9.

Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods.临床领域的词义消歧：知识丰富和知识贫乏的无监督方法比较。

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):842-9. doi: 10.1136/amiajnl-2013-002133. Epub 2014 Jan 17.

deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.深度生物词汇语义消歧：生物医学文本数据的有效深度神经网络词汇语义消歧。

J Am Med Inform Assoc. 2019 May 1;26(5):438-446. doi: 10.1093/jamia/ocy189.

Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts.研究不同词义消歧方法与生物医学文本摘要有效性之间的相关性。

BMC Bioinformatics. 2011 Aug 26;12:355. doi: 10.1186/1471-2105-12-355.

引用本文的文献

Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.医学概念规范化中的歧义：电子健康记录数据集的类型和覆盖范围分析。

J Am Med Inform Assoc. 2021 Mar 1;28(3):516-532. doi: 10.1093/jamia/ocaa269.

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):842-9. doi: 10.1136/amiajnl-2013-002133. Epub 2014 Jan 17.

Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.电子健康记录驱动的表型分析：挑战、最新进展与展望

J Am Med Inform Assoc. 2013 Dec;20(e2):e206-11. doi: 10.1136/amiajnl-2013-002428.

A novel approach to word sense disambiguation based on topical and semantic association.一种基于主题和语义关联的词义消歧新方法。

ScientificWorldJournal. 2013 Oct 31;2013:586327. doi: 10.1155/2013/586327. eCollection 2013.

Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text.评估语义相似性和关联性的度量标准，以消除生物医学文本中的术语歧义。

J Biomed Inform. 2013 Dec;46(6):1116-24. doi: 10.1016/j.jbi.2013.08.008. Epub 2013 Sep 4.

Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.结合源自语料库的词义概况与估计的频率信息来消除临床缩写的歧义。

AMIA Annu Symp Proc. 2012;2012:1004-13. Epub 2012 Nov 3.

A learning-based approach for biomedical word sense disambiguation.一种基于学习的生物医学词义消歧方法。

ScientificWorldJournal. 2012;2012:949247. doi: 10.1100/2012/949247. Epub 2012 May 1.

本文引用的文献

Automatic Indexing of Documents from Journal Descriptors: A Preliminary Investigation.基于期刊描述符的文档自动索引：初步调查

J Am Soc Inf Sci. 1999;50(8):661-674. doi: 10.1002/(SICI)1097-4571(1999)50:8<661::AID-ASI4>3.0.CO;2-R.

Knowledge-based biomedical word sense disambiguation: comparison of approaches.基于知识的生物医学词义消歧：方法比较。

BMC Bioinformatics. 2010 Nov 22;11:569. doi: 10.1186/1471-2105-11-569.

Graph-based word sense disambiguation of biomedical documents.基于图的生物医学文献词义消歧。

Bioinformatics. 2010 Nov 15;26(22):2889-96. doi: 10.1093/bioinformatics/btq555. Epub 2010 Oct 7.

Disambiguation in the biomedical domain: the role of ambiguity type.生物医学领域的消歧：歧义类型的作用。

J Biomed Inform. 2010 Dec;43(6):972-81. doi: 10.1016/j.jbi.2010.08.009. Epub 2010 Sep 9.

Disambiguation of ambiguous biomedical terms using examples generated from the UMLS Metathesaurus.利用 UMLS Metathesaurus 生成的示例对模糊生物医学术语进行消歧。

J Biomed Inform. 2010 Oct;43(5):762-73. doi: 10.1016/j.jbi.2010.06.001. Epub 2010 Jun 10.

An overview of MetaMap: historical perspective and recent advances.MetaMap 概述：历史视角与最新进展。

J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.

Word Sense Disambiguation by Selecting the Best Semantic Type Based on Journal Descriptor Indexing: Preliminary Experiment.基于期刊描述符索引选择最佳语义类型的词义消歧：初步实验

J Am Soc Inf Sci Technol. 2006 Jan 1;57(1):96-113. doi: 10.1002/asi.20257.

Disambiguation of biomedical text using diverse sources of information.利用多种信息来源对生物医学文本进行消歧。

BMC Bioinformatics. 2008 Nov 19;9 Suppl 11(Suppl 11):S7. doi: 10.1186/1471-2105-9-S11-S7.

Word sense disambiguation across two domains: biomedical literature and clinical notes.跨两个领域的词义消歧：生物医学文献和临床记录。

J Biomed Inform. 2008 Dec;41(6):1088-100. doi: 10.1016/j.jbi.2008.02.003. Epub 2008 Mar 4.

Impact of web searching and social feedback on consumer decision making: a prospective online experiment.网络搜索和社交反馈对消费者决策的影响：一项前瞻性在线实验

J Med Internet Res. 2008 Jan 22;10(1):e2. doi: 10.2196/jmir.963.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验