Suppr超能文献

词义消歧准确性对基于文献的发现的影响。

The effect of word sense disambiguation accuracy on literature based discovery.

作者信息

Preiss Judita, Stevenson Mark

机构信息

Advanced Computing Research Center, Department of Computer Science, The University of Sheffield, 211 Portobello, Sheffield, S1 4DP, UK.

出版信息

BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):57. doi: 10.1186/s12911-016-0296-1.

Abstract

BACKGROUND

The volume of research published in the biomedical domain has increasingly lead to researchers focussing on specific areas of interest and connections between findings being missed. Literature based discovery (LBD) attempts to address this problem by searching for previously unnoticed connections between published information (also known as "hidden knowledge"). A common approach is to identify hidden knowledge via shared linking terms. However, biomedical documents are highly ambiguous which can lead LBD systems to over generate hidden knowledge by hypothesising connections through different meanings of linking terms. Word Sense Disambiguation (WSD) aims to resolve ambiguities in text by identifying the meaning of ambiguous terms. This study explores the effect of WSD accuracy on LBD performance.

METHODS

An existing LBD system is employed and four approaches to WSD of biomedical documents integrated with it. The accuracy of each WSD approach is determined by comparing its output against a standard benchmark. Evaluation of the LBD output is carried out using timeslicing approach, where hidden knowledge is generated from articles published prior to a certain cutoff date and a gold standard extracted from publications after the cutoff date.

RESULTS

WSD accuracy varies depending on the approach used. The connection between the performance of the LBD and WSD systems are analysed to reveal a correlation between WSD accuracy and LBD performance.

CONCLUSION

This study reveals that LBD performance is sensitive to WSD accuracy. It is therefore concluded that WSD has the potential to improve the output of LBD systems by reducing the amount of spurious hidden knowledge that is generated. It is also suggested that further improvements in WSD accuracy have the potential to improve LBD accuracy.

摘要

背景

生物医学领域发表的研究数量日益增加,这使得研究人员越来越专注于特定的感兴趣领域,从而忽略了研究结果之间的联系。基于文献的发现(LBD)试图通过搜索已发表信息之间以前未被注意到的联系(也称为“隐藏知识”)来解决这个问题。一种常见的方法是通过共享链接词来识别隐藏知识。然而,生物医学文档具有高度的歧义性,这可能导致LBD系统通过对链接词的不同含义进行假设来过度生成隐藏知识。词义消歧(WSD)旨在通过识别歧义词的含义来解决文本中的歧义。本研究探讨了WSD准确性对LBD性能的影响。

方法

采用现有的LBD系统,并将四种生物医学文档WSD方法与之集成。每种WSD方法的准确性通过将其输出与标准基准进行比较来确定。使用时间切片方法对LBD输出进行评估,其中隐藏知识是从某个截止日期之前发表的文章中生成的,而黄金标准是从截止日期之后的出版物中提取的。

结果

WSD准确性因所使用的方法而异。分析了LBD和WSD系统性能之间的联系,以揭示WSD准确性与LBD性能之间的相关性。

结论

本研究表明LBD性能对WSD准确性敏感。因此得出结论,WSD有潜力通过减少生成的虚假隐藏知识的数量来提高LBD系统的输出。还建议进一步提高WSD准确性有可能提高LBD准确性。

相似文献

1
The effect of word sense disambiguation accuracy on literature based discovery.词义消歧准确性对基于文献的发现的影响。
BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1(Suppl 1):57. doi: 10.1186/s12911-016-0296-1.
2
Determining the difficulty of Word Sense Disambiguation.确定词义消歧的难度。
J Biomed Inform. 2014 Feb;47:83-90. doi: 10.1016/j.jbi.2013.09.009. Epub 2013 Sep 26.
9
Supervised Learning and Knowledge-Based Approaches Applied to Biomedical Word Sense Disambiguation.应用于生物医学词义消歧的监督学习和基于知识的方法。
J Integr Bioinform. 2017 Dec 13;14(4):/j/jib.2017.14.issue-4/jib-2017-0051/jib-2017-0051.xml. doi: 10.1515/jib-2017-0051.

本文引用的文献

1
Exploring relation types for literature-based discovery.探索基于文献的发现中的关系类型。
J Am Med Inform Assoc. 2015 Sep;22(5):987-92. doi: 10.1093/jamia/ocv002. Epub 2015 May 13.
3
Graph-based word sense disambiguation of biomedical documents.基于图的生物医学文献词义消歧。
Bioinformatics. 2010 Nov 15;26(22):2889-96. doi: 10.1093/bioinformatics/btq555. Epub 2010 Oct 7.
4
Disambiguation in the biomedical domain: the role of ambiguity type.生物医学领域的消歧:歧义类型的作用。
J Biomed Inform. 2010 Dec;43(6):972-81. doi: 10.1016/j.jbi.2010.08.009. Epub 2010 Sep 9.
5
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
6
A new evaluation methodology for literature-based discovery systems.一种基于文献的发现系统的新评估方法。
J Biomed Inform. 2009 Aug;42(4):633-43. doi: 10.1016/j.jbi.2008.12.001. Epub 2008 Dec 16.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验