Suppr超能文献

基于知识的条件方法从自由文本中提取药物基因组学特定的药物-基因关系。

A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

机构信息

Medical Informatics Division, Case Western Reserve University, OH, USA.

出版信息

J Biomed Inform. 2012 Oct;45(5):827-34. doi: 10.1016/j.jbi.2012.04.011. Epub 2012 Apr 27.

Abstract

An important task in pharmacogenomics (PGx) studies is to identify genetic variants that may impact drug response. The success of many systematic and integrative computational approaches for PGx studies depends on the availability of accurate, comprehensive and machine understandable drug-gene relationship knowledge bases. Scientific literature is one of the most comprehensive knowledge sources for PGx-specific drug-gene relationships. However, the major barrier in accessing this information is that the knowledge is buried in a large amount of free text with limited machine understandability. Therefore there is a need to develop automatic approaches to extract structured PGx-specific drug-gene relationships from unstructured free text literature. In this study, we have developed a conditional relationship extraction approach to extract PGx-specific drug-gene pairs from 20 million MEDLINE abstracts using known drug-gene pairs as prior knowledge. We have demonstrated that the conditional drug-gene relationship extraction approach significantly improves the precision and F1 measure compared to the unconditioned approach (precision: 0.345 vs. 0.11; recall: 0.481 vs. 1.00; F1: 0.402 vs. 0.201). In this study, a method based on co-occurrence is used as the underlying relationship extraction method for its simplicity. It can be replaced by or combined with more advanced methods such as machine learning or natural language processing approaches to further improve the performance of the drug-gene relationship extraction from free text. Our method is not limited to extracting a drug-gene relationship; it can be generalized to extract other types of relationships when related background knowledge bases exist.

摘要

在药物基因组学(PGx)研究中,一个重要任务是识别可能影响药物反应的遗传变异。许多系统和综合的计算方法在 PGx 研究中的成功与否取决于是否有准确、全面且易于机器理解的药物-基因关系知识库。科学文献是 PGx 特定药物-基因关系最全面的知识来源之一。然而,获取这些信息的主要障碍是,这些知识埋藏在大量的自由文本中,机器理解能力有限。因此,需要开发自动方法从非结构化的自由文本文献中提取结构化的 PGx 特定药物-基因关系。在这项研究中,我们开发了一种条件关系提取方法,使用已知的药物-基因对作为先验知识,从 2000 万篇 MEDLINE 摘要中提取 PGx 特定的药物-基因对。我们证明,与非条件方法相比,条件药物-基因关系提取方法显著提高了精度和 F1 度量(精度:0.345 与 0.11;召回率:0.481 与 1.00;F1:0.402 与 0.201)。在这项研究中,基于共现的方法被用作基础关系提取方法,因为它简单。它可以被更先进的方法(如机器学习或自然语言处理方法)取代或结合,以进一步提高从自由文本中提取药物-基因关系的性能。我们的方法不仅限于提取药物-基因关系;当存在相关的背景知识库时,它可以推广到提取其他类型的关系。

相似文献

7
Using text to build semantic networks for pharmacogenomics.利用文本构建药物基因组学的语义网络。
J Biomed Inform. 2010 Dec;43(6):1009-19. doi: 10.1016/j.jbi.2010.08.005. Epub 2010 Aug 17.

引用本文的文献

5
Learning the Structure of Biomedical Relationships from Unstructured Text.从非结构化文本中学习生物医学关系的结构
PLoS Comput Biol. 2015 Jul 28;11(7):e1004216. doi: 10.1371/journal.pcbi.1004216. eCollection 2015 Jul.

本文引用的文献

2
Using text to build semantic networks for pharmacogenomics.利用文本构建药物基因组学的语义网络。
J Biomed Inform. 2010 Dec;43(6):1009-19. doi: 10.1016/j.jbi.2010.08.005. Epub 2010 Aug 17.
4
Predicting drug side-effects by chemical systems biology.通过化学系统生物学预测药物副作用。
Genome Biol. 2009;10(9):238. doi: 10.1186/gb-2009-10-9-238. Epub 2009 Sep 2.
9
Big data: The future of biocuration.大数据:生物编目的未来。
Nature. 2008 Sep 4;455(7209):47-50. doi: 10.1038/455047a.
10
Creating and evaluating genetic tests predictive of drug response.创建和评估预测药物反应的基因检测。
Nat Rev Drug Discov. 2008 Jul;7(7):568-74. doi: 10.1038/nrd2520. Epub 2008 Jun 20.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验