从药物基因组学文献中自动提取信息的最新进展。

Recent progress in automatically extracting information from the pharmacogenomic literature.

机构信息

Biomedical Informatics, Stanford University, Stanford, CA 94305, USA.

出版信息

Pharmacogenomics. 2010 Oct;11(10):1467-89. doi: 10.2217/pgs.10.136.

DOI:10.2217/pgs.10.136

PMID:21047206

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3035632/

Abstract

The biomedical literature holds our understanding of pharmacogenomics, but it is dispersed across many journals. In order to integrate our knowledge, connect important facts across publications and generate new hypotheses we must organize and encode the contents of the literature. By creating databases of structured pharmocogenomic knowledge, we can make the value of the literature much greater than the sum of the individual reports. We can, for example, generate candidate gene lists or interpret surprising hits in genome-wide association studies. Text mining automatically adds structure to the unstructured knowledge embedded in millions of publications, and recent years have seen a surge in work on biomedical text mining, some specific to pharmacogenomics literature. These methods enable extraction of specific types of information and can also provide answers to general, systemic queries. In this article, we describe the main tasks of text mining in the context of pharmacogenomics, summarize recent applications and anticipate the next phase of text mining applications.

摘要

生物医学文献承载着我们对药物基因组学的理解，但它分散在许多期刊中。为了整合我们的知识，在出版物之间建立联系，并生成新的假设，我们必须对文献的内容进行组织和编码。通过创建结构化药物基因组学知识库，我们可以使文献的价值远远超过各个报告的总和。例如，我们可以生成候选基因列表，或解释全基因组关联研究中的惊人结果。文本挖掘自动为隐藏在数百万篇文献中的非结构化知识添加结构，近年来，生物医学文本挖掘工作大量涌现，其中一些专门针对药物基因组学文献。这些方法不仅能够提取特定类型的信息，还可以为一般性、系统性查询提供答案。在本文中，我们将描述在药物基因组学背景下文本挖掘的主要任务，总结最近的应用，并预测文本挖掘应用的下一阶段。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b87/3035632/6a281099c41a/nihms268321f1.jpg

相似文献

Recent progress in automatically extracting information from the pharmacogenomic literature.从药物基因组学文献中自动提取信息的最新进展。

Pharmacogenomics. 2010 Oct;11(10):1467-89. doi: 10.2217/pgs.10.136.

Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.Pharmspresso：一种用于从全文中提取药物基因组学概念和关系的文本挖掘工具。

BMC Bioinformatics. 2009 Feb 5;10 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-10-S2-S6.

Mining the pharmacogenomics literature--a survey of the state of the art.挖掘药物基因组学文献——技术现状调查。

Brief Bioinform. 2012 Jul;13(4):460-94. doi: 10.1093/bib/bbs018.

How to learn about gene function: text-mining or ontologies?如何了解基因功能：文本挖掘还是本体论？

Methods. 2015 Mar;74:3-15. doi: 10.1016/j.ymeth.2014.07.004. Epub 2014 Aug 1.

Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.精准医学的文本挖掘：为电子健康记录和生物医学文献构建结构以理解基因与健康。

Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.

Text-based knowledge discovery: search and mining of life-sciences documents.基于文本的知识发现：生命科学文献的搜索和挖掘。

Drug Discov Today. 2002 Jun 1;7(11):S89-98. doi: 10.1016/s1359-6446(02)02286-9.

Recent Advances and Emerging Applications in Text and Data Mining for Biomedical Discovery.用于生物医学发现的文本与数据挖掘的最新进展及新兴应用

Brief Bioinform. 2016 Jan;17(1):33-42. doi: 10.1093/bib/bbv087. Epub 2015 Sep 29.

Teaching computers to read the pharmacogenomics literature ... so you don't have to.教计算机阅读药物基因组学文献……这样你就不用自己读了。

Pharmacogenomics. 2010 Apr;11(4):515-8. doi: 10.2217/pgs.10.48.

Learning the Structure of Biomedical Relationships from Unstructured Text.从非结构化文本中学习生物医学关系的结构

PLoS Comput Biol. 2015 Jul 28;11(7):e1004216. doi: 10.1371/journal.pcbi.1004216. eCollection 2015 Jul.

Systematic identification of pharmacogenomics information from clinical trials.从临床试验中系统地识别药物基因组学信息。

J Biomed Inform. 2012 Oct;45(5):870-8. doi: 10.1016/j.jbi.2012.04.005. Epub 2012 Apr 24.

引用本文的文献

PharmGKB, an Integrated Resource of Pharmacogenomic Knowledge.PharmGKB，一个综合性的药物基因组学知识库。

Curr Protoc. 2021 Aug;1(8):e226. doi: 10.1002/cpz1.226.

The E. coli Whole-Cell Modeling Project.大肠杆菌全细胞建模项目。

EcoSal Plus. 2021 Dec 15;9(2):eESP00012020. doi: 10.1128/ecosalplus.ESP-0001-2020. Epub 2021 Jul 9.

A Decade of Pharmacogenetic Studies in Jordan: A Systemic Review.约旦十年药物遗传学研究：系统综述。

Pharmacogenomics J. 2021 Oct;21(5):543-550. doi: 10.1038/s41397-021-00236-6. Epub 2021 Apr 13.

Global Text Mining and Development of Pharmacogenomic Knowledge Resource for Precision Medicine.用于精准医学的全球文本挖掘与药物基因组学知识资源开发。

Front Pharmacol. 2019 Aug 7;10:839. doi: 10.3389/fphar.2019.00839. eCollection 2019.

PGxO and PGxLOD: a reconciliation of pharmacogenomic knowledge of various provenances, enabling further comparison.PGxO 和 PGxLOD：对各种来源的药物基因组学知识进行协调，从而实现进一步比较。

BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):139. doi: 10.1186/s12859-019-2693-9.

PubCaseFinder: A Case-Report-Based, Phenotype-Driven Differential-Diagnosis System for Rare Diseases.PubCaseFinder：一种基于病例报告、表型驱动的罕见病鉴别诊断系统。

Am J Hum Genet. 2018 Sep 6;103(3):389-399. doi: 10.1016/j.ajhg.2018.08.003. Epub 2018 Aug 30.

Harnessing Biomedical Natural Language Processing Tools to Identify Medicinal Plant Knowledge from Historical Texts.利用生物医学自然语言处理工具从历史文本中识别药用植物知识。

AMIA Annu Symp Proc. 2018 Apr 16;2017:1537-1546. eCollection 2017.

Automated Metabolic Phenotyping of Cytochrome Polymorphisms Using PubMed Abstract Mining.利用PubMed摘要挖掘技术对细胞色素多态性进行自动代谢表型分析

AMIA Annu Symp Proc. 2018 Apr 16;2017:535-544. eCollection 2017.

eGARD: Extracting associations between genomic anomalies and drug responses from text.eGARD：从文本中提取基因组异常与药物反应之间的关联。

PLoS One. 2017 Dec 20;12(12):e0189663. doi: 10.1371/journal.pone.0189663. eCollection 2017.

Classification and analysis of a large collection of in vivo bioassay descriptions.大量体内生物测定描述的分类与分析

PLoS Comput Biol. 2017 Jul 5;13(7):e1005641. doi: 10.1371/journal.pcbi.1005641. eCollection 2017 Jul.

本文引用的文献

A Comprehensive Analysis of Five Million UMLS Metathesaurus Terms Using Eighteen Million MEDLINE Citations.使用一千八百万条MEDLINE引文对五百万条统一医学语言系统（UMLS）元词表术语进行的综合分析。

AMIA Annu Symp Proc. 2010 Nov 13;2010:907-11.

Using text to build semantic networks for pharmacogenomics.利用文本构建药物基因组学的语义网络。

J Biomed Inform. 2010 Dec;43(6):1009-19. doi: 10.1016/j.jbi.2010.08.005. Epub 2010 Aug 17.

An overview of MetaMap: historical perspective and recent advances.MetaMap 概述：历史视角与最新进展。

J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.

Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents.解决药理学文献中药物-药物相互作用提取的回指问题。

BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-11-S2-S1.

Teaching computers to read the pharmacogenomics literature ... so you don't have to.教计算机阅读药物基因组学文献……这样你就不用自己读了。

Pharmacogenomics. 2010 Apr;11(4):515-8. doi: 10.2217/pgs.10.48.

Modeling sample variables with an Experimental Factor Ontology.运用实验因子本体对样本变量进行建模。

Bioinformatics. 2010 Apr 15;26(8):1112-8. doi: 10.1093/bioinformatics/btq099. Epub 2010 Mar 3.

A side effect resource to capture phenotypic effects of drugs.一个用于捕捉药物表型效应的副作用资源。

Mol Syst Biol. 2010;6:343. doi: 10.1038/msb.2009.98. Epub 2010 Jan 19.

Synthesis of pharmacokinetic pathways through knowledge acquisition and automated reasoning.通过知识获取和自动推理合成药代动力学途径。

Pac Symp Biocomput. 2010:465-76. doi: 10.1142/9789814295291_0048.

Improving the prediction of pharmacogenes using text-derived drug-gene relationships.利用文本衍生的药物-基因关系改进药物基因的预测。

Pac Symp Biocomput. 2010:305-14. doi: 10.1142/9789814295291_0033.

Extraction of genotype-phenotype-drug relationships from text: from entity recognition to bioinformatics application.从文本中提取基因型-表型-药物关系：从实体识别到生物信息学应用。

Pac Symp Biocomput. 2010:485-7. doi: 10.1142/9789814295291_0051.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验