Inferring the semantic relationships of words within an ontology using random indexing: applications to pharmacogenomics.

Author information

Percha Bethany, Altman Russ B

Affiliation

Stanford University, Stanford, CA.

Publication information

AMIA Annu Symp Proc. 2013 Nov 16;2013:1123-32. eCollection 2013.

PMID:24551397

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3900134/

Abstract

The biomedical literature presents a uniquely challenging text mining problem. Sentences are long and complex, the subject matter is highly specialized with a distinct vocabulary, and producing annotated training data for this domain is time consuming and expensive. In this environment, unsupervised text mining methods that do not rely on annotated training data are valuable. Here we investigate the use of random indexing, an automated method for producing vector-space semantic representations of words from large, unlabeled corpora, to address the problem of term normalization in sentences describing drugs and genes. We show that random indexing produces similarity scores that capture some of the structure of PHARE, a manually curated ontology of pharmacogenomics concepts. We further show that random indexing can be used to identify likely word candidates for inclusion in the ontology, and can help localize these new labels among classes and roles within the ontology.
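As a rough illustration of the random indexing idea described in the abstract, the sketch below assigns each word a sparse random index vector and builds a word's context vector by summing the index vectors of its neighbors. It is a minimal toy under stated assumptions, not the authors' implementation: the dimensionality, sparsity, window size, tokenization, and the four example sentences are all illustrative choices that do not come from the paper.

```python
import math
import random
from collections import defaultdict

DIM = 512      # dimensionality of the reduced space (assumed value)
NONZERO = 8    # number of +1/-1 entries per index vector (assumed value)
WINDOW = 2     # context words considered on each side (assumed value)


def make_index_vector(rng):
    """Return a sparse ternary index vector as {position: +1 or -1}."""
    positions = rng.sample(range(DIM), NONZERO)
    return {p: (1 if i % 2 == 0 else -1) for i, p in enumerate(positions)}


def train(sentences, seed=0):
    """Build context vectors by summing the index vectors of nearby words."""
    rng = random.Random(seed)
    index = {}
    context = defaultdict(lambda: [0.0] * DIM)
    for sentence in sentences:
        tokens = sentence.lower().split()  # naive whitespace tokenization
        for tok in tokens:
            if tok not in index:
                index[tok] = make_index_vector(rng)
        for i, tok in enumerate(tokens):
            lo, hi = max(0, i - WINDOW), min(len(tokens), i + WINDOW + 1)
            for j in range(lo, hi):
                if j != i:
                    for pos, sign in index[tokens[j]].items():
                        context[tok][pos] += sign
    return dict(context)


def cosine(u, v):
    """Cosine similarity between two dense vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical toy corpus, made up for this example only.
TOY_CORPUS = [
    "warfarin inhibits vkorc1",
    "warfarin blocks vkorc1",
    "cyp2c9 metabolizes warfarin",
    "cyp2c9 metabolizes phenytoin",
]

vectors = train(TOY_CORPUS)
sim_related = cosine(vectors["inhibits"], vectors["blocks"])
sim_unrelated = cosine(vectors["inhibits"], vectors["phenytoin"])
```

In this toy corpus, "inhibits" and "blocks" occur with exactly the same neighbors, so their context vectors coincide, while "inhibits" and "phenytoin" share no neighbors and overlap only by chance. A similarity score of this kind is what one might compare against an ontology's structure, though the paper's actual corpus, preprocessing, and parameters are not given in this abstract.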
