Suppr超能文献

基因本体术语的组成结构。

The compositional structure of Gene Ontology terms.

作者信息

Ogren P V, Cohen K B, Acquaah-Mensah G K, Eberlein J, Hunter L

机构信息

University of Colorado at Boulder, Dept. of Computer Science, Boulder, CO, USA.

出版信息

Pac Symp Biocomput. 2004:214-25. doi: 10.1142/9789812704856_0021.

Abstract

An analysis of the term names in the Gene Ontology reveals the prevalence of substring relations between terms: 65.3% of all GO terms contain another GO term as a proper substring. This substring relation often coincides with a derivational relationship between the terms. For example, the term regulation of cell proliferation (GO:0042127) is derived from the term cell proliferation (GO:0008283) by addition of the phrase regulation of. Further, we note that particular substrings which are not themselves GO terms (e.g. regulation of in the preceding example) recur frequently and in consistent subtrees of the ontology, and that these frequently occurring substrings often indicate interesting semantic relationships between the related terms. We describe the extent of these phenomena--substring relations between terms, and the recurrence of derivational phrases such as regulation of--and propose that these phenomena can be exploited in various ways to make the information in GO more computationally accessible, to construct a conceptually richer representation of the data encoded in the ontology, and to assist in the analysis of natural language texts.

摘要

对基因本体论中术语名称的分析揭示了术语之间子串关系的普遍性

所有基因本体论术语中有65.3%包含另一个基因本体论术语作为其恰当的子串。这种子串关系常常与术语之间的派生关系相吻合。例如,细胞增殖调控(GO:0042127)这个术语是通过添加“调控”这个短语从细胞增殖(GO:0008283)这个术语派生而来的。此外,我们注意到,那些本身并非基因本体论术语的特定子串(如前例中的“调控”)在本体论的一致子树中频繁出现,并且这些频繁出现的子串常常表明相关术语之间存在有趣的语义关系。我们描述了这些现象的程度——术语之间的子串关系以及诸如“调控”等派生短语的反复出现——并提出可以通过多种方式利用这些现象,以使基因本体论中的信息在计算上更易于获取,构建一个在概念上更丰富的本体论编码数据表示,并协助分析自然语言文本。

相似文献

1
The compositional structure of Gene Ontology terms.
Pac Symp Biocomput. 2004:214-25. doi: 10.1142/9789812704856_0021.
2
A relation based measure of semantic similarity for Gene Ontology annotations.
BMC Bioinformatics. 2008 Nov 4;9:468. doi: 10.1186/1471-2105-9-468.
3
Enrichment of OBO ontologies.
J Biomed Inform. 2007 Jun;40(3):300-15. doi: 10.1016/j.jbi.2006.07.003. Epub 2006 Jul 26.
4
Additional gene ontology structure for improved biological reasoning.
Bioinformatics. 2006 Aug 15;22(16):2020-7. doi: 10.1093/bioinformatics/btl334. Epub 2006 Jun 20.
5
6
Textpresso: an ontology-based information retrieval and extraction system for biological literature.
PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21.
7
Integration of the Gene Ontology into an object-oriented architecture.
BMC Bioinformatics. 2005 May 10;6:113. doi: 10.1186/1471-2105-6-113.
8
GOSemSim: an R package for measuring semantic similarity among GO terms and gene products.
Bioinformatics. 2010 Apr 1;26(7):976-8. doi: 10.1093/bioinformatics/btq064. Epub 2010 Feb 23.
10
Gene Ontology synonym generation rules lead to increased performance in biomedical concept recognition.
J Biomed Semantics. 2016 Sep 9;7(1):52. doi: 10.1186/s13326-016-0096-7.

引用本文的文献

2
SSIF: Subsumption-based Sub-term Inference Framework to audit Gene Ontology.
Bioinformatics. 2020 May 1;36(10):3207-3214. doi: 10.1093/bioinformatics/btaa106.
3
A new synonym-substitution method to enrich the human phenotype ontology.
BMC Bioinformatics. 2017 Oct 10;18(1):446. doi: 10.1186/s12859-017-1858-7.
4
Gene Ontology synonym generation rules lead to increased performance in biomedical concept recognition.
J Biomed Semantics. 2016 Sep 9;7(1):52. doi: 10.1186/s13326-016-0096-7.
6
Management of Dynamic Biomedical Terminologies: Current Status and Future Challenges.
Yearb Med Inform. 2015 Aug 13;10(1):125-33. doi: 10.15265/IY-2015-002.
7
Exploitation of semantic methods to cluster pharmacovigilance terms.
J Biomed Semantics. 2014 Apr 16;5:18. doi: 10.1186/2041-1480-5-18. eCollection 2014.
8
A web-portal for interactive data exploration, visualization, and hypothesis testing.
Front Neuroinform. 2014 Mar 26;8:25. doi: 10.3389/fninf.2014.00025. eCollection 2014.
10
Dissecting the Ambiguity of FMA Concept Names Using Taxonomy and Partonomy Structural Information.
AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:157-61. eCollection 2013.

本文引用的文献

1
Bringing ontology to the gene ontology.
Comp Funct Genomics. 2003;4(1):90-3. doi: 10.1002/cfg.253.
2
A methodology to migrate the gene ontology to a description logic environment using DAML+OIL.
Pac Symp Biocomput. 2003:624-35. doi: 10.1142/9789812776303_0058.
3
Knowledge acquisition, consistency checking and concurrency control for Gene Ontology (GO).
Bioinformatics. 2003 Jan 22;19(2):241-8. doi: 10.1093/bioinformatics/19.2.241.
6
Creating the gene ontology resource: design and implementation.
Genome Res. 2001 Aug;11(8):1425-33. doi: 10.1101/gr.180801.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验