Computational Biology Group, Department of Clinical Laboratory Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Medical School, Observatory, Cape Town, 7925, South Africa.
BMC Bioinformatics. 2013 Sep 25;14:284. doi: 10.1186/1471-2105-14-284.
The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications.
We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries.
The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.
在蛋白质分析中使用基因本体论(GO)数据在很大程度上提高了这些分析的结果。近年来已经提出了几种 GO 语义相似性度量方法,为将 GO 结构中嵌入的生物学知识集成到不同的生物学分析中提供了工具。需要有一种统一的工具,为科学界提供机会来探索这些不同的 GO 相似性度量方法及其生物学应用。
我们开发了 DaGO-Fun,这是一个在线工具,可在 http://web.cbio.uct.ac.za/ITGOM 上访问,它结合了许多不同的 GO 相似性度量方法,用于探索、分析和比较 GO 术语和蛋白质在 GO 上下文中的关系。它使用 GO 数据和 UniProt 蛋白质及其 GO 注释,这些注释是由基因本体论注释(GOA)项目提供的,以预先计算 GO 术语的信息含量(IC),从而能够快速响应用户查询。
DaGO-Fun 在线工具的优势在于集成了所有相关的基于 IC 的 GO 相似性度量方法,包括基于拓扑和注释的方法,以方便有效地探索这些方法,从而使用户能够为他们的应用选择最相关的方法。此外,这个工具还包括与 GO 语义相似性评分相关的几个生物学应用,包括基于其 GO 注释检索基因、在一组中对功能相关基因进行聚类以及术语富集分析。