GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.

Author information

Eden Eran, Navon Roy, Steinfeld Israel, Lipson Doron, Yakhini Zohar

Affiliation

Molecular Cell Biology Department, Weizmann Institute of Science, Rehovot, Israel.

Publication information

BMC Bioinformatics. 2009 Feb 3;10:48. doi: 10.1186/1471-2105-10-48.

Abstract

BACKGROUND

Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.

RESULTS

GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in the many typical cases where genomic data are naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). GOrilla employs a flexible-threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousands of genes and thousands of GO terms on the order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms.
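
To make the flexible-threshold idea concrete, the following is a minimal sketch, not GOrilla's actual code. It assumes that mHG denotes the minimum, over all cutoffs n of the ranked list, of the hypergeometric tail probability of seeing at least the observed number of annotated genes in the top n. The exact p-value is then obtained by a dynamic program over the (n, b) lattice, which is one way to account for having tried every cutoff; whether GOrilla uses this precise recursion is an assumption. All function names are hypothetical, and the code favors clarity over speed.

```python
from math import comb


def hypergeom_tail(N, B, n, b):
    """P(X >= b) when n genes are drawn from N, of which B are annotated."""
    upper = min(n, B)
    hits = sum(comb(B, i) * comb(N - B, n - i) for i in range(b, upper + 1))
    return hits / comb(N, n)


def mhg_statistic(labels):
    """labels: 0/1 values in ranked order (1 = gene carries the GO term).

    Returns (score, cutoff): the smallest tail probability over all
    cutoffs, and the cutoff n at which it occurs (0 if no cutoff beats 1.0).
    """
    N, B = len(labels), sum(labels)
    best, best_n, b = 1.0, 0, 0
    for n, lab in enumerate(labels, start=1):
        b += lab
        p = hypergeom_tail(N, B, n, b)
        if p < best:
            best, best_n = p, n
    return best, best_n


def mhg_pvalue(N, B, score):
    """P(mHG <= score) for a uniformly random ranking of B ones among N.

    pi[b] holds the probability of reaching lattice point (n, b) while every
    cutoff so far has scored strictly worse than the observed score.
    """
    pi = [0.0] * (B + 1)
    pi[0] = 1.0
    for n in range(1, N + 1):
        new = [0.0] * (B + 1)
        remaining = N - n + 1  # genes still unplaced before position n
        for b in range(max(0, n - (N - B)), min(n, B) + 1):
            # Small relative tolerance guards against float rounding ties.
            if hypergeom_tail(N, B, n, b) <= score * (1 + 1e-12):
                continue  # this cutoff would match or beat the observed score
            # Arrive from (n-1, b): the next gene is not annotated.
            p = pi[b] * (remaining - (B - b)) / remaining
            if b >= 1:
                # Arrive from (n-1, b-1): the next gene is annotated.
                p += pi[b - 1] * (B - b + 1) / remaining
            new[b] = p
        pi = new
    return 1.0 - pi[B]


# Toy ranked list: both annotated genes sit at the top of four genes.
score, cutoff = mhg_statistic([1, 1, 0, 0])
pvalue = mhg_pvalue(4, 2, score)
```

In this toy case only one of the six possible rankings reaches the observed score, so the corrected p-value equals 1/6. On longer lists the p-value is generally larger than the raw minimum, which is why the raw minimum should not be reported as a p-value. The sketch recomputes each tail from scratch and is far too slow for thousands of genes; it is meant only to show the structure of the computation.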

CONCLUSION

GOrilla is an efficient GO analysis tool with unique features that make it a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold-free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at: http://cbl-gorilla.cs.technion.ac.il
