Bastos Hugo P, Sousa Lisete, Clarke Luka A, Couto Francisco M
LaSIGE, Departamento de Informática, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.
Departamento de Estatística e Investigação Operacional e Centro de Estatística e Aplicações, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.
PLoS One. 2015 Mar 20;10(3):e0119631. doi: 10.1371/journal.pone.0119631. eCollection 2015.
Functional context for biological sequence is provided in the form of annotations. However, within a group of similar sequences there can be annotation heterogeneity in terms of coverage and specificity. This in turn can introduce issues regarding the interpretation of actual functional similarity and overall functional coherence of such a group. One way to mitigate such issues is through the use of visualization and statistical techniques. Therefore, in order to help interpret this annotation heterogeneity we created a web application that generates Gene Ontology annotation graphs for protein sets and their associated statistics from simple frequencies to enrichment values and Information Content based metrics. The publicly accessible website http://xldb.di.fc.ul.pt/gryfun/ currently accepts lists of UniProt accession numbers in order to create user-defined protein sets for subsequent annotation visualization and statistical assessment. GRYFUN is a freely available web application that allows GO annotation visualization of protein sets and which can be used for annotation coherence and cohesiveness analysis and annotation extension assessments within under-annotated protein sets.
生物序列的功能背景以注释的形式提供。然而,在一组相似序列中,在覆盖范围和特异性方面可能存在注释异质性。这反过来又会引发关于此类序列组实际功能相似性和整体功能连贯性解释的问题。缓解此类问题的一种方法是使用可视化和统计技术。因此,为了帮助解释这种注释异质性,我们创建了一个网络应用程序,该程序可为蛋白质集生成基因本体注释图及其相关统计信息,从简单频率到富集值以及基于信息内容的指标。公开可用的网站http://xldb.di.fc.ul.pt/gryfun/目前接受UniProt登录号列表,以便创建用户定义的蛋白质集,用于后续的注释可视化和统计评估。GRYFUN是一个免费的网络应用程序,可对蛋白质集进行基因本体注释可视化,可用于注释连贯性和凝聚性分析以及注释不足的蛋白质集内的注释扩展评估。