Götz Stefan, García-Gómez Juan Miguel, Terol Javier, Williams Tim D, Nagaraj Shivashankar H, Nueda María José, Robles Montserrat, Talón Manuel, Dopazo Joaquín, Conesa Ana
Bioinformatics Department, Centro de Investigación Principe Felipe, Valencia, Spain.
Nucleic Acids Res. 2008 Jun;36(10):3420-35. doi: 10.1093/nar/gkn176. Epub 2008 Apr 29.
Functional genomics technologies have been widely adopted in the biological research of both model and non-model species. Efficient functional annotation of DNA or protein sequences is a major requirement for the successful application of these approaches, as functional information on gene products is often the key to the interpretation of experimental results. Therefore, there is an increasing need for bioinformatics resources that can cope with large amounts of sequence data, produce valuable annotation results and are easily accessible to laboratories where functional genomics projects are being undertaken. We present the Blast2GO suite as an integrated and biologist-oriented solution for the high-throughput and automatic functional annotation of DNA or protein sequences based on the Gene Ontology vocabulary. The most outstanding Blast2GO features are: (i) the combination of various annotation strategies and tools controlling type and intensity of annotation, (ii) the numerous graphical features such as the interactive GO-graph visualization for gene-set function profiling or descriptive charts, (iii) the general sequence management features and (iv) high-throughput capabilities. We used the Blast2GO framework to carry out a detailed analysis of annotation behaviour through homology transfer and its impact on functional genomics research. Our aim is to offer biologists useful information to take into account when addressing the task of functionally characterizing their sequence data.