Hedeler Cornelia, Wong Han Min, Cornell Michael J, Alam Intikhab, Soanes Darren M, Rattray Magnus, Hubbard Simon J, Talbot Nicholas J, Oliver Stephen G, Paton Norman W
School of Computer Science, The University of Manchester, Manchester, M13 9PL, UK.
BMC Genomics. 2007 Nov 20;8:426. doi: 10.1186/1471-2164-8-426.
The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tends to be distributed in various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species.
To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information has been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows.
The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database. The database is accessible at http://www.e-fungi.org.uk, as is the WSDL for the web services.
已测序的真菌基因组数量不断增加,目前已有约200个基因组完成全测序或测序工作正在进行中。其中只有一小部分基因组得到了全面研究,例如采用功能基因组学技术进行研究。比较分析已被证明是一种有用的策略,有助于增进我们对进化生物学以及了解较少的基因组的认识。然而,这些分析所需的数据往往分布在各种异构数据源中,使得系统的比较研究成为一项繁琐的任务。此外,比较分析受益于派生数据集的紧密整合,这些数据集以某种方式对基因或生物体进行聚类,从而便于表达能够阐明物种间异同点的查询请求。
为支持真菌基因组的系统比较分析,我们开发了电子真菌数据库(e-Fungi database),该数据库整合了30多个真菌基因组的各种数据。公开可用的基因组数据、功能注释和通路信息已被整合到一个单一的数据存储库中,并辅以比较分析的结果,如MCL和OrthoMCL聚类分析,以及信号蛋白预测和蛋白质亚细胞定位预测。为了访问这些数据,可通过网络界面使用一个分析任务库。这些分析任务受到近期比较基因组学研究的推动,旨在支持进化生物学研究以及改进基因组注释的社区工作。每个查询还提供了网络服务,使这些任务能够纳入工作流程。
电子真菌数据库为真菌生物学家提供了一个对大量真菌基因组进行比较研究的资源。其分析库支持对数据库中存储的所有基因组的基因组数据、功能注释和大规模分析结果进行比较研究。该数据库可通过http://www.e-fungi.org.uk访问,网络服务的WSDL也可通过该网址获取。